A complete fullstack application for multimodal AI featuring image + text embeddings, hybrid search, and RAG capabilities. Built with modern technologies and professional architecture.
- Multimodal Embeddings: Generate embeddings from both text and images
- Vision Transformers: State-of-the-art image understanding with ViT models
- Sentence Transformers: Advanced text embeddings
- Hybrid RAG: Combined text + image retrieval augmented generation
- Vector Similarity Search: Find similar content using cosine similarity
- Beautiful UI: Gradient designs with Tailwind CSS
- Three Main Interfaces: Embeddings, Search, and RAG Chat
- Drag & Drop: Easy file uploads with preview
- Real-time Updates: Live search results and AI responses
- Responsive Design: Works perfectly on all devices
- Type Safety: Full TypeScript implementation
- FastAPI: Modern, fast web framework
- ChromaDB: Vector database for efficient similarity search
- Professional Architecture: Clean service-based architecture
- Comprehensive API: RESTful endpoints with OpenAPI documentation
- Error Handling: Robust error handling and logging "