Smart File Finder & Chat Assistant v2 is a powerful Python-based desktop application designed to streamline file management and information retrieval for Chinese-speaking users. With a sleek, Chinese-localized interface, this tool leverages advanced AI to search, summarize, and interact with documents and images.
Built with customtkinter for a modern GUI and Flask for a robust API backend, itβs perfect for professionals, researchers, and teams handling large file collections in Chinese environments. π
Download FileFinderSetup.exe from Releases.
- Semantic File Search π: Find documents (DOCX, PDF, TXT) and images (JPG, PNG) using natural language queries in Chinese.
- AI-Driven Insights π€: Generate embeddings for text and images with models optimized for Chinese content, powered by SentenceTransformers and ChineseCLIP.
- OCR Excellence πΈ: Extract and summarize text from images and PDFs using Tesseract OCR, tailored for Chinese characters.
- Smart Chat Assistant π¬: Engage in contextual conversations or summarize files with a Chinese-focused AI, supporting both Normal and RAG modes.
- Chinese Interface π¨π³: Fully localized UI with intuitive navigation, designed for native Chinese speakers.
- Dark/Light Mode π: Toggle themes for comfortable use in any lighting.
- System Tray Integration π₯οΈ: Minimize to tray with
Alt+Qhotkey to restore. - Continuous Indexing π: Automatically updates file indexes for monitored directories.
To be updated with v2βs Chinese interface.
- Python π: 3.10 or higher
- Tesseract OCR π: For Chinese text extraction
- Windows: Download, add to PATH, ensure
chi_simdata is installed - Linux:
sudo apt install tesseract-ocr tesseract-ocr-chi-sim - macOS:
brew install tesseract
- Windows: Download, add to PATH, ensure
- Poppler π: For PDF processing
- Windows: Download, add to PATH
- Linux:
sudo apt install poppler-utils - macOS:
brew install poppler
# Clone the Repository π₯
git clone https://github.com/BotirBakhtiyarov/Filefinder.git
cd Filefinder
# Set Up Virtual Environment π οΈ
python -m venv .venv
source .venv/bin/activate # Linux/macOS
.venv\Scripts\activate # Windows
# Install Dependencies π¦
pip install -r requirements.txtUpdate MODEL_DIR in api_server.py:
MODEL_DIR = "models/" # Path to Chinese-optimized modelsMake sure models like:
paraphrase-multilingual-MiniLM-L12-v2chinese-clip-vit-base-patch16
are available or downloaded.
# Start the API server π
python api_server.py
# Run the main application π
python app.pyA Chinese-language setup wizard will guide you to:
- Enter API keys/URLs
- Set document/image directories
- Choose theme (Dark/Light)
- Go to the βζη΄’β tab.
- Enter Chinese queries (e.g.,
εΉ΄εΊ¦ζ₯εζ¦θΏ°). - View results and:
- ζεΌ: Open file
- ζθ¦: AI-generated summary
- OCR ζθ¦: OCR-based summary for images
- Go to the βθ倩β tab
- Select:
- Normal Mode: For general chat
- RAG Mode: Upload and ask about specific files
- Ask questions in Chinese and get summarized answers
- Minimize to tray by closing window
- Press Alt+Q to restore
FileFinder_v2/
βββ .venv/ # Virtual environment
βββ assets/
β βββ ui/ # Chinese-localized GUI
β β βββ chat_frame.py
β β βββ main_app.py
β β βββ search_frame.py
β β βββ setup_wizard.py
β β βββ summary_window.py
β βββ utils/
β βββ file_utils.py
βββ app.py # App entry point
βββ api_server.py # Flask backend
βββ requirements.txt # Dependencies
βββ myicon.ico # App icon
api_server.py includes:
/embed_text: Embed Chinese text/embed_image: Embed image/embed_clip_text: CLIP-based text-image embedding/extract_pdf_with_ocr: OCR for Chinese PDFs/extract_image_ocr: OCR for Chinese images
We welcome contributions! π
# Fork and branch
git checkout -b feature/YourFeature
# Commit and push
git commit -m "Add YourFeature"
git push origin feature/YourFeature- Follow PEP 8
- Maintain Chinese localization
- File Access Errors: Verify file paths and network drives.
- OCR Issues: Make sure Tesseract is installed with
chi_simsupport. - API Failures: Confirm
api_server.pyis running and keys are set. - UI Bugs: Check
customtkintercompatibility.
Open an issue on GitHub with logs if needed.
MIT License β see LICENSE for details.
- CustomTkinter: Modern GUI framework
- SentenceTransformers: Chinese text embeddings
- Transformers: ChineseCLIP integration
- Tesseract OCR: Robust OCR engine
Built with β€οΈ for Chinese-speaking users, AI File Finder v2 is your ultimate file management companion!