PDFChat-RAG

A simple example of a Retrieval-Augmented Generation (RAG) application with basic functionality.
PDFChat-RAG is a modular, containerized application that enables semantic search and chat over PDF documents using RAG. Built with Flask, Celery, Redis, Ollama, ChromaDB, and MongoDB, it provides an end-to-end pipeline for uploading PDF documents and conversing with them using local language models.

🚀 Features

📄 Upload PDFs and extract embedded text
🔍 Semantic search over vector embeddings using ChromaDB
💬 Chat interface for querying document content via RAG
🧠 LLM integration using Ollama
🧰 Background processing with Celery & Redis
🗂️ Metadata storage in MongoDB
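
The retrieval step behind these features can be illustrated with a minimal, dependency-free sketch. It is not the project's implementation: the word-count `embed` function is a toy stand-in for an embedding model served by Ollama, the in-memory ranking is a stand-in for a ChromaDB query, and all function names and chunk sizes here are hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text, size=40, overlap=10):
    """Split text into overlapping word windows (sizes are illustrative)."""
    words = text.split()
    if not words:
        return []
    step = size - overlap
    stop = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, stop, step)]


def embed(text):
    """Toy embedding: lowercase word counts (assumption, not a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query, chunks, k=2):
    """Assemble a prompt that grounds the LLM in the retrieved chunks."""
    context = "\n---\n".join(retrieve(query, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

In a real deployment, `embed` and `retrieve` would be replaced by calls to the embedding model and the vector store, and the string returned by `build_prompt` would be sent to the LLM.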

🧱 Architecture

🐳 Dockerized Services

Container	Role
`flask-app`	PDF upload, RAG chat UI/API
`ollama`	Hosts and serves LLM models
`celery`	Runs background tasks
`redis`	Pub/sub and Celery broker
`chromadb`	Stores and queries vector embeddings
`mongodb`	Stores app metadata and user sessions
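
As a rough sketch of how the `flask-app` container might locate the other services, the snippet below reads connection settings from environment variables. The variable names are hypothetical, the hostnames are simply the container names from the table, and the ports are each tool's usual default; the values in the repository's `docker-compose.yml` may differ.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceConfig:
    ollama_url: str
    redis_url: str
    chroma_host: str
    chroma_port: int
    mongo_uri: str


def load_config(env=None):
    """Build service settings from environment variables.

    All variable names and defaults below are assumptions for illustration.
    """
    env = os.environ if env is None else env
    return ServiceConfig(
        ollama_url=env.get("OLLAMA_URL", "http://ollama:11434"),
        redis_url=env.get("REDIS_URL", "redis://redis:6379/0"),
        chroma_host=env.get("CHROMA_HOST", "chromadb"),
        chroma_port=int(env.get("CHROMA_PORT", "8000")),
        mongo_uri=env.get("MONGO_URI", "mongodb://mongodb:27017"),
    )
```

Keeping these values in one place makes it easier to run the Flask app outside Docker by pointing the variables at `localhost`.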

Clone the Repo

git clone https://github.com/darshanz/pdfchat-rag.git
cd pdfchat-rag

Note: Create directories on the host machine for storing the uploaded files and map them to the `flask-app` Docker container. Similarly, map host directories for mongodb and ollama-models.
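
The host directories can be created by hand or with a small helper such as the sketch below. The base path and directory names are hypothetical; they should match whatever volume paths are configured in `docker-compose.yml`.

```python
from pathlib import Path

# Hypothetical directory names; align them with the volume mappings
# in docker-compose.yml before use.
DATA_DIRS = ("uploads", "mongodb", "ollama-models")


def create_data_dirs(base="./data", names=DATA_DIRS):
    """Create one host directory per service volume and return the paths."""
    created = []
    for name in names:
        path = Path(base) / name
        # exist_ok=True makes the helper safe to run repeatedly.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling `create_data_dirs()` once before the first `docker-compose up` ensures the mapped paths exist on the host.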

Start the App

docker-compose up --build

This will start all containers. Access the app at http://localhost:5000
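
The containers can take a while to become ready, especially on a first build. The following sketch polls the app URL until it responds; the function name, retry count, and delay are illustrative choices, not part of the project.

```python
import time
import urllib.error
import urllib.request


def wait_for_app(url="http://localhost:5000", attempts=30, delay=2.0):
    """Return True once the URL gives any HTTP response, else False."""
    for _ in range(attempts):
        try:
            with urllib.request.urlopen(url, timeout=5):
                return True
        except urllib.error.HTTPError:
            # The server answered with an error status, so it is running.
            return True
        except (urllib.error.URLError, OSError):
            # Connection refused or timed out: wait and retry.
            time.sleep(delay)
    return False
```

For example, `wait_for_app()` can be called from a script right after `docker-compose up --build -d` to block until the UI is reachable.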

TODO

Include support for multiple files
User login/auth
Save chat history for a given PDF
Extract images and tables from the document and include them in chat responses

📜 License

This project is licensed under the MIT License.
