This project is a simple setup for RAG (Retrieval Augmented Generation) using a local LLM.
It serves as a good starting point for anyone looking to build chatbots over a custom corpus, and it can be easily adapted for more complex uses.
The main application is a simple text input allowing the user to prompt a local Ollama model.
Plain text files can be added to the content directory to provide context to the LLM.
The model then attempts to find content relevant to the user's prompt and uses it to generate a response.
This project uses devenv to ensure that all necessary dependencies and configuration are present while in the project's development environment.
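To give a sense of how the pieces fit together, below is a minimal sketch of the retrieval-and-generation flow using the `ollama` and `chromadb` Python packages. The collection name, ChromaDB path, and similarity threshold are illustrative assumptions, not necessarily the exact values the backend uses.

```python
# Rough sketch of the RAG query flow (illustrative names, not the backend's exact code).
import ollama
import chromadb

# Assumed persistent ChromaDB store and collection name.
client = chromadb.PersistentClient(path="chroma")
collection = client.get_or_create_collection(name="docs")

def answer(prompt: str, threshold: float = 0.5) -> str:
    # Embed the user's prompt with the same model used to index the corpus.
    query_embedding = ollama.embeddings(model="nomic-embed-text", prompt=prompt)["embedding"]

    # Retrieve the most similar chunks from the indexed content.
    results = collection.query(query_embeddings=[query_embedding], n_results=3)

    # Keep only chunks whose distance falls under the (assumed) relevance threshold.
    context = [
        doc
        for doc, dist in zip(results["documents"][0], results["distances"][0])
        if dist < threshold
    ]

    # Ask the local llama3.1 model, grounding it in the retrieved context.
    response = ollama.chat(
        model="llama3.1",
        messages=[
            {"role": "system", "content": "Answer using this context:\n" + "\n---\n".join(context)},
            {"role": "user", "content": prompt},
        ],
    )
    return response["message"]["content"]
```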
- Globally install Ollama and the llama3.1 model, and set up embeddings with `ollama run nomic-embed-text`.
- Install devenv by following its Getting Started guide.
- Clone this repo.
- In the root of the project, run `devenv shell` to activate the development environment, which will install all project dependencies and drop you into an isolated bash shell. Neovim and some other utilities are present for convenience. The environment is hermetic except for the `$HOME` environment variable, so that the globally installed Ollama model can be accessed.
- Copy the text files you wish to provide as context into the content directory.
- Run `python -m backend`. If the ChromaDB directory is not present yet, embeddings will be generated for all the context you've uploaded (a rough sketch of this indexing step follows this list).
- In another shell, re-enter the development environment with `devenv shell`, then run `devenv up` to start the Ollama server, which needs to be running so the chat CLI can query it for responses.
- Back in the first devenv shell, run `python -m backend` to enter the chat CLI.
- Prompt the model about your data! Play around with the model you're using, adjust the similarity relevance threshold, try using files other than plain text, etc.
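As a reference for the indexing step above, generating embeddings for the content directory might look roughly like the sketch below. The chunking strategy, file glob, and ChromaDB path are assumptions for illustration, not necessarily what the backend does.

```python
# Rough sketch of indexing the content directory into ChromaDB (illustrative, not the backend's exact code).
from pathlib import Path

import ollama
import chromadb

client = chromadb.PersistentClient(path="chroma")  # assumed on-disk location
collection = client.get_or_create_collection(name="docs")

for path in Path("content").glob("*.txt"):
    text = path.read_text(encoding="utf-8")

    # Naive chunking by paragraph; a real pipeline might split by token count instead.
    chunks = [chunk.strip() for chunk in text.split("\n\n") if chunk.strip()]

    for i, chunk in enumerate(chunks):
        # Embed each chunk with the same model the chat CLI uses for queries.
        embedding = ollama.embeddings(model="nomic-embed-text", prompt=chunk)["embedding"]
        collection.add(
            ids=[f"{path.name}-{i}"],
            embeddings=[embedding],
            documents=[chunk],
        )
```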
Side Note: The more hermetic (isolated) your development environments are from your global system, the more portable your project will be to other people's machines. This is good practice, but if that isn't a concern of yours, you can easily scrap all the devenv setup and run this project with whichever method you prefer for setting up Python projects.