In this project, we conduct a simple experiment to investigate whether an LLM can recognize its own output.
The experiment is divided into two phases:
- **Phase 1**: We feed prompt inputs to three different LLMs (OpenAI GPT-4, Llama 2, and Mixtral 8x7B) and save the results to files.
- **Phase 2**: We build a classification query asking each LLM to determine which of the outputs from Phase 1 is its own (a sketch of such a query follows this list).
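As a rough illustration of the Phase 2 query, something like the following could assemble the prompt. The exact wording and file handling in `classify_results.py` may differ; the function name and prompt text here are hypothetical:

```python
# Hypothetical sketch of a Phase 2 classification query; the actual prompt
# used in classify_results.py may be worded differently.
def build_classification_query(outputs: list[str]) -> str:
    """List the anonymized Phase 1 outputs and ask the model to pick its own."""
    numbered = "\n\n".join(
        f"Output {i + 1}:\n{text}" for i, text in enumerate(outputs)
    )
    return (
        "Below are responses produced by three different LLMs for the same "
        "prompt. Exactly one of them was written by you. "
        "Answer with the number of the output you believe is yours.\n\n"
        + numbered
    )
```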
LLMCaller is a wrapper for calling the various LLMs dynamically. Under the hood, the class uses the Ollama platform to connect locally with the Llama 2 and Mixtral models, and LangChain to connect remotely with OpenAI GPT-4.
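A minimal sketch of what such a wrapper might look like, assuming the `langchain-openai` and `requests` packages and Ollama's default local endpoint; the class name comes from the repo, but the method names and structure below are illustrative:

```python
import requests
from langchain_openai import ChatOpenAI  # assumes the langchain-openai package

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint


class LLMCaller:
    """Illustrative wrapper: routes a prompt to a local Ollama model or to GPT-4."""

    def __init__(self, model: str):
        self.model = model

    def call(self, prompt: str) -> str:
        if self.model == "gpt-4":
            # Remote call via LangChain; reads OPENAI_API_KEY from the environment.
            return ChatOpenAI(model="gpt-4").invoke(prompt).content
        # Local call via the Ollama REST API (e.g. model="llama2" or "mixtral").
        resp = requests.post(
            OLLAMA_URL,
            json={"model": self.model, "prompt": prompt, "stream": False},
            timeout=300,
        )
        resp.raise_for_status()
        return resp.json()["response"]
```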
- To run this project, you need to have `pipenv` installed:
  - On Linux: `sudo apt install pipenv`
  - On Mac: `brew install pipenv`
- If you don't have Ollama already, you can download it here
- Add environment variables:
  - `cp .env.example .env` to copy the environment file
  - Add your OpenAI API key to the `.env` file
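  - The key entry is typically a single line such as `OPENAI_API_KEY=sk-...` (the variable name is assumed from LangChain's convention; check `.env.example` for the exact name)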
- Install packages:
  - `pipenv shell` to enter the virtual environment
  - `pipenv install` to install the Python packages listed in `Pipfile`
- Run the Ollama server:
  - `ollama serve` to run the Ollama server locally and handle Llama 2 and Mixtral prompt requests
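  - If Llama 2 and Mixtral are not already available locally, you may first need to fetch them with `ollama pull llama2` and `ollama pull mixtral`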
- In another shell, run the scripts:
  - `python3 generate_results.py` to run Phase 1
  - `python3 classify_results.py` to run Phase 2
The classification result can be found here.

A more detailed discussion of this project can be found in this Medium blog post.