QCluster

A Python library for clustering customer questions using large language models.
Explore the docs »

Report Bug · Request Feature

Table of Contents

🎯 About The Project
🚀 Getting Started
- 🛠️ Prerequisites
- 📦 Installation
▶️ Usage
📄 License

🎯 About The Project

QCluster is a powerful Python library designed to help you make sense of large volumes of customer feedback. By leveraging the power of Large Language Models (LLMs), QCluster can automatically group similar customer questions, allowing you to identify trends, pain points, and frequently asked questions with ease.

This project provides a complete pipeline for:

Extracting customer questions from your data sources.
Generating embeddings for each question.
Clustering the questions based on their semantic similarity.
Evaluating the quality of the clustering results.
Generating insightful reports.

🚀 Getting Started

Follow these simple steps to get your local copy of QCluster up and running.

🛠️ Prerequisites

This project was tested on macOS with Apple Silicon, but it should work on other systems as well.

Python 3.12+
uv: A fast Python package installer and resolver.
ollama: Run large language models locally.
- You will also need the qwen2.5:3b model, but you can configure other models as well.

📦 Installation

Clone the repo

git clone https://github.com/dbudaghyan/qcluster.git
cd qcluster

Set up the environment variables
```
cp .env.example .env
```
You can modify the .env file to change the default settings.
Install ollama
- Using Homebrew (on macOS):
```
brew install ollama
```
- Or download the binary directly from the official website.
Pull the LLM model
```
ollama pull qwen2.5:3b
```
If you have defined other models in your .env file, make sure to pull them as well.
Start the ollama server
```
ollama serve
```
Install the Python dependencies
```
uv sync
```

▶️ Usage

You can run the clustering pipeline either as a simple Python script or through a Jupyter Notebook for a more interactive experience.

Option 1: Python Script

uv run qcluster.pipeline

Option 2: Jupyter Notebook

Add the project root to the Python path:

export PYTHONPATH=$(pwd)

Then run Jupyter Lab:

uv run --with jupyter jupyter-lab

The notebook is located at notebooks/pipeline.ipynb. The reports will be saved in the EVALUATION_RESULTS_DIR defined in the .env file.

TL;DR

cd qcluster
cp .env.example .env
# Modify the .env file if needed
brew install ollama
ollama pull qwen2.5:3b
# pull other models if needed (if defined in the .env file)
ollama serve
uv sync
uv run qcluster.pipeline
# or
# export PYTHONPATH=$(pwd)
# uv run --with jupyter jupyter-lab
# and open notebooks/pipeline.ipynb and run the cells

The reports will be saved in the EVALUATION_RESULTS_DIR defined in the .env file. See the Evaluation Results for more details on the results structure.

📄 License

Distributed under the GPL-2.0 License. See LICENSE for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.github/workflows		.github/workflows
data		data
docs		docs
evaluation_results/20250616-103219-c1852d7		evaluation_results/20250616-103219-c1852d7
notebooks		notebooks
qcluster		qcluster
tests/test_qcluster		tests/test_qcluster
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QCluster

🎯 About The Project

🚀 Getting Started

🛠️ Prerequisites

📦 Installation

▶️ Usage

Option 1: Python Script

Option 2: Jupyter Notebook

TL;DR

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

QCluster

🎯 About The Project

🚀 Getting Started

🛠️ Prerequisites

📦 Installation

▶️ Usage

Option 1: Python Script

Option 2: Jupyter Notebook

TL;DR

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages