They said it couldn't be done. We did it.
They said the sauce was secret. We figured it out.
In 1000+ head to head tests against the Big Boys. We win. Every. Single. Time.
Better content. Better relevancy. Better context.
Don't take our word for it. It's free and open source. Run your own head to head benchmarks and see for yourself.
Code context retrieval that actually works. 94% accuracy. Self-learning. Runs anywhere.
Game. Over.
pip install ace-frameworkfrom ace import UnifiedRetriever
retriever = UnifiedRetriever()
results = retriever.retrieve("your query")| Metric | Value |
|---|---|
| Accuracy | 94% |
| Test Queries | 1,000 |
| Response Time | <200ms |
LinUCB HyDE HDBSCAN Cross-Encoder BM25 Qdrant AST-Chunking Semantic-Dedup Confidence-Decay MiniLM Voyage
- Local: Ollama, LMStudio, any embedding model
- Cloud: OpenAI, Voyage, Gemini
- IDE: MCP server for VS Code, Cursor, Claude
| Component | Recommendation | Notes |
|---|---|---|
| Embeddings (Text) | Qwen3-Embedding-8B (4096d) | Local via LM Studio, ~8GB VRAM |
| Embeddings (Code) | Voyage-code-3 | API, optimized for code |
| Vector DB | Qdrant | Local or cloud, free tier available |
| LLM | Any | Ollama, LM Studio, OpenAI, Gemini |
# Qdrant (Docker)
docker run -p 6333:6333 qdrant/qdrant
# LM Studio
# 1. Download: lmstudio.ai
# 2. Load: Qwen3-Embedding-8B
# 3. Start server on port 1234HyDE - Hypothetical Document Embeddings for query expansion
LinUCB Bandit - Learns which retrieval strategies work for your data
HDBSCAN Dedup - Kills near-duplicate chunks
Confidence Decay - Old memories fade, fresh data wins
AST Chunking - Code-aware splitting that doesn't break functions
Cross-Encoder Reranking - Precision filtering after retrieval
Self-Learning Memory - Cross-workspace patterns + project-specific knowledge. The one-two punch.
Like it? Star. Ideas? Contribute.
Research Foundation:
- Agentic Context Engineering - UC Berkeley & SambaNova Systems (Zhang et al.)
- Dynamic Cheatsheet methodology
- LinUCB Algorithm - Li et al. (2010)
Code Inspirations:
- Kayba.ai - Original implementation
- m1rl0k/Context-Engine - AST chunking, ReFRAG
- ELF - Confidence decay, golden rules
- r/Rag community - Memory architecture discussions
Built On:
- Qdrant - Vector search
- Sentence Transformers - MiniLM, Cross-Encoders
- Voyage AI - Embeddings
- HDBSCAN - Clustering
- LiteLLM - Multi-provider LLM
- FastMCP - MCP server framework
MIT. Do what you want.