Wax is a high-performance, single-file memory layer for AI agents on Apple platforms.
On-device, private, and portable — no server, no cloud, zero infrastructure.
[English](README.md) | [Español](locales/README.es.md) | [Français](locales/README.fr.md) | [日本語](locales/README.ja.md) | [한국어](locales/README.ko.md) | [Português](locales/README.pt.md) | [中文](locales/README.zh-CN.md)
Wax is a Swift-native persistence engine designed for the next generation of AI agents. It encapsulates documents, high-dimensional embeddings, and structured knowledge into a single, portable `.wax` file.
Unlike traditional databases that require complex setups or cloud dependencies, Wax provides a unified memory layer that lives entirely on-device, leveraging Metal-accelerated inference for sub-10ms recall latency.
| Feature | Wax | SQLite (FTS5) | Cloud Vector DBs |
|---|---|---|---|
| Search | Hybrid (Text + Vector) | Text Only* | Vector Only* |
| Latency | ~6ms (p95) | ~10ms (p95) | 150ms - 500ms+ |
| Privacy | 100% Local | 100% Local | Cloud-hosted |
| Setup | Zero Config | Low | Complex (API Keys) |
| Architecture | Apple Silicon Native | Generic | Varies |
Most RAG systems require a database, a vector store, and a file server. Wax bundles everything—documents, metadata, and high-dimensional indices—into one portable binary.
- Zero Infrastructure: No Docker, no DB setup, no cloud bill.
- Truly Portable: AirDrop your agent's memory to another Mac, or sync it via iCloud.
- Atomic: One file to backup, one file to version control, one file to delete.
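Because the entire store is one file, a backup is an ordinary file copy. A minimal sketch using `FileManager` (the `agent.wax` and `agent-backup.wax` paths are illustrative; any `.wax` store is just a regular file on disk):

```swift
import Foundation

// Illustrative paths; a .wax store is an ordinary file on disk.
let store = URL(fileURLWithPath: "agent.wax")
let backup = URL(fileURLWithPath: "agent-backup.wax")

// Create a placeholder store so this snippet runs standalone.
try Data("demo".utf8).write(to: store)

// Copying the single file captures documents, indices, and metadata together.
if FileManager.default.fileExists(atPath: backup.path) {
    try FileManager.default.removeItem(at: backup)
}
try FileManager.default.copyItem(at: store, to: backup)
```

The same property is what makes AirDrop and iCloud sync work without any export step: the file on disk is the whole memory.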
Wax is tuned for the M-series architecture, providing near-instantaneous recall even with large-scale local indices.
Lower is better. Measured in milliseconds.

```
Wax (Hybrid)  |██ 6.1ms
SQLite (Text) |████ 12ms
Cloud RAG     |██████████████████████████████████████████████████ 150ms+
```

Lower is better. Measured in milliseconds.

```
Wax         |███ 9.2ms
Traditional |██████████████████████████████████████ 120ms+
```
> [!TIP]
> Ingest Throughput: Wax handles 85.9 docs/s with full hybrid indexing on an M3 Max. Full benchmark report: [docs/benchmarks/2026-03-06-performance-results.md](docs/benchmarks/2026-03-06-performance-results.md)
Wax uses a "Database of Databases" model. It manages its own frame-based storage format while embedding specialized search engines (SQLite FTS5 and Metal-accelerated HNSW) as serialized blobs within the main file.
```
┌──────────────────────────────────────────────────────────────────────────┐
│                         Dual Header Pages (A/B)                          │
│     (Magic, Version, Generation, Pointers to WAL & TOC, Checksums)       │
├──────────────────────────────────────────────────────────────────────────┤
│                           WAL (Write-Ahead Log)                          │
│      (Atomic ring buffer for crash-resilient uncommitted mutations)      │
├──────────────────────────────────────────────────────────────────────────┤
│                          Compressed Data Frames                          │
│  ┌──────────────────┐  ┌──────────────────┐  ┌──────────────────┐        │
│  │  Frame 0 (LZ4)   │  │  Frame 1 (LZ4)   │  │  Frame 2 (LZ4)   │  ...   │
│  │  [Raw Document]  │  │ [Metadata/JSON]  │  │  [System Info]   │        │
│  └──────────────────┘  └──────────────────┘  └──────────────────┘        │
├──────────────────────────────────────────────────────────────────────────┤
│                          Hybrid Search Indices                           │
│  ┌──────────────────────────────┐  ┌──────────────────────────────┐      │
│  │       SQLite FTS5 Blob       │  │       Metal HNSW Index       │      │
│  │  (Text Search + EAV Facts)   │  │       (Vector Search)        │      │
│  └──────────────────────────────┘  └──────────────────────────────┘      │
├──────────────────────────────────────────────────────────────────────────┤
│                         TOC (Table of Contents)                          │
│  (Index of all frames, parent-child relations, and engine manifests)     │
└──────────────────────────────────────────────────────────────────────────┘
```
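The dual header pages (A/B) in the diagram follow a classic crash-safe pattern: each commit writes the inactive slot with an incremented generation counter and a checksum, and on open the reader picks the valid header with the highest generation. A hedged sketch of the selection logic (field names and the toy checksum are illustrative, not Wax's actual on-disk layout):

```swift
import Foundation

// Illustrative header model; the real headers also carry magic, version,
// and pointers to the WAL and TOC.
struct Header {
    var generation: UInt64
    var payload: Data      // header body covered by the checksum
    var checksum: UInt32   // stored checksum of `payload`
}

// Toy checksum standing in for whatever the format actually uses.
func checksum(_ data: Data) -> UInt32 {
    data.reduce(UInt32(5381)) { ($0 &* 33) &+ UInt32($1) }
}

func isValid(_ h: Header) -> Bool { checksum(h.payload) == h.checksum }

// Pick the newest header that passes its checksum. A torn write corrupts
// at most one slot, so the other slot still opens the store.
func selectHeader(a: Header, b: Header) -> Header? {
    [a, b].filter(isValid).max(by: { $0.generation < $1.generation })
}
```

With this scheme a crash mid-commit simply loses the in-flight generation; the previous consistent state remains reachable through the untouched slot.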
- Atomic Resilience: Dual headers and the WAL ensure that even if the process crashes mid-write, the store remains consistent.
- Unified Retrieval: A single query triggers parallel execution across the BM25 (text) and HNSW (vector) engines.
- Structured Knowledge: Built-in EAV (Entity-Attribute-Value) storage for persistent facts and long-term reasoning.
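One common way to merge BM25 and HNSW result lists into a single ranking is reciprocal rank fusion (RRF). Wax's actual fusion strategy is not specified here, but the general idea can be sketched as:

```swift
// Reciprocal rank fusion: score(doc) = Σ 1 / (k + rank) over each list.
// `k` dampens the influence of top ranks; 60 is a conventional default.
func fuseRRF(_ rankings: [[String]], k: Double = 60) -> [String] {
    var scores: [String: Double] = [:]
    for ranking in rankings {
        for (rank, id) in ranking.enumerated() {
            scores[id, default: 0] += 1 / (k + Double(rank + 1))
        }
    }
    return scores.sorted { $0.value > $1.value }.map(\.key)
}

// Documents ranked highly by both the text and vector engines rise to the top.
let textHits   = ["doc-1", "doc-3", "doc-7"]   // BM25 order (illustrative IDs)
let vectorHits = ["doc-1", "doc-9", "doc-3"]   // HNSW order (illustrative IDs)
let fused = fuseRRF([textHits, vectorHits])
```

Rank-based fusion like this needs no score normalization, which is convenient when one engine returns BM25 scores and the other returns cosine distances.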
Copy and paste this into a `main.swift` file to get started immediately.

```swift
import Foundation
import Wax
import WaxVectorSearchMiniLM

@main
struct AgentMemory {
    static func main() async throws {
        let url = URL(fileURLWithPath: "agent.wax")

        // 1. Initialize a memory store (on-device MiniLM embeddings)
        let memory = try await MemoryOrchestrator.openMiniLM(at: url)

        // 2. Commit a new memory
        try await memory.remember("The user is building a habit tracker in SwiftUI.")

        // 3. Perform semantic recall
        let context = try await memory.recall(query: "What is the user building?")
        if let bestMatch = context.items.first {
            print("Recall: \(bestMatch.text)")
            // Output: "Recall: The user is building a habit tracker in SwiftUI."
        }
    }
}
```

Looking to store persistent facts and long-term reasoning? See Structured Memory.
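The EAV model behind structured memory stores each fact as an (entity, attribute, value) triple. A minimal in-memory illustration of the shape of such a store (this is a sketch of the concept, not Wax's actual API; Wax persists its facts inside the SQLite blob):

```swift
// A fact is an entity-attribute-value triple.
struct Fact: Hashable {
    let entity: String
    let attribute: String
    let value: String
}

struct FactStore {
    private var facts: Set<Fact> = []

    mutating func add(_ entity: String, _ attribute: String, _ value: String) {
        facts.insert(Fact(entity: entity, attribute: attribute, value: value))
    }

    // Query by entity + attribute, returning all matching values.
    func values(entity: String, attribute: String) -> [String] {
        facts.filter { $0.entity == entity && $0.attribute == attribute }
             .map(\.value)
    }
}

var memory = FactStore()
memory.add("user", "project", "habit tracker")
memory.add("user", "framework", "SwiftUI")
let projects = memory.values(entity: "user", attribute: "project")
```

The triple form is what lets an agent accumulate discrete, queryable facts over time instead of relying solely on free-text recall.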
```swift
dependencies: [
    .package(url: "https://github.com/christopherkarani/Wax.git", from: "0.1.8")
]
```

Wax provides a first-class Model Context Protocol (MCP) server. Connect your local memory to Claude Code or any MCP-compatible agent.
```bash
npx -y waxmcp@latest mcp install --scope user
```

A semantic search TUI for your git history. Index any repository and find code or commits using natural language.
```bash
# From within any git repo
wax-repo index
wax-repo search "where did we implement the WAL?"
```

Wax is released under the Apache License 2.0. See LICENSE for details.