Memory Caching

Community PyTorch implementation and reproduction scaffold for Memory Caching: RNNs with Growing Memory.

Installable runtime modules, explicit claim boundaries, model-backed scientific artifacts, and publication-grade reproduction tooling live in one repository, with the stable package surface kept intentionally narrow.

Project Overview

memory_caching/
├── src/memory_caching/       Stable package surface published as memory-caching
│   ├── layer.py              Memory Caching wrapper
│   ├── backends/             Linear, DLA, Titans, SWLA(c=2)
│   ├── bench/                Benchmark adapters, runners, manifests
│   ├── models.py             Tiny model-backed scientific path
│   └── scientific_manifest.py Scientific artifact truthfulness checks
├── configs/                  Train + benchmark + baseline-tracking configs
├── docs/                     Reproduction, release, API, and claim-boundary docs
├── examples/                 Stable public examples
├── scripts/                  Train / eval / gate / packaging entrypoints
└── tests/                    Backend, API, benchmark, and release-path coverage

At a Glance

Area	Current State
Stable runtime package	`memory-caching==0.1.0` on PyPI
Wrapper mechanisms	Residual / GRM / Soup / SSC
Segmentation	Constant and logarithmic
Backends	`linear`, `dla`, `titans`, `swla(c=2)`
Scientific artifact path	Model-backed, truthful manifests, non-smoke targets
Public release status	Publishable package surface with explicit release preflight
Full paper parity	Still blocked by incomplete baseline evidence and larger parity gaps
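
The two segmentation modes listed in the table can be illustrated with a small standalone sketch. It is not the repository's implementation: `constant_segments` and `logarithmic_segments` are hypothetical names, and the logarithmic rule shown here (power-of-two span lengths taken from the binary expansion of the sequence length) is an assumption chosen only to show how the number of cached segments can grow logarithmically rather than linearly.

```python
def constant_segments(seq_len, segment_size):
    """Split [0, seq_len) into consecutive spans of at most segment_size tokens."""
    return [
        (start, min(start + segment_size, seq_len))
        for start in range(0, seq_len, segment_size)
    ]


def logarithmic_segments(seq_len):
    """Assumed rule: power-of-two span lengths from the binary expansion of seq_len."""
    spans = []
    start = 0
    # Walk the bits from most to least significant so the largest span comes first.
    for bit in reversed(range(seq_len.bit_length())):
        length = 1 << bit
        if seq_len & length:
            spans.append((start, start + length))
            start += length
    return spans


constant_example = constant_segments(10, 4)
logarithmic_example = logarithmic_segments(13)
print(constant_example)
print(logarithmic_example)
```

Under this assumed rule, a sequence of length T yields at most about log2(T) + 1 spans, while constant segmentation yields about T / segment_size spans.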

Project Status

Scope	Status
Stable public PyTorch package	`Active`
Mechanism-faithful MC wrapper implementation	`Implemented`
Engineering scaffold and packaging integrity	`Green`
Scientific gate with model-backed evidence	`Green`
Full table-level paper parity	`Blocked by missing baselines`

This is not official author code. See reproduction_report.md, CLAIM_TO_EVIDENCE_MATRIX.md, and PAPER_PARITY_BLOCKERS.md for the exact claim surface.

Quickstart

Option A: `uv` from source

uv sync --extra dev
uv run mc list-variants
uv run mc smoke-eval --backend linear --device cpu --warmup-steps 1 --batch-size 1 --seq-len 8 --vocab-size 16 --d-model 8 --num-heads 2

Option B: `pip` editable install

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -e ".[dev]"
mc list-variants

Option C: PyPI

python -m pip install memory-caching
python -c "from memory_caching import MCConfig, MemoryCachingLayer; print('ok')"

Torch / CUDA note

CPU-only example:
- python -m pip install torch --index-url https://download.pytorch.org/whl/cpu
CUDA 12.1 example:
- python -m pip install torch --index-url https://download.pytorch.org/whl/cu121

Before running CUDA workflows, install a torch build that matches your local CUDA runtime and driver stack.
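
One way to confirm which torch build is active before a CUDA workflow is a short check like the sketch below. The helper name `describe_torch_build` is hypothetical; the torch attributes it reads (`torch.__version__`, `torch.version.cuda`, `torch.cuda.is_available()`) are standard PyTorch, and the function returns a placeholder result when torch is not installed.

```python
import importlib.util


def describe_torch_build():
    """Report the installed torch build, or note that torch is missing."""
    if importlib.util.find_spec("torch") is None:
        return {"installed": False}

    import torch

    return {
        "installed": True,
        "version": torch.__version__,
        # None for CPU-only wheels, otherwise the CUDA version torch was built against.
        "cuda_build": torch.version.cuda,
        # True only if a usable CUDA device and driver are visible at runtime.
        "cuda_available": torch.cuda.is_available(),
    }


torch_build_info = describe_torch_build()
print(torch_build_info)
```

If `cuda_build` is set but `cuda_available` is false, the wheel was built for CUDA but no usable device or driver was found, which usually points to a driver or runtime mismatch.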

Stable Package Boundary

The supported top-level runtime imports are:

memory_caching.MCConfig
memory_caching.MemoryCachingLayer
memory_caching.SegmentCache
memory_caching.LinearMemoryBackend
memory_caching.DLABackend
memory_caching.TitansBackend
memory_caching.SWLABackend
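
A quick way to check that an installed copy exposes these names is sketched below. The helper `missing_stable_names` is hypothetical and only tests for attribute presence; it makes no assumption about constructor signatures, and it returns `None` when the package is not installed.

```python
import importlib
import importlib.util

# Names copied from the supported top-level import list above.
STABLE_NAMES = (
    "MCConfig",
    "MemoryCachingLayer",
    "SegmentCache",
    "LinearMemoryBackend",
    "DLABackend",
    "TitansBackend",
    "SWLABackend",
)


def missing_stable_names(module_name="memory_caching"):
    """Return the stable names absent from the module, or None if it is not installed."""
    if importlib.util.find_spec(module_name) is None:
        return None
    module = importlib.import_module(module_name)
    return [name for name in STABLE_NAMES if not hasattr(module, name)]


result = missing_stable_names()
if result is None:
    print("memory_caching is not installed in this environment")
else:
    print("missing names:", result)
```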

For runtime use, prefer:

layer(x) for the normal forward path
layer.forward_with_cache(x) when cached segment checkpoints are needed
layer.inspect(x) when per-token routing/debug rows are needed

Repo tooling is intentionally broader than the public package API. CLI wiring, smoke helpers, benchmark runners, release gates, and report-generation scripts remain repo-level tooling rather than stable semver-tracked runtime surface.

Full API notes:

PUBLIC_API.md

Namespaced experimental/reference modules now present in the package:

memory_caching.baselines.LogLinearPP
memory_caching.loglinear.LogLinearAttentionReference
memory_caching.loglinear.ChunkedLogLinearAttentionReference

Documentation Navigator

Start here

Core docs

Topic	Link	Purpose
Documentation index	docs/README.md	Fast entrypoint to the full doc set
Reproduction status	reproduction_report.md	What is implemented, what is blocked
Public runtime API	PUBLIC_API.md	Stable import surface and boundaries
Log-linear terminology	LOG_LINEAR_TERMINOLOGY.md	Separates `LogLinearPP` from original `LogLinearAttention`
LogLinearPP baseline	LOG_LINEAR_PP_BASELINE.md	MC-paper baseline preset semantics
LogLinearAttention reference	LOG_LINEAR_ATTENTION_REFERENCE.md	Original mechanism reference-path status
Architecture	ARCHITECTURE.md	Layer flow, backend roles, artifact pipeline
Claim discipline	CLAIM_TO_EVIDENCE_MATRIX.md	Claim-to-evidence mapping
Claim boundaries	CLAIM_BOUNDARY.md	What is explicitly out of claim scope
Paper mapping	PAPER_TO_CODE.md	Paper mechanism to implementation map
Progress ledger	PROGRESS_LEDGER.md	Current weighted plan state
Paper parity blockers	PAPER_PARITY_BLOCKERS.md	What still blocks literal parity claims
Release runbook	PYPI_RELEASE_RUNBOOK.md	Package publishing path
Support matrix	CONSUMER_SUPPORT_MATRIX.md	User-facing environment support

Common Workflows

# List implemented backend/aggregation variants
mc list-variants

# Minimal CPU smoke eval
mc smoke-eval --backend linear --device cpu --warmup-steps 1 --batch-size 1 --seq-len 8 --vocab-size 16 --d-model 8 --num-heads 2

# Debug routing and cache behavior
uv run mc debug-layer --backend linear --aggregation grm --seq-len 8 --d-model 8 --num-heads 2 --out-json outputs/debug/debug_layer.json

# Repository engineering gate
uv run python scripts/reports/release_gate_v1.py --mode repo --out outputs/reports/release_gate_repo_v1.json

# Scientific gate
uv run python scripts/reports/release_gate_v1.py --mode scientific --out outputs/reports/release_gate_scientific_v1.json

For dense command coverage, use:

Scientific Boundaries

Canonical terminology used in this repository:

engineering scaffold: code quality, reproducibility, packaging, and report-generation integrity
scientific evidence: model-backed artifacts with non-smoke targets and truthful manifests
paper parity: faithful reproduction of the paper's reported baselines and metrics, including the comparison rows that are currently missing

scientific evidence is stricter than the engineering scaffold, but it is still not the same as paper parity.

What a green scientific gate still does not prove:

full paper parity
full evaluation evidence for LogLinearPP
anything about the original LogLinearAttention, which remains a separate future mechanism track
throughput parity or equivalence with unpublished internal author code

Backend-specific limits also remain important:

linear is an unnormalized matrix-memory reference path, not a normalized linear-attention parity claim
dla, titans, and swla are mechanism-oriented reference implementations
titans and swla currently use constant scalar coefficients where the paper presents time-indexed coefficients
soup is only true state-space mixing when the backend supports state mixing; otherwise the repo uses an explicit output-mixture fallback
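
The distinction between state mixing and the output-mixture fallback can be seen in a toy standalone sketch. Everything here is illustrative: the function names, the 2x2 states, and the `tanh` readout are assumptions, not the repository's backends. The sketch shows that mixing states before the readout and mixing outputs after it coincide when the readout is linear in the state, and can differ once the readout is non-linear.

```python
import math


def mat_vec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def mix_states(states, weights):
    """Weighted sum of equally shaped matrix states."""
    rows, cols = len(states[0]), len(states[0][0])
    return [
        [sum(w * s[r][c] for w, s in zip(weights, states)) for c in range(cols)]
        for r in range(rows)
    ]


def soup_state_mixing(states, weights, query, readout):
    """Mix the cached states first, then read once from the mixed state."""
    return readout(mix_states(states, weights), query)


def soup_output_mixture(states, weights, query, readout):
    """Read from every cached state, then mix the outputs (the fallback form)."""
    outputs = [readout(state, query) for state in states]
    return [
        sum(w * out[i] for w, out in zip(weights, outputs))
        for i in range(len(outputs[0]))
    ]


def linear_readout(state, query):
    """Unnormalized matrix-memory readout: o = M q."""
    return mat_vec(state, query)


def tanh_readout(state, query):
    """Hypothetical non-linear readout used only to show the two forms diverging."""
    return [math.tanh(x) for x in mat_vec(state, query)]


states = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 2.0], [2.0, 0.0]]]
weights = [0.25, 0.75]
query = [1.0, -1.0]

linear_state = soup_state_mixing(states, weights, query, linear_readout)
linear_output = soup_output_mixture(states, weights, query, linear_readout)
tanh_state = soup_state_mixing(states, weights, query, tanh_readout)
tanh_output = soup_output_mixture(states, weights, query, tanh_readout)
print(linear_state, linear_output)
print(tanh_state, tanh_output)
```

For the linear readout the two results agree, which is why the distinction only matters for backends whose readout or update is not linear in the stored state.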

Install and Release Surfaces

Surface	Command	Output
Editable source install	`python -m pip install -e .`	local runtime package
Dev install	`python -m pip install -e ".[dev]"`	local dev + tests + packaging tools
Built wheel install	`python -m pip install dist/*.whl`	release-like install path
Repo engineering gate	`uv run python scripts/reports/release_gate_v1.py --mode repo ...`	package/repo integrity
Scientific gate	`uv run python scripts/reports/release_gate_v1.py --mode scientific ...`	scientific artifact integrity
PyPI release preflight	`uv run python scripts/checks/pypi_release_preflight.py`	publish-readiness report

Examples

Stable published examples:

Pilot runner:

uv run ./scripts/checks/loglinear_pilot.sh

Current namespaced research/reference surfaces:

memory_caching.baselines.LogLinearPP
memory_caching.loglinear.LogLinearAttentionReference
memory_caching.loglinear.ChunkedLogLinearAttentionReference
tiny-model families:
- tiny_loglinear_ref_lm
- tiny_loglinear_chunked_lm

Sample subset dataset files included for benchmark dry runs:

examples/longbench_subset.jsonl
examples/retrieval_subset.jsonl
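
Before a dry run it can help to inspect a subset file directly. The sketch below is a generic JSONL reader and makes no assumption about the record schema: `read_jsonl` and `field_counts` are hypothetical helper names, and the script only reports which top-level fields appear and how often.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Parse one JSON value per non-empty line, reporting the line number on failure."""
    records = []
    with Path(path).open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as exc:
                raise ValueError(f"{path}:{line_number}: invalid JSON") from exc
    return records


def field_counts(records):
    """Count how many object records contain each top-level field."""
    counts = {}
    for record in records:
        if isinstance(record, dict):
            for key in record:
                counts[key] = counts.get(key, 0) + 1
    return counts


if __name__ == "__main__":
    subset_path = Path("examples/longbench_subset.jsonl")
    if subset_path.exists():
        rows = read_jsonl(subset_path)
        print(len(rows), "records;", field_counts(rows))
    else:
        print(f"{subset_path} not found; run from the repository root")
```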

Package and Repository Links

Resource	Location
Paper	arXiv:2602.24281
PyPI	pypi.org/project/memory-caching
GitHub	github.com/kmccleary3301/memory_caching
Release	v0.1.0

Citation

If you use this repository, cite the original paper and this implementation.

Original Paper

@article{chandra2026memorycaching,
  title={Memory Caching: RNNs with Growing Memory},
  author={Chandra, ...},
  journal={arXiv preprint arXiv:2602.24281},
  year={2026}
}

This Implementation

@software{memory_caching2026,
  title={memory-caching: Community PyTorch Implementation of Memory Caching},
  author={McCleary, Kyle},
  url={https://github.com/kmccleary3301/memory_caching},
  year={2026}
}

Licensed under MIT. Public package surface is documented. Paper-parity limits are documented explicitly.
