State of the Art,
Verified

Independent ML benchmarks across 17 research areas. Track progress, find implementations, compare models.

Vision, NLP, reasoning, code, speech, medical, robotics, and more. All results verified with source links.

286+ benchmark results
17 research areas
143 models tracked
Links to implementations
Free PDF Download

The Zen of AI Composition

Building intelligent systems from first principles. A philosophical guide to AI transformations, modular composition, and evidence-based prompting.

Download now
New Feature

AI Building Blocks

Stop searching. Start building. See which tools transform your data - with production-ready implementations.

Explore blocks

Can I trust these numbers?

Numbers from published papers, verified with our own tests where possible. No marketing claims, no sponsored rankings.

Which model fits my use case?

Compare accuracy, speed, cost, and deployment complexity. We show you the tradeoffs that matter for production.

Can I use this data?

Yes. All benchmark data available as JSON. Build dashboards, cite it in papers, integrate it into your tools.

286+
Benchmark results
17
Research areas
86
Datasets tracked
143
Models compared
Open Data

Use This Data

All benchmark data available as JSON

Build dashboards, cite in papers, integrate into your tools. No API key needed. Updated weekly with new results.

Free to use
Source links included
Updated weekly

Frequently Asked Questions

What is CodeSOTA?

CodeSOTA is an independent ML benchmark tracking platform. We provide verified state-of-the-art results across 17 research areas including computer vision, NLP, reasoning, code generation, speech, medical AI, robotics, and more.

Is this a Papers with Code replacement?

CodeSOTA builds on the Papers with Code legacy after Meta shut it down in July 2025. We track 286+ benchmark results with links to implementations. Read the full story.

Are these benchmarks verified?

Yes. We run benchmarks independently where possible, rather than just aggregating paper claims. All data includes source URLs and access dates for verification. See our methodology.

Can I use this benchmark data?

Yes. All benchmark data is available as JSON at /data/benchmarks.json. Build dashboards, cite it in papers, or integrate it into your tools.

What People Say

PZ

Piotr Zaczek

AI Consultant, scaling Voice-AI for 15M+ calls/year

"Zajebista robota. Doslownie wczoraj szukalem dobrych porownywarek OCRow i jedynie marketingowy BS. Good job!"

December 2024

?

Anonymous

AI Engineer

"Super czysty, slop-free UI, ale przede wszystkim copy: bardzo precyzyjne pozycjonowanie i przeglad projektow."

December 2024

Cite CodeSOTA

If you use CodeSOTA in your research, please cite:

@misc{wikiel2025codesota,
  author = {Wikieł, Kacper},
  title = {CodeSOTA: Independent ML Benchmark Tracking},
  year = {2025},
  url = {https://codesota.com},
  note = {Accessed: 2025}
}

Or in plain text: Wikieł, K. (2025). CodeSOTA: Independent ML Benchmark Tracking. https://codesota.com

Want updates on new benchmarks?

We'll let you know when we add tests for new models or tasks.