Starred repositories
hamishivi / EasyLM
Forked from young-geng/EasyLM. Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax.
Textbook on reinforcement learning from human feedback
A project to scrape, tabulate, and display job data from the O*NET website, and possibly other websites. Non-commercial.
slime is an LLM post-training framework for RL Scaling.
Miles is an enterprise-facing reinforcement learning framework for large-scale MoE post-training and production workloads, forked from and co-evolving with slime.
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
A benchmark for LLMs on complicated tasks in the terminal
Modular Multi-Agent System for Scientific Research Assistance
OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
OrangeX4 / latex2sympy
Forked from purdue-tlt/latex2sympy. Parse LaTeX math expressions.
[NeurIPS 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
[EMNLP 2024 (main)] Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention
A high-throughput and memory-efficient inference and serving engine for LLMs
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLMs
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
Unified KV Cache Compression Methods for Auto-Regressive Models
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
🧮 A collection of resources to learn mathematics for machine learning
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"