Skip to content
View firozgit's full-sized avatar

Block or report firozgit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 78 16 Updated Aug 17, 2024

Textbook on reinforcement learning from human feedback

TeX 1,424 127 Updated Jan 22, 2026

A project to scrape, tabulate, and display job data from the O*NET website, and possibly other websites. Non-commercial.

Python 16 4 Updated Mar 15, 2020

slime is an LLM post-training framework for RL Scaling.

Python 3,472 439 Updated Jan 20, 2026

Miles is an enterprise-facing reinforcement learning framework for large-scale MoE post-training and production workloads, forked from and co-evolving with slime.

Python 755 79 Updated Jan 22, 2026

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 768 121 Updated Jan 20, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 1,395 457 Updated Jan 22, 2026

Modular Multi-Agent System for Scientific Research Assistance

TeX 479 75 Updated Jan 22, 2026

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Python 29 6 Updated May 23, 2025

"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023

Python 16 1 Updated Nov 28, 2024

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python 104 8 Updated Mar 31, 2025

Contexts Optical Compression

Python 22,136 2,021 Updated Oct 25, 2025
Python 112 8 Updated May 7, 2025

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,804 83 Updated Jul 27, 2025

DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems

Python 62 6 Updated Sep 29, 2024

Parse LaTeX math expressions

Python 143 32 Updated Aug 5, 2024

[Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Python 1,170 188 Updated Oct 16, 2025

[EMNLP 2024 (main)] Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters

Python 11 2 Updated Nov 5, 2024
Python 147 22 Updated Oct 9, 2024

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention

Python 267 13 Updated Nov 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 68,112 12,780 Updated Jan 22, 2026

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Python 176 18 Updated Jul 12, 2024

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Python 12 Updated Jan 15, 2025

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

408 25 Updated Mar 3, 2025
Python 299 27 Updated Jul 10, 2025

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,298 161 Updated Jan 4, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,514 221 Updated Dec 15, 2025

🧮 A collection of resources to learn mathematics for machine learning

5,688 622 Updated Jan 24, 2023

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,225 121 Updated Mar 10, 2024
Next