LLM
[TMLR 2024] Efficient Large Language Models: A Survey
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
ReLE Benchmark: Chinese AI large-model capability evaluation (continuously updated). Currently covers 335 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.5, Baidu ERNIE-X1.1, ERNIE-5.0-Thinking, qwen3-max, Baichuan, iFlytek Spark, and SenseTime SenseChat, as well as kimi-k2, ernie4.5, minimax-M2, deepseek-…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it wil…
🚀🚀 Train a small 26M-parameter GPT completely from scratch in just 2 hours!
FlashMLA: Efficient Multi-head Latent Attention Kernels
DeepEP: an efficient expert-parallel communication library
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
LLM Frontend for Power Users.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Integrate the DeepSeek API into popular software.
verl: Volcano Engine Reinforcement Learning for LLMs
TradingAgents: Multi-Agent LLM Financial Trading Framework
