Skip to content
View wangx26's full-sized avatar

Block or report wangx26

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

17 repositories

[TMLR 2024] Efficient Large Language Models: A Survey

1,242 98 Updated Jun 23, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,840 582 Updated May 3, 2024

Grok open release

Python 50,570 8,372 Updated Aug 30, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,175 13,998 Updated Dec 24, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,244 983 Updated Dec 19, 2025

ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括335个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.5、文心ERNIE-X1.1、ERNIE-5.0-Thinking、qwen3-max、百川、讯飞星火、商汤senseChat等商用模型, 以及kimi-k2、ernie4.5、minimax-M2、deepseek-…

5,327 212 Updated Dec 23, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,620 12,203 Updated Dec 21, 2025

An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it wil…

TypeScript 3,329 315 Updated Apr 22, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,080 4,262 Updated Dec 24, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,936 922 Updated Dec 15, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,828 1,036 Updated Dec 24, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,996 778 Updated Dec 23, 2025

LLM Frontend for Power Users.

JavaScript 21,228 4,452 Updated Dec 24, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,856 4,112 Updated Dec 23, 2025

Integrate the DeepSeek API into popular softwares

34,828 3,907 Updated Sep 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,765 2,889 Updated Dec 24, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 26,926 5,081 Updated Oct 9, 2025