Stars
🚀🚀 [LLM] Train a 26M-parameter GPT fully from scratch in just 2 hours! 🌏
Community maintained hardware plugin for vLLM on Ascend
Fully open reproduction of DeepSeek-R1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Simba2017 / EasyR1
Forked from hiyouga/EasyR1 — EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
The PYthoN General UnIt Test geNerator is a test-generation tool for Python
verl: Volcano Engine Reinforcement Learning for LLMs
Minimal reproduction of DeepSeek R1-Zero
Everything about the SmolLM and SmolVLM family of models
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
21 Lessons, Get Started Building with Generative AI
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
The Open Cookbook for Top-Tier Code Large Language Model
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
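The core of BPE tokenization is repeatedly merging the most frequent adjacent pair of tokens into a new token. A minimal sketch of that merge step (an illustration only, not the repo's code):

```python
from collections import Counter

def most_frequent_pair(ids):
    # Count adjacent token pairs; return the most common one.
    return Counter(zip(ids, ids[1:])).most_common(1)[0][0]

def merge(ids, pair, new_id):
    # Replace every occurrence of `pair` with `new_id`.
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list("aaabdaaabac".encode("utf-8"))
pair = most_frequent_pair(ids)   # (97, 97): "aa" is the most frequent pair
ids = merge(ids, pair, 256)      # introduce token 256 for "aa"
```

Training a full tokenizer just repeats this merge step until the target vocabulary size is reached.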
Code for the paper "Language Models are Unsupervised Multitask Learners"
llama3 implementation one matrix multiplication at a time