Stars
AI agents running research on single-GPU nanochat training automatically
An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and examples for learning representations from images, video, and…
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Build an LLM and write a paper from scratch
🚀 Efficient implementations of state-of-the-art linear attention models
A 128M parameter language model built from scratch for learning how large language models work.
Research on the Muon optimizer in LLM pretraining.
Cheap and easy LLM experiments for amateurs (alpha)
Unofficial implementation of Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch (fork with video pseudo3d)



