Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A high-throughput and memory-efficient inference and serving engine for LLMs
Rich is a Python library for rich text and beautiful formatting in the terminal.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…
The official GitHub page for the survey paper "A Survey of Large Language Models".
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
Official Python client library for kubernetes
My learning notes for ML SYS.
The official PyTorch implementation of Google's Gemma models
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
SkyRL: A Modular Full-stack RL Library for LLMs
🔥 Blazing fast bulk data transfers between any cloud 🔥
A command line tool for interacting with cloud storage services.


