Starred repositories
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🦜🔗 The platform for reliable agents.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
A toolkit for developing and comparing reinforcement learning algorithms.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
A modular graph-based Retrieval-Augmented Generation (RAG) system
DeepSeek Coder: Let the Code Write Itself
Build Real-Time Knowledge Graphs for AI Agents
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Machine Learning Engineering Open Book
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
The open-source AIOps and alert management platform
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Agent S: an open agentic framework that uses computers like a human
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Composable building blocks to build LLM Apps
AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP.
Efficient Triton Kernels for LLM Training
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai


