binarycrayon

Organizations

@argoproj-labs

Stars

ML

102 repositories
Shell 83 12 Updated Apr 23, 2023

Large Language Model Text Generation Inference

Python 10,747 1,253 Updated Jan 8, 2026

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 403 58 Updated Jan 5, 2026

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 24,460 2,843 Updated Jan 26, 2026

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claud…

Python 11,164 1,155 Updated Dec 12, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,397 1,864 Updated Jan 9, 2026

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,321 71 Updated Jan 27, 2026

A Survey on Large Language Model-Based Game Agents

813 27 Updated Nov 4, 2025

The official Python client for the Hugging Face Hub.

Python 3,297 920 Updated Jan 30, 2026

Optimizing inference proxy for LLMs

Python 3,305 263 Updated Jan 28, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,655 3,316 Updated Jan 30, 2026

A PyTorch native platform for training generative AI models

Python 5,020 686 Updated Jan 30, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,652 414 Updated Jan 30, 2026

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 344 41 Updated Dec 16, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,920 922 Updated Sep 1, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,654 380 Updated Jan 30, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,476 4,699 Updated Jan 29, 2026

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

Python 80 21 Updated Dec 18, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,386 4,248 Updated Jan 30, 2026

Entropy Based Sampling and Parallel CoT Decoding

Python 3,436 324 Updated Nov 13, 2024

Materials for learning SGLang

728 55 Updated Jan 5, 2026

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,853 2,223 Updated Mar 11, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API

Python 13,742 2,298 Updated Jan 20, 2026

GitHub Action to install CUDA

TypeScript 199 68 Updated Dec 27, 2025

Prometheus exporter for Starlette and FastAPI

Python 411 37 Updated Oct 15, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,752 270 Updated Jul 18, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 97,065 26,698 Updated Jan 30, 2026

Curated list of datasets and tools for post-training.

4,205 347 Updated Nov 10, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,631 505 Updated Oct 23, 2024