binarycrayon

Follow

Yudi Xue binarycrayon

Follow

46 followers · 107 following

Canada

Achievements

Achievements

Organizations

Stars

ML

102 repositories

Elyah2035 / llama-dl

Shell 83 12 Updated Apr 23, 2023

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 10,747 1,253 Updated Jan 8, 2026

AI-Hypercomputer / JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 403 58 Updated Jan 5, 2026

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 24,460 2,843 Updated Jan 26, 2026

Doriandarko / claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…

Python 11,164 1,155 Updated Dec 12, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,397 1,864 Updated Jan 9, 2026

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,321 71 Updated Jan 27, 2026

git-disl / awesome-LLM-game-agent-papers

A Survey on Large Language Model-Based Game Agents

813 27 Updated Nov 4, 2025

huggingface / huggingface_hub

The official Python client for the Hugging Face Hub.

Python 3,297 920 Updated Jan 30, 2026

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,305 263 Updated Jan 28, 2026

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,655 3,316 Updated Jan 30, 2026

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,020 686 Updated Jan 30, 2026

da03 / Internalize_CoT_Step_by_Step

Python 203 21 Updated Apr 19, 2025

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,652 414 Updated Jan 30, 2026

facebookresearch / RAM

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 344 41 Updated Dec 16, 2025

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,920 922 Updated Sep 1, 2024

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,654 380 Updated Jan 30, 2026

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,476 4,699 Updated Jan 29, 2026

AI-Hypercomputer / jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"

Python 80 21 Updated Dec 18, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,386 4,248 Updated Jan 30, 2026

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,436 324 Updated Nov 13, 2024

sgl-project / sgl-learning-materials

Materials for learning SGLang

728 55 Updated Jan 5, 2026

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,853 2,223 Updated Mar 11, 2025

anthropics / claude-quickstarts

A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API

Python 13,742 2,298 Updated Jan 20, 2026

Jimver / cuda-toolkit

GitHub Action to install CUDA

TypeScript 199 68 Updated Dec 27, 2025

stephenhillier / starlette_exporter

Prometheus exporter for Starlette and FastAPI

Python 411 37 Updated Oct 15, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,752 270 Updated Jul 18, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 97,065 26,698 Updated Jan 30, 2026

mlabonne / llm-datasets

Curated list of datasets and tools for post-training.

4,205 347 Updated Nov 10, 2025

AgibotTech / agibot_x1_train

The reinforcement learning training code for AgiBot X1.

Python 1,631 505 Updated Oct 23, 2024