Skip to content
View hecola's full-sized avatar

Highlights

  • Pro

Block or report hecola

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

50 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 155,066 31,732 Updated Jan 14, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,969 8,720 Updated Nov 12, 2025

Making large AI models cheaper, faster and more accessible

Python 41,318 4,542 Updated Dec 22, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,127 3,829 Updated Jan 14, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,440 4,048 Updated Jan 14, 2026

Machine Learning Engineering Open Book

Python 16,354 1,008 Updated Jan 11, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,632 2,008 Updated Jan 14, 2026

Nano vLLM

Python 10,756 1,381 Updated Nov 3, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,238 1,701 Updated Jan 14, 2026

🏔️国立台湾大学、新加坡国立大学、早稻田大学、东京大学,中央研究院(台湾)以及中国重点高校及科研机构,社科、经济、数学、博弈论、哲学、系统工程类学术论文等知识库。

Python 9,314 1,893 Updated Jan 6, 2026

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,891 1,508 Updated Jan 14, 2026

交易模块

Python 7,639 1,754 Updated Sep 10, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 4,697 395 Updated Jan 14, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,650 646 Updated Jan 14, 2026

Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model.

Python 4,373 443 Updated Jan 14, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,675 257 Updated Dec 18, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,087 607 Updated Jan 13, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,933 323 Updated Jan 6, 2026

compiler learning resources collect.

Python 2,659 363 Updated Mar 19, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,089 188 Updated Jun 30, 2025

A fast MoE impl for PyTorch

Python 1,827 200 Updated Feb 10, 2025

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,527 203 Updated Apr 29, 2021

面向编译器开发人员的V8内部实现文档

Python 1,510 138 Updated Jul 28, 2023

Automatically Collect POC or EXP from GitHub by CVE ID.

Python 1,116 230 Updated Jan 14, 2026

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 870 209 Updated Jan 14, 2026

A RISC-V ELF psABI Document

Python 825 182 Updated Dec 15, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 746 80 Updated Jan 10, 2026

Visual Studio Code project/compile_commands.json generator for Linux kernel sources and out-of-tree modules

Python 643 160 Updated Sep 23, 2023

GLake: optimizing GPU memory management and IO transmission.

Python 497 45 Updated Mar 24, 2025

【2024年新版】国科大 陈云霁 智能计算系统AICS实验代码

Python 486 44 Updated Jun 12, 2025
Next