Skip to content
View hecola's full-sized avatar

Highlights

  • Pro

Block or report hecola

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,570 502 Updated Jan 14, 2026

关于Transformer模型的最简洁pytorch实现,包含详细注释

Jupyter Notebook 230 28 Updated Nov 13, 2023
Python 47 13 Updated Aug 23, 2023

A complete computer science study plan to become a software engineer.

335,841 81,565 Updated Aug 28, 2025

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,067 665 Updated Oct 31, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,924 323 Updated Jan 6, 2026

PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications

Python 127 32 Updated May 9, 2022

Slurm: A Highly Scalable Workload Manager

C 3,638 787 Updated Jan 13, 2026

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,702 2,249 Updated Jan 6, 2026

A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web

JavaScript 25,706 1,923 Updated Jan 14, 2026

Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

TypeScript 16,342 883 Updated Jan 14, 2026

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 24,290 2,221 Updated Jan 3, 2026

A simple C++11 Thread Pool implementation

C++ 8,649 2,356 Updated Jul 20, 2024

workspace是基于C++11的轻量级异步执行框架,支持:通用任务异步并发执行、优先级任务调度、自适应动态线程池、高效静态线程池、异常处理机制等。

C++ 1,234 184 Updated Jul 16, 2025

Linux kernel stable tree

C 1 Updated Dec 29, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,737 789 Updated Dec 22, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,088 2,306 Updated Sep 3, 2025

YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA, MPS, CPU, Triton, NKI, cuDNN, and MKL backends.

C++ 37 4 Updated Jan 11, 2026

AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming

Python 154 30 Updated Jan 13, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 746 80 Updated Jan 10, 2026

nanomsg-next-generation -- light-weight brokerless messaging

C 4,475 542 Updated Dec 2, 2025

ZeroMQ core engine in C++, implements ZMTP/3.1

C++ 10,711 2,457 Updated Jan 10, 2026

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

C 1,544 512 Updated Jan 13, 2026

😼 优雅地使用基于 clash/mihomo 的代理环境

Shell 7,951 971 Updated Jan 9, 2026

Fine-grained GPU sharing primitives

Python 147 18 Updated Jul 28, 2025

An optimized neural network operator library for chips base on Xuantie CPU.

C 96 44 Updated Jun 26, 2024

注释的nano_vllm仓库,并且完成了MiniCPM4的适配以及注册新模型的功能

Python 140 28 Updated Aug 11, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 331 20 Updated Nov 2, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,947 12,467 Updated Jan 10, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,956 8,722 Updated Nov 12, 2025
Next