AndroidSheepy
  • USTC, intern@MBZUAI
  • Abu Dhabi, UAE

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,504 220 Updated Dec 15, 2025

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

Python 271 36 Updated Aug 28, 2025

A PyTorch native platform for training generative AI models

Python 4,949 662 Updated Jan 11, 2026

An interference-aware scheduler for fine-grained GPU sharing

Python 158 28 Updated Nov 26, 2025

NVIDIA Linux open GPU kernel module source

C 16,612 1,562 Updated Dec 18, 2025

Easy and Efficient dLLM Fine-Tuning

Python 193 7 Updated Dec 15, 2025

LM engine is a library for pretraining/finetuning LLMs

Python 109 24 Updated Jan 10, 2026

LaTeX Template for Statement of Purpose (SoP)

TeX 144 22 Updated Oct 28, 2022

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 384 37 Updated Jan 7, 2026

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 83 20 Updated Dec 7, 2025

Pie: Programmable LLM Serving

Python 84 11 Updated Jan 11, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,619 197 Updated Jan 11, 2026

kernels, of the mega variety

Python 643 35 Updated Sep 28, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,052 792 Updated Jan 6, 2026

C++ 32 2 Updated Jul 17, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 36,377 15,738 Updated Jan 11, 2026

Cuda 31 1 Updated Apr 2, 2025

A reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources

C 10 1 Updated May 21, 2024

Jupyter Notebook 6 2 Updated Dec 7, 2024

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python 589 36 Updated Nov 24, 2025

Documentation of NVIDIA chip/hardware interfaces

C 1,320 98 Updated Aug 18, 2025

Effective transpose on Hopper GPU

Cuda 27 3 Updated Sep 6, 2025

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 26,005 5,122 Updated Dec 31, 2025

Port of OpenAI's Whisper model in C/C++

C++ 45,644 5,090 Updated Jan 5, 2026

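VoiceTrans is an all-in-one offline AI tool for generating and translating video subtitles. Features include video download, audio extraction, transcription with timing, subtitle translation, video compositing, and subtitle summarization.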

Python 925 42 Updated Dec 16, 2025

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,870 586 Updated Sep 7, 2024

Automatically generate, translate, and overlay subtitles for any video.

Python 88 8 Updated Jun 10, 2025

Automatically generate and overlay subtitles for any video.

Python 2,128 358 Updated Jul 12, 2024