- PaddlePaddle
- Shanghai
- 15:17 (UTC +08:00)
- https://enigmatisms.github.io/owner-info/
Lists (17)
- C++: Excellent C++ projects.
- CUDA: Some CUDA repos.
- DeepLearning: Deep learning based algorithms and repos.
- Game Engine: Engine engineering.
- Graphics: Computer graphics related.
- HPC: High performance computing.
- Interesting: Interesting little projects or games.
- Learning: Excellent learning resources.
- Lightning: Fast! Make your code faster.
- LLMs: Interesting LLMs.
- Math Algos: Mathematically based algorithms.
- NeRF&GS: NeRF and 3D Gaussian.
- Python: Interesting Python repos and utilities.
- Re-implementation: "My-own" series.
- Rust: Repos about the Rust programming language.
- SLAM: Repos of SLAM.
- Utilities: Speed boosters (CUDA) and useful repos.
Starred repositories
qqr is an RL training framework for open-ended agents.
A tool for bandwidth measurements on NVIDIA GPUs.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Machine Learning Engineering Open Book
High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
Open-source release accompanying Gao et al. 2025
verl: Volcano Engine Reinforcement Learning for LLMs
Accelerating MoE with IO and Tile-aware Optimizations
Latest 2021 roundup of recommended reading for engineers: computer science, software technology, entrepreneurship, ideas and philosophy, mathematics, and biographies.
vLLM Kunlun (vllm-kunlun) is a community-maintained hardware plugin designed to seamlessly run vLLM on the Kunlun XPU.
Tile-Based Runtime for Ultra-Low-Latency LLM Inference
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
Helpful kernel tutorials and examples for tile-based GPU programming
An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.
An early research stage expert-parallel load balancer for MoE models based on linear programming.
A cinematic Git commit replay tool for the terminal, turning your Git history into a living, animated story.
Enjoy the magic of Diffusion models!
Large-Area Fabrication-Aware Computational Diffractive Optics (SIGGRAPH Asia & TOG 2025)
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Unifying 3D Mesh Generation with Language Models
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Core Functional Library for Distributed Training