Skip to content
View junxu's full-sized avatar

Block or report junxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 782 127 Updated Jan 20, 2026

Autonomous GPU Kernel Generation via Deep Agents

Python 225 28 Updated Jan 31, 2026

Building the Virtuous Cycle for AI-driven LLM Systems

Python 144 21 Updated Jan 30, 2026

Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.

JavaScript 36,494 4,512 Updated Jan 30, 2026

A tool analyzing unused GPU code by machine learning workloads

Rust 14 3 Updated Oct 6, 2025

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

236 11 Updated Dec 19, 2025
Python 424 70 Updated Aug 26, 2024

Official implementation of our NeurIPS 2025 paper: "FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts."

Jupyter Notebook 177 13 Updated Nov 29, 2025

DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit

C++ 92 8 Updated Jan 26, 2026

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 10,526 1,199 Updated Feb 1, 2026

交易模块

Python 7,714 1,777 Updated Sep 10, 2025

FactorHub是一个自研的现代化量化因子分析平台,专为量化投资研究者设计。平台完全自主研发,集成了「数据获取-因子管理-因子分析-策略回测-因子挖掘」的完整工作流程,通过直观的Web界面和强大的计算引擎,大幅降低量化分析的门槛,提高研究效率。 FactorHub = Factor(因子) + Hub(中心),意为因子分析和管理的核心枢纽。

Python 93 24 Updated Oct 5, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 4,863 425 Updated Jan 31, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,470 278 Updated Feb 1, 2026
C++ 342 36 Updated Jan 28, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,495 704 Updated Feb 1, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,190 418 Updated Dec 31, 2025

Hierarchical Reasoning Model Official Release

Python 12,287 1,789 Updated Sep 9, 2025

Efficient GPU communication over multiple NICs.

C++ 21 4 Updated Nov 20, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 62,839 4,720 Updated Jan 31, 2026
Python 8 Updated Nov 3, 2025

Code for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Python 10 2 Updated Jun 12, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,104 170 Updated Jan 29, 2026

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,399 250 Updated Nov 29, 2023

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,194 116 Updated Jan 31, 2026

Official Repository of Absolute Zero Reasoner

Python 1,803 293 Updated Aug 24, 2025

ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.

Python 4,703 456 Updated Jan 8, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,047 163 Updated Aug 26, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 406 51 Updated Oct 4, 2025
C 14 1 Updated Dec 13, 2024
Next