Skip to content
View LouChao98's full-sized avatar

Block or report LouChao98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CVE-Factory

PHP 36 Updated Feb 2, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 546 21 Updated Feb 4, 2026

Breakthrough Method for Agile Ai Driven Development

JavaScript 34,131 4,345 Updated Feb 4, 2026

Open-source framework for the research and development of foundation models.

HTML 749 75 Updated Feb 4, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 304 8 Updated Feb 2, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 513 28 Updated Jan 28, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 568 51 Updated Jan 19, 2026

A Quirky Assortment of CuTe Kernels

Python 780 75 Updated Feb 4, 2026

A simple C++ finite state machine library

C++ 1,148 189 Updated Jun 25, 2024

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python 228 28 Updated Nov 4, 2025
Python 159 14 Updated Dec 27, 2024
C++ 342 36 Updated Jan 28, 2026

Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality

HTML 317 18 Updated Jan 5, 2026

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 161 29 Updated Jan 22, 2026

Efficient End2End Compiler for Mixed-Precision Deep Learning

Python 10 Updated Feb 8, 2025

An intuitive and low-overhead instrumentation tool for Python

Python 1,196 40 Updated Jul 8, 2025
JavaScript 15 1 Updated Jun 14, 2025

Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model

Python 262 17 Updated May 27, 2025

你还在为自己存放的VV表情包不够多,使用时觉得不够贴切而感到烦恼吗?快来试试这个项目吧!

Python 2,326 72 Updated Jun 20, 2025
Python 209 8 Updated Oct 27, 2025

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda 249 13 Updated Jan 20, 2026

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 977 225 Updated Feb 4, 2026

Official Repo for Open-Reasoner-Zero

Python 2,085 117 Updated Jun 2, 2025

Small tool to disable macOS 15's annoying new screencapture nag popups

Shell 671 17 Updated Nov 7, 2024

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 229 24 Updated Aug 2, 2024

Bucketed top-k for PyTorch using a priority queue

Python 8 Updated Mar 22, 2025
Next