Skip to content
View sumleo's full-sized avatar

Block or report sumleo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 14 2 Updated Aug 21, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 576 54 Updated Oct 7, 2025

A lightweight monitoring tool that leverages OS-level strace alongside Python audit hooks to detect sensitive operations during ML model execution.

Python 9 Updated Mar 24, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 5,054 462 Updated Dec 13, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,612 990 Updated Jan 6, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,065 796 Updated Jan 6, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,966 928 Updated Dec 15, 2025

“连续八年成为全世界最受喜爱的语言,无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book

Rust 29,697 2,529 Updated Dec 31, 2025

A technical explainer by @kognise of how your computer runs programs, from start to finish.

MDX 5,389 189 Updated Jun 15, 2024

Djinn-Agent: A lightweight CLI tool for seamless interaction with Claude's advanced computer-use capabilities, automating complex tasks from the terminal.

Python 27 Updated Oct 28, 2024

Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper

Python 307 30 Updated May 1, 2025

Master programming by recreating your favorite technologies from scratch.

Markdown 456,270 42,767 Updated Dec 26, 2025

Visualization and debugging tool for LangChain workflows

Python 741 54 Updated Mar 6, 2024

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

806 51 Updated May 21, 2025

A curated list of Large Language Model (LLM) Interpretability resources.

1,460 107 Updated Jun 22, 2025

经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新

CSS 28,053 2,261 Updated Jan 9, 2026

The automated prompt injection framework for LLM-integrated applications.

Python 247 41 Updated Sep 12, 2024

Resource, Evaluation and Detection Papers for ChatGPT

457 25 Updated Mar 21, 2024

📋 A list of open LLMs available for commercial use.

12,590 946 Updated Feb 13, 2025

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,147 381 Updated Aug 13, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 57,029 7,598 Updated Nov 13, 2024

Generate 3D objects conditioned on text or images

Python 12,189 1,059 Updated Jun 22, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,530 533 Updated Feb 27, 2024

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,061 229 Updated Apr 14, 2024

Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。

Python 1,039 77 Updated Oct 19, 2023

Codebase for the ACL 2023 paper: White-Box Multi-Objective Adversarial Attack on Dialogue Generation.

Python 16 Updated Dec 8, 2023

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,897 4,682 Updated Aug 19, 2024

[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.

Jupyter Notebook 775 64 Updated Oct 25, 2024

The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5

Python 65,661 13,691 Updated Jan 12, 2026
Next