[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,124 330 Updated Jan 17, 2026

thu-ml / SLA

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 259 15 Updated Jan 17, 2026

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 2,826 173 Updated Feb 1, 2026

exacity / deeplearningbook-chinese

Deep Learning Book Chinese Translation

TeX 37,183 9,176 Updated Dec 3, 2019

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,293 224 Updated Jan 29, 2026

the1812 / Bilibili-Evolved

强大的哔哩哔哩增强脚本

TypeScript 28,262 1,728 Updated Jan 29, 2026

FoundationAgents / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 63,742 8,012 Updated Jan 21, 2026

yuanchenyang / smalldiffusion

Simple and readable code for training and sampling from diffusion models

Python 693 51 Updated Jun 14, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,554 2,170 Updated Jan 29, 2026

InternRobotics / AnySplat

[SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views

Python 703 36 Updated Dec 22, 2025

OpenHelix-Team / VLA-Adapter

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 1,953 176 Updated Nov 18, 2025

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,384 51 Updated Dec 16, 2025

MiniMax-AI / VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 435 10 Updated Dec 16, 2025

DengKaiCQ / Pi-Long

Code implementation of Pi-Long

Python 164 10 Updated Dec 10, 2025

harpreetsahota204 / mineru_2_5

Integrating MinerU2.5 into FiftyOne as a Remote Source Zoo Model

Python 6 Updated Nov 14, 2025

harpreetsahota204 / sam3_images

Implementing sam3 for images as a Remote Source Zoo Model in FiftyOne

Python 3 2 Updated Dec 11, 2025

clementchadebec / benchmark_VAE

Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)

Python 1,983 180 Updated Jul 31, 2024

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 12,934 1,067 Updated Feb 1, 2026

PKU-VCL-3DV / SLAM3R

[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R

Python 1,088 69 Updated Oct 18, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,421 1,867 Updated Jan 9, 2026

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,283 1,704 Updated Jan 30, 2026

qiuzh20 / gated_attention

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 819 51 Updated Dec 20, 2025

apple / axlearn

An Extensible Deep Learning Library

Python 2,317 398 Updated Jan 30, 2026

IIGROUP / MANIQA

[CVPRW oral 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

Python 405 45 Updated Jun 10, 2023

genaibook / genaibook

Contains the public resources of Hands on GenAI book

Jupyter Notebook 230 83 Updated Jan 5, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,436 54 Updated Dec 30, 2025

xeno-canto

image-downloader

fine-grained-classification

Lefutonku lefutonku-github

Lists (5)

mldl_classic

mldl_data_fetching

mldl_datasets

mldl_fiftyone

mldl_processing

Starred repositories

xeno-canto

image-downloader

fine-grained-classification

hmm-viterbi-algorithm

Swift