Lists (2)
Sort Name ascending (A-Z)
Starred repositories
SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.
[CVPR 2026] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"
[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
[ICLR2026] SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
SQL Native Memory Layer for LLMs, AI Agents & Multi-Agent Systems
Video Content Customization Using First Frame
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
HunyuanVideo-1.5: A leading lightweight video generation model
Kandinsky 5.0: A family of diffusion models for Video & Image generation
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [Siggraph Asian 2025]
[NeurIPS 2025] Pixel-Perfect Depth
[ICLR 2026] rCM: SOTA JVP-Based Diffusion Distillation & Few-Step Video Generation & Scaling Up sCM/MeanFlow & Real-Time Autoregressive Video Diffusion
StreamDiffusion, Live Stream APP
A tool for running and customizing real-time, interactive generative AI pipelines and models
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
Pure TypeScript media toolkit for reading, writing, and converting video and audio files, directly in the browser.
DecartAI / diffusers-lucy-edit
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
[CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching
We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…



