Stars
StreamDiffusion, Live Stream APP
Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
The ultimate training toolkit for finetuning diffusion models
PixelHacker: Image Inpainting with Structural and Semantic Consistency
[ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation
[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
The best OSS video generation models, created by Genmo
Downloads videos and playlists from YouTube
Intelligent multilingual AI video dubbing/translation tool - Linly-Dubbing: "AI-empowered, language without borders"
Scalable and memory-optimized training of diffusion models
A curated list of recent diffusion models for video generation, editing, and various other applications.
This patch removes the restriction on the maximum number of simultaneous NVENC video encoding sessions imposed by Nvidia on consumer-grade GPUs.
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
Wav2Lip version 288 and pipeline to train
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
Real-time face swap and one-click video deepfake with only a single image
Multilingual Voice Understanding Model
MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios