Stars
FinRL®: Financial Reinforcement Learning. 🔥
[TMLR 2025🔥] A survey of autoregressive models in vision.
Text-audio foundation model from Boson AI
📖 This is a repository for organizing papers, code, and other resources related to personalized video generation and editing.
This is a repo to track the latest autoregressive visual generation papers.
PyTorch distributed training tutorials
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
Minimal and annotated implementations of key ideas from modern deep learning research.
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
A series of technical reports on slow thinking with LLMs
Recipes to scale inference-time compute of open models
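AIGCPanel is a simple, easy-to-use, all-in-one AI digital human system that supports video synthesis, voice synthesis, and voice cloning, and simplifies local model management with one-click import and use of AI models.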
An Invitation to 3D Vision: A Tutorial for Everyone
A generative world for general-purpose robotics & embodied AI learning.
A deep dive on the history of robotics and the future of humanoids
Unofficial implementation of "UniSim: A Neural Closed-Loop Sensor Simulator".
A 3DGS framework for omni urban scene reconstruction and simulation.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A tutorial on applying diffusion models to planning and control.
