[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,126 65 Updated Nov 25, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,045 240 Updated Nov 30, 2025

SunzeY / SEAgent

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

Python 216 19 Updated Aug 7, 2025

liuhuadai / OmniAudio

[ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"

Python 343 9 Updated Jun 27, 2025

X-Drunker / Sonic4D-project-page

Project page of Sonic4D

JavaScript 7 Updated Jun 26, 2025

hustvl / 4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 3,271 309 Updated Oct 27, 2024

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,143 1,289 Updated Oct 11, 2025

3DTopia / GenDoP

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Python 97 5 Updated Dec 31, 2025

JianxGao / MinD-3D

Python 43 3 Updated Oct 13, 2025

JianxGao / CineBrain

JavaScript 4 Updated Dec 12, 2025

3DTopia / Imagine360

Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"

Python 155 6 Updated May 14, 2025

Lizb6626 / IDArb

Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"

Python 94 7 Updated Jul 9, 2025

3DTopia / LayerPano3D

[SIGGRAPH 2025] LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Python 306 13 Updated Jul 24, 2025

Kmcode1 / SG-I2V

This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.

Jupyter Notebook 114 6 Updated Nov 26, 2024

JoeLeelyf / customize-arxiv-daily

Customize your arXiv recommendation every day.

Python 139 24 Updated Sep 24, 2025

robincourant / DIRECTOR

Python 73 5 Updated Oct 25, 2024

MengChen Zhang kszpxxzmc

Stars

Vchitect / LongVie

InternLM / ARM-Thinker

kszpxxzmc / ViSAudio

facebookresearch / audiobox-aesthetics

InternLM / StarBench

GoogleChrome / omnitone

LiuZH-19 / SongGen

SonyResearch / CCStereo

InternLM / CapRL

InternLM / Spark

Lizb6626 / SS4D

3DTopia / 3DGen-Bench

FFmpeg / FFmpeg

jaeyeonkim99 / visage

FunAudioLLM / ThinkSound