- ViveStudios
- Anyang, Gyeonggi, South Korea
Stars
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …
HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026
The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Repository for training models for music source separation.
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
Interactive PyTorch forward pass visualization in notebooks
Contrastive Learning with Positive-Negative Frame Mask for Music Representation; Official code
A TTS model capable of generating ultra-realistic dialogue in one pass.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Character-aware audio-only subtitling
Blender plugin to allow for modding Helldivers 2
vgmstream - A library for playback of various streamed audio formats used in video games.
Convert AudioKinetic Wwise RIFF/RIFX Vorbis to standard Ogg Vorbis
A simple tool to extract things from Helldivers 2 for your 3D printing needs. [NOTE] Hellextractor has ceased development; see https://github.com/xypwn/filediver instead!
Extract Helldivers 2's 3D models, audio, video, textures and more.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
A simple, high-quality voice conversion tool focused on ease of use and performance.
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making!
High-resolution models for human tasks.
Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"
Official inference framework for 1-bit LLMs
PyTorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

