
Bash is all you need - A nano Claude Code–like "agent harness", built from 0 to 1

TypeScript 31,490 5,252 Updated Mar 17, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 106,185 12,217 Updated Mar 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 321,701 61,846 Updated Mar 18, 2026
Python 16 1 Updated Mar 12, 2026

Accurate and general beat tracker

Python 245 46 Updated Feb 27, 2026

Self-supervised key estimation model that matches performance with supervised state-of-the-art model.

Python 48 2 Updated Jun 9, 2025

Official inference code for SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Python 459 51 Updated Mar 16, 2026

The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 7,957 898 Updated Mar 18, 2026

Robust Singing Voice Transcription and MIDI Extraction

Python 114 6 Updated Nov 20, 2024

SOFA: Singing-Oriented Forced Aligner

Python 211 29 Updated May 16, 2025

Suno API with JWT token authentication support

TypeScript 54 11 Updated Jan 24, 2026

Create Music in Seconds with SunoAPI.

Python 1,755 281 Updated Apr 26, 2025

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript 2,739 726 Updated Mar 6, 2026

The best ChatGPT that $100 can buy.

Python 49,326 6,459 Updated Mar 17, 2026

🚀 Train a 26M-parameter multimodal vision-language model (VLM) from scratch in just 1 hour!

Python 6,784 735 Updated Feb 4, 2026
Python 156 8 Updated Nov 22, 2024

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 317 14 Updated Aug 4, 2025

A source-code walkthrough of the lightweight large language model MiniMind, covering the complete pipeline: tokenizer, RoPE, MoE, KV Cache, pretraining, SFT, LoRA, DPO, and more

815 70 Updated Jun 16, 2025

🚀🚀 Train a small 26M-parameter GPT entirely from scratch in just 2 hours!

Python 41,523 5,016 Updated Feb 6, 2026

Encode and decode audio samples to/from continuous and discrete compressed representations!

Python 106 5 Updated Nov 25, 2025

HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026

Python 4,281 335 Updated Mar 5, 2026

Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"

Python 109 7 Updated Dec 20, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 348 49 Updated Jul 21, 2025

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 295 23 Updated Oct 12, 2025

AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.

Python 34 2 Updated Dec 22, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 6,140 743 Updated Mar 13, 2026

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

1,019 85 Updated Dec 15, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,808 69 Updated Feb 25, 2026

A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation

Python 281 15 Updated Feb 5, 2026