Lists (22)
Sort Name ascending (A-Z)
Starred repositories
A WebUI app for Music-Source-Separation-Training and we packed UVR together!
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
Z-Image workflow with predefined styles for high-quality image generation and a user-friendly experience. Includes pre-configured versions for GGUF and SAFETENSORS checkpoint formats.
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
A fundamental toolkit designed for music, song, and audio generation
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Prompt Manager for ComfyUI, with integration with llama.cpp for prompt generation. Allowing users to generate and save prompts.
🐍 Community-driven Python implementation of TOON
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
The PyTorch-based audio source separation toolkit for researchers
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Noise supression using deep filtering
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Nano Banana Pro 全网最全提示词整理
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
An image annotation/editing tool used on Wayland 一个在wayland 上使用的图片标注/编辑工具
A simple memory system for claude code
ComfyUI-AI-Photography-Toolkit
红墨 - 基于🍌Nano Banana Pro🍌 的一站式小红书图文生成器 《一句话一张图片生成小红书图文》 Red Ink - A one-stop Xiaohongshu image-and-text generator based on the 🍌Nano Banana Pro🍌, "One Sentence, One Image: Generate Xiaohongshu Text …
Normal & height maps generation from single pictures
🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
Qwen-Image text to image lora trainer