Stars
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Command-line program to download videos from YouTube.com and other video sites
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
A generative speech model for daily dialogue.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Real-time face swap for PC streaming or video calls
State-of-the-art 2D and 3D Face Analysis Project
SoftVC VITS Singing Voice Conversion
Industry leading face manipulation platform
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Official inference repo for FLUX.1 models
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Faster Whisper transcription with CTranslate2
DeepFaceLab is the leading software for creating deepfakes.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
WebUI extension for ControlNet
Janus-Series: Unified Multimodal Understanding and Generation Models
An API wrapper for Discord written in Python.