RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 73,544 8,154 Updated Feb 14, 2026

junjiehe96 / UniPortrait

[ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization

Python 276 13 Updated May 1, 2025

bcmi / libcom

Image composition toolbox: everything you want to know about image composition or object insertion

Python 712 51 Updated Feb 21, 2026

xinsir6 / ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Python 2,114 64 Updated Sep 30, 2024

wenet-e2e / wesep

Target Speaker Extraction Toolkit

Python 245 33 Updated Oct 4, 2025

ToTheBeginning / PuLID

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,521 260 Updated Jul 31, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 95,636 15,031 Updated Feb 23, 2026

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 3,873 408 Updated Oct 17, 2024

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

26,299 2,303 Updated Jul 31, 2025

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,527 306 Updated Nov 5, 2024

PlayVoice / lora-svc

singing voice change based on whisper, and lora for singing voice clone

Python 648 79 Updated Nov 3, 2023

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,311 6,141 Updated Nov 10, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,431 1,706 Updated Jan 30, 2026

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 7,547 703 Updated Dec 30, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,669 2,219 Updated Feb 11, 2026

Tele-AI / TeleSpeech-ASR

Python 834 75 Updated Jun 7, 2024

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Python 5,351 651 Updated Jul 11, 2025

neouyghur / Uyghur-Multi-Script-Converter

This converter converts multiple Uyghur scripts: ULS(Uyghur Latin Script), UAS(Uyghur Arabic Script), CTS(Common Turkick Scritp), UCS(Uyghur Cyrilik Script) and Uyghur Yengi (new) Script.

Python 58 23 Updated Aug 18, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 55,193 6,028 Updated Feb 9, 2026

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,694 1,263 Updated Feb 16, 2026

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,694 792 Updated May 27, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,935 1,580 Updated Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kai kli017

Achievements

Achievements

Block or report kli017

Stars

songguoxs / gpt4o-image-prompts

hacksider / Deep-Live-Cam

bytedance / DreamO

Xiaojiu-z / EasyControl

nftblackmagic / catvton-flux

stepfun-ai / Step-Audio

henbudidiao / UAV-path-planning

zhaohaojie1998 / Path-Planning

infiniflow / ragflow