Lists (1)
Sort Name ascending (A-Z)
Stars
Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. 基于 FunASR 与 Qwen3-ASR 的语音识别 API 服务,支持 52 种语言,兼容 Open…
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
A high-throughput and memory-efficient inference and serving engine for LLMs
LLMs-from-scratch项目中文翻译
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
A Prometheus exporter for Celery metrics
An MCP (Model Context Protocol) server for data transformation and BI charts will allow AI assistants to connect to your data sources, transform data, and generate high-quality visualizations throu…
数读 是一件可以让你的大模型安全连接到数据库的MCP工具。| DButils is an all-in-one MCP service that enables your AI to do data analysis by harnessing versatile types of database (sqlite, mysql, postgres, and more) within a u…
这是一个简单的语音对话玩具改造项目,选择了孩子的故事机玩具进行改造,目的是任何人可以自己动手快速打造出个性化的AI语音交互玩具,创造更多可能。并因此推出了衍生的改造硬件套件
ASCII generator (image to text, image to image, video to video)
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Voice Understanding Model
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
A generative speech model for daily dialogue.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
