Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
🏡 Open source home automation that puts local control and privacy first.
The official gpt4free repository | a collection of various powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Generate high-definition short videos with one click using large AI models.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Developer-first error tracking and performance monitoring
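Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments.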
A generative speech model for daily dialogue.
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time.
🚀🚀 [LLM] Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Build Real-Time Knowledge Graphs for AI Agents
Faster Whisper transcription with CTranslate2
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
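🚀 Douyin_TikTok_Download_API is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls and online batch parsing and downloading.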
A collaborative note-taking, wiki and documentation platform that scales. Built with Django and React.
Wan: Open and Advanced Large-Scale Video Generative Models
AgentScope: Agent-Oriented Programming for Building LLM Applications