Lists (9)
Sort Name ascending (A-Z)
Starred repositories
Agentic Design Patterns: A Hands-On Guide to Building Intelligent Systems by Antonio Gulli
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners.
A GitHub Action invoking the Gemini CLI.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
autogenhub / autogen
Forked from microsoft/autogenA programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ
🚀 DeepSeek-V3 & R1大模型逆向API【特长:良心厂商】(官方贼便宜,建议直接走官方),支持高速流式输出、多轮对话,联网搜索,R1深度思考,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
Train transformer language models with reinforcement learning.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
C0untFloyd / bark-gui
Forked from suno-ai/bark🔊 Text-Prompted Generative Audio Model with Gradio
Soft speech units for voice conversion
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT
A natural language interface for computers
Devika is the first open-source implementation of an Agentic Software Engineer. Initially started as an open-source alternative to Devin.
Label, clean and enrich text datasets with LLMs.
Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc.…

