Lists (1)
Sort Name ascending (A-Z)
Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University
一个基于nano banana pro🍌的原生AI PPT生成应用,迈向真正的"Vibe PPT"; 支持上传任意模板图片;上传任意素材&智能解析;一句话/大纲/页面描述自动生成PPT;口头修改指定区域、一键导出可编辑ppt - An AI-native slides generator based on nano banana pro🍌
Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide
Conversational AI cookbook for developers — exploring real-time voice agents, streaming, and orchestration. 对话式 AI 开发者手册:探索实时语音、编排与工程实践。
Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web search
Multilingual Voice Understanding Model
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual environments.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A single hub to find Claude Skills, Agents, Commands, Hooks, Plugins, and Marketplace collections to extend Claude Code, Claude Desktop, Agent SDK and OpenClaw
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A Survey of Reinforcement Learning for Large Reasoning Models
Next-gen AI+IoT framework for T2/T3/T5AI/ESP32/and more – Fast IoT and AI Agent hardware integration
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Open-source framework for conversational voice AI agents
Codebase for Berkeley Humanoid Lite
Memory for 24/7 proactive agents like openclaw (moltbot, clawdbot).