Lists (32)
ALT: Automatic Lyrics Transcription
ASR
Attention
Audio Separation: Speech and Music Separation
Audio Synthesis
Biology
Computer Vision: Computer Vision Tasks
Continual Learning
Data Engineering
Data Testing
Datasets
Federated Learning
Finance
FrontEnd
GNN
Hugo
k8s
Knowledge Graph
ML
MLOps
MLTemplate
Music
N-shot
NLP
Recommender
Roadmap
Self-Supervised
Synthetic Data
System Design
TTS
VPN
Vue
Starred repositories
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Hierarchical Reasoning Model Official Release
Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
OpenStock is an open-source alternative to expensive market platforms. Track real-time prices, set personalized alerts, and explore detailed company insights — built openly, for everyone, forever f…
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A rigorous framework for evaluating and guiding the development of next-generation AI assistants.
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
💼 Your own AI-powered voice interviewer for hiring.
Sample application to add voice capabilities to the Agents SDK
Official inference framework for 1-bit LLMs
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
"DeepTutor: AI-Powered Personalized Learning Assistant"
SALMONN family: A suite of advanced multi-modal LLMs
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)
A Foundation Model for Generalist Gaming Agents
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.
Readymade evaluators for agent trajectories
LangSmith Client SDK Implementations