chenyiminch

Yimin Chen chenyiminch

SenseTime Research; City University of Hong Kong; Huazhong University of Science and Techonology;

10 followers · 16 following

SenseTime Research
Shenzhen

Stars

microsoft / autogen

A programming framework for agentic AI

Python 53,259 8,085 Updated Oct 8, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,823 2,092 Updated Jan 7, 2026

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,359 1,496 Updated Jan 7, 2026

Juzezhang / language_of_motion

This repository contains the official implementation of "The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion".

Python 69 7 Updated Oct 20, 2025

ByteDance-Seed / Depth-Anything-3

Depth Anything 3

Python 3,880 338 Updated Dec 12, 2025

facebookresearch / sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,413 238 Updated Dec 19, 2025

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,388 180 Updated Jan 6, 2026

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,734 4,356 Updated Jan 7, 2026

Zilize / awesome-text-to-motion

Text-driven human motion generation surveys, datasets and models.

TypeScript 56 2 Updated Aug 17, 2025

OpenMotionLab / MotionGPT3

MotionGPT3: Human Motion as a Second Modality, a MoT-based framework for unified motion understanding and generation

Python 162 13 Updated Nov 28, 2025

zju3dv / GVHMR

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 1,286 140 Updated Jul 14, 2025

apple / ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,162 393 Updated Apr 21, 2025

OpenMotionLab / MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,838 135 Updated Jul 1, 2025

shubham-goel / 4D-Humans

4DHumans: Reconstructing and Tracking Humans with Transformers

Python 1,509 146 Updated May 17, 2024

akanazawa / hmr

Project page for End-to-end Recovery of Human Shape and Pose

Python 1,645 396 Updated Jul 10, 2023

mkocabas / VIBE

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Python 3,131 572 Updated Mar 24, 2023

google-deepmind / mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 11,609 1,273 Updated Jan 7, 2026

IFL-CAMP / easy_handeye

Automated, hardware-independent Hand-Eye Calibration

Python 1,094 238 Updated Nov 30, 2025

real-stanford / universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 1,190 221 Updated Jul 21, 2025

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 9,695 1,597 Updated Jan 7, 2026

Physical-Intelligence / openpi

Python 9,697 1,308 Updated Dec 27, 2025

ARISE-Initiative / robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Python 1,237 329 Updated Nov 10, 2025

real-stanford / diffusion_policy

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Python 3,605 655 Updated Dec 24, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,148 5,894 Updated Aug 16, 2024

invoke-ai / InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,526 2,756 Updated Jan 6, 2026

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,370 6,667 Updated Jan 7, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,021 1,106 Updated Jan 7, 2026

TapXWorld / ChinaTextbook

所有小初高、大学PDF教材。

Roff 63,832 14,198 Updated Oct 18, 2025

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,159 7,916 Updated Jan 7, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,613 1,493 Updated Jan 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly