Skip to content
View chenyiminch's full-sized avatar
  • SenseTime Research
  • Shenzhen

Block or report chenyiminch

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A programming framework for agentic AI

Python 53,259 8,085 Updated Oct 8, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,823 2,092 Updated Jan 7, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,359 1,496 Updated Jan 7, 2026

This repository contains the official implementation of "The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion".

Python 69 7 Updated Oct 20, 2025

Depth Anything 3

Python 3,880 338 Updated Dec 12, 2025

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,413 238 Updated Dec 19, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,388 180 Updated Jan 6, 2026

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,734 4,356 Updated Jan 7, 2026

Text-driven human motion generation surveys, datasets and models.

TypeScript 56 2 Updated Aug 17, 2025

MotionGPT3: Human Motion as a Second Modality, a MoT-based framework for unified motion understanding and generation

Python 162 13 Updated Nov 28, 2025

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 1,286 140 Updated Jul 14, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,162 393 Updated Apr 21, 2025

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,838 135 Updated Jul 1, 2025

4DHumans: Reconstructing and Tracking Humans with Transformers

Python 1,509 146 Updated May 17, 2024

Project page for End-to-end Recovery of Human Shape and Pose

Python 1,645 396 Updated Jul 10, 2023

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Python 3,131 572 Updated Mar 24, 2023

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 11,609 1,273 Updated Jan 7, 2026

Automated, hardware-independent Hand-Eye Calibration

Python 1,094 238 Updated Nov 30, 2025

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 1,190 221 Updated Jul 21, 2025

Open Source framework for voice and multimodal conversational AI

Python 9,695 1,597 Updated Jan 7, 2026

robomimic: A Modular Framework for Robot Learning from Demonstration

Python 1,237 329 Updated Nov 10, 2025

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Python 3,605 655 Updated Dec 24, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,148 5,894 Updated Aug 16, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,526 2,756 Updated Jan 6, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,370 6,667 Updated Jan 7, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,021 1,106 Updated Jan 7, 2026

所有小初高、大学PDF教材。

Roff 63,832 14,198 Updated Oct 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,159 7,916 Updated Jan 7, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,613 1,493 Updated Jan 4, 2026
Next