Lists (5)
Sort Name ascending (A-Z)
Stars
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Paper list for Efficient Reasoning.
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
[ICLR 2025] NextBestPath: Efficient 3D Mapping of Unseen Environments
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization
Enjoy the magic of Diffusion models!
Collection of Composed Image Retrieval (CIR) papers.
HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025
A curated list of awesome model based RL resources (continually updated)
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Tactile Sensing • Simulation • Representation • Manipulation • IL/RL/VLA • Open Source
An open-source library for GPU-accelerated robot learning and sim-to-real transfer.
A generative world for general-purpose robotics & embodied AI learning.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
[CoRL 2024 Outstanding Paper Award Finalist] Equivariant Diffusion Policy
Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."
A modern, high customizable, responsive Jekyll theme for documentation with built-in search.
python tools to work with habitat-sim environment.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

