Stars
Official code of Motus: A Unified Latent Action World Model
Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.
This website is for the collection of VLA SOTA results.
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Dexbotic: Open-Source Vision-Language-Action Toolbox
A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
Continuously updated collection of the latest autonomous driving research papers.
A curated list of awesome HD map construction methods
[ICLR2026] Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"
missTL / SeqGrowGraph
Forked from MIV-XJTU/SeqGrowGraphSeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions
[ICCV 2025] Official implementation for "SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions"
📚这个仓库是在arxiv上收集的有关VLN,VLA,World Model,SLAM,Gaussian Splatting,非线性优化等相关论文。每天都会自动更新!issue区域是最新10篇论文
Zhaoyibinn / vggt
Forked from facebookresearch/vggt[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer
An app for collecting raw RGB-D scans on iOS devices.
Application for camera and sensor data logging (iOS)
missTL / FSDrive
Forked from MIV-XJTU/FSDriveThe repository has been moved to https://github.com/MIV-XJTU/FSDrive
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
HairGuard StereoPilot Elastic3D StereoWorld BetterDepth BRIDGE BriGeS ChronoDepth Depth Any Video Depth Anything Depth Pro DepthCrafter Distill Any Depth FE2E GRIN M2SVid MegaSaM Metric3D Metric-So…
Python tools for rendering, viewing and generating metric 3D depth videos. Tools for recovering and exporting camera pose and 3D geometry to popular formats as well as tools for projecting depthvid…
[AAAI 2026] Official implementation for "PriorDrive: Enhancing Online HD Mapping with Unified Vector Priors"
