A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,227 96 Updated Dec 17, 2025

NVlabs / LongLive

LongLive: Real-time Interactive Long Video Generation

Python 925 63 Updated Dec 4, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 6,088 932 Updated Dec 2, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,620 764 Updated Jun 25, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,820 173 Updated Dec 24, 2025

MasterXiong / Hyper-VLA

Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"

Python 15 1 Updated Oct 8, 2025

zyzkevin / dyva-worldlm

Python 20 3 Updated Nov 18, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,379 1,455 Updated Nov 28, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,765 2,889 Updated Dec 24, 2025

stdstu12 / YUME

Python 347 20 Updated Aug 13, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 538 45 Updated Dec 20, 2025

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 890 840 Updated Jul 4, 2024

OpenHelix-Team / VLA-Adapter

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 1,836 162 Updated Nov 18, 2025

AIM-Intelligence / SUDO

Forked from jiankimr/SUDO

🤖 "sudo rm -rf agentic_security" – Investigating computer-use agent security

Python 9 1 Updated Jul 29, 2025

snumprlab / hima

Official Implementation of HIMA (COLM'25)

Python 17 1 Updated Nov 25, 2025

raff17 / Stretch_Expressive_Eyes

We created a custom LED eye module that has expressions and is able to follow (gaze objects) using the Pan-tilt head camera.

C++ 2 Updated Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Seongwon Cho seongwon980

Achievements

Achievements

Highlights

Block or report seongwon980

Stars

isaac-sim / IsaacAutomator

jimmyyhwu / tidybot2

jimmyyhwu / tidybot

PRIME-RL / P1

mihdalal / manipgen

Genesis-Embodied-AI / RoboGen

hello-robot / stretch_isaacsim

yizhengzhang1 / agent_world

behavior-robot-suite / brs-algo

StanfordVL / BEHAVIOR-1K

MoMaKitchen / MoMaKitchen

trantor2nd / MindExplore

open-webui / open-webui

princeton-vl / infinigen

jonyzhang2023 / awesome-embodied-vla-va-vln