-
SNU AI
- Seoul, South Korea
Highlights
- Pro
Stars
Isaac Sim/Lab in AWS, Azure, Google Cloud, Alibaba Cloud
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning
TidyBot: Personalized Robot Assistance with Large Language Models
[ICRA 2025] PyTorch Code for Local Policies Enable Zero-shot Long-Horizon Manipulation
A generative and self-guided robotic agent that endlessly propose and master new skills.
Official Algorithm Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities"
BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
[ICCV 2025] MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation
Towards Long-Horizon Vision-Language-Action System: Reasoning, Acting and Memory
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Infinite Photorealistic Worlds using Procedural Generation
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
LongLive: Real-time Interactive Long Video Generation
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
verl: Volcano Engine Reinforcement Learning for LLMs
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
AIM-Intelligence / SUDO
Forked from jiankimr/SUDO🤖 "sudo rm -rf agentic_security" – Investigating computer-use agent security
We created a custom LED eye module that has expressions and is able to follow (gaze objects) using the Pan-tilt head camera.
