Research
My main research question is: "how to build generalist robots that can be applied to any tasks?".
I believe offline reinforcement learning with large-scale data is a promising approach, but many real-world robotic datasets lack reward annotations.
To address this, I am interested in reward-free learning from offline data, especially through offline and offline-to-online RL, unsupervised RL, self-supervised RL, and world models.
|
|
|
Learning Social Navigation from Positive and Negative Demonstrations and Rule-Based Specifications
Chanwoo Kim,
Jihwan Yoon,
Hyeonseong Kim,
Taemoon Jeong,
Changwoo Yoo,
Seungbeen Lee,
Soohwan Byeon,
Hoon Chung,
Matthew Pan,
Jean Oh,
Kyungjae Lee,
Sungjoon Choi
ICRA, 2026  
project page
/
arXiv
Learning a density-based reward from positive and negative demonstrations augmented with rule-based safety constraints to balance adaptability and reliability.
|
|