Ming Hu1,2,3 * · Zhengdi Yu *4 · Feilong Tang1,2,3 · Kaiwen Chen5 · Yulong Li3 · Imran Razzak3 · Junjun He2 · Tolga Birdal4 · Kaijing Zhou †5 · Zongyuan Ge †1
1Monash University · 2Shanghai AI Laboratory · 3MBZUAI · 4Imperial College London · 5Eye Hospital, Wenzhou Medical Univeristy
We introduce OphNet-3D, the first large-scale RGB-D dataset for dynamic 3D hand-instrument reconstruction in ophthalmic microsurgery, supported by an efficient multi-stage annotation pipeline, and propose novel architectures (H-Net and OH-Net) that significantly outperform existing methods in accurate hand and instrument reconstruction tasks.
- [2025/5/26] Paper is now available. ⭐
- Release dataset
- Release baseline experimental results and checkpoints
@misc{hu2025ophnet-3d,
title={Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery},
author={Ming Hu and Zhendi Yu and Feilong Tang and Kaiwen Chen and Yulong Li and Imran Razzak and Junjun He and Tolga Birdal and Kaijing Zhou and Zongyuan Ge},
year={2025},
eprint={2505.17677},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2505.17677},
}For any questions, please contact ming.hu@monash.edu or z.yu23@imperial.ac.uk .

