Stars
PaperBanana: Automating Academic Illustration For AI Scientists
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
[ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer
[ICCV 2023 & ICLR 2026] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
A brief introduction to the quaternions and its applications in 3D geometry.
Cross-view transformers for multi-view analysis of unregistered medical images.
Calibrate the extrinsic parameters between Livox LiDAR and camera
A Robust LiDAR-Inertial Odometry for Livox LiDAR
Point Cloud Registration using Representative Overlapping Points. https://arxiv.org/abs/2107.02583.
Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition (EPC-Net)
FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image
Light-weight Deformable Registration using Adversarial Learning with Distilling Knowledge (IEEE Transactions on Medical Imaging 2021))
Master thesis project realized in partnership with Aalto University and Finnish Geospatial Research Institute of Finland. Stereo Camera-LiDAR calibration. Covers: mono and stereo camera calibration…
This is a package for LiDARTag, described in paper: LiDARTag: A Real-Time Fiducial Tag System for Point Clouds
CNN's for bone segmentation of CT-scans.
Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features
Visual localization made easy with hloc
CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for 3D Point Clouds
Code release for "learning to find good correspondences" CVPR 2018
ETH-Microsoft dataset for the ICCV 2021 visual localization challenge
