This repository implements an unsupervised KD-Tree DBSCAN clustering algorithm for LiDAR point cloud instance segmentation from scratch. Besides DBSCAN, the algorithm families compared in the table below could be used as well. DBSCAN is a clustering-based algorithm, and according to Comparing Different Sklearn Clustering Algorithms On Toy Datasets, it is one of the best-performing clustering algorithms. Other reasons we selected DBSCAN are:
- Cluster discovery: DBSCAN can find clusters of any shape and size, including non-linear shapes.
- Noise identification: DBSCAN can identify noise data while clustering.
- Cluster density separation: DBSCAN is good at separating high-density clusters from low-density clusters.
- Cluster number specification: DBSCAN doesn't require the number of clusters to be specified in advance.
| Algorithm | Advantages | Disadvantages |
|---|---|---|
| Deep Learning-Based Segmentation | Can learn complex patterns and features in point clouds | Requires a large amount of labeled data for training |
| | Highly accurate and can handle noisy data | Can be computationally expensive and require powerful hardware |
| Clustering-Based Segmentation | Simple and easy to implement | May struggle with complex geometries and irregular shapes |
| | Efficient for finding clusters in dense point clouds | Sensitive to hyperparameters and starting conditions |
| Region Growing-Based Segmentation | Can handle noise and outliers effectively | May be sensitive to initial seed points and parameters |
| | Robust to varying densities and shapes | Can be computationally expensive for large point clouds |
| Supervoxel-Based Segmentation | Can effectively group points into compact and homogeneous regions | May struggle with separating instances with overlapping supervoxels |
| | Can provide a more meaningful representation of point clouds | Sensitive to supervoxel size and parameters |
| Graph-Based Segmentation | Can capture relationships between points and model complex structures | Computational complexity and memory requirements for constructing graphs |
| | Flexible and adaptable to different types of point cloud data | Difficulty in defining edge weights and graph construction |
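The core of a KD-tree-backed DBSCAN can be sketched as follows. This is an illustrative reimplementation, not the repository's `implementation.py`; it uses `scipy.spatial.cKDTree` for the eps-radius region queries:

```python
import numpy as np
from scipy.spatial import cKDTree

def dbscan(points, eps=0.5, min_pts=5):
    """Label each point with a cluster id (0..k-1); noise points get -1."""
    tree = cKDTree(points)          # KD-tree makes each region query ~O(log n)
    labels = np.full(len(points), -1)
    visited = np.zeros(len(points), dtype=bool)
    cluster_id = 0
    for i in range(len(points)):
        if visited[i]:
            continue
        visited[i] = True
        neighbors = tree.query_ball_point(points[i], eps)
        if len(neighbors) < min_pts:
            continue                 # noise for now; a later cluster may claim it
        labels[i] = cluster_id       # i is a core point: start a new cluster
        seeds = list(neighbors)
        while seeds:                 # expand the cluster over density-reachable points
            j = seeds.pop()
            if labels[j] == -1:
                labels[j] = cluster_id   # border or previously-noise point
            if visited[j]:
                continue
            visited[j] = True
            j_neighbors = tree.query_ball_point(points[j], eps)
            if len(j_neighbors) >= min_pts:
                seeds.extend(j_neighbors)   # j is also a core point: keep expanding
        cluster_id += 1
    return labels
```

`query_ball_point` is the only spatial primitive needed, which is why the tree structure (KD-tree vs. Ball Tree) is easy to swap.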
- Install with Python 3.7+:
```shell
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```
- Run:
```shell
python implementation.py
```
Output:
Since we don't have ground truth, we use sklearn's HDBSCAN output as pseudo ground truth to cross-check our implementation, computing the following clustering performance evaluation metrics:
- V-measure: 0.694
- Adjusted Rand Index: 0.094
- Adjusted Mutual Information: 0.477
- Silhouette Coefficient: -0.060
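All four metrics are available in `sklearn.metrics`; a minimal sketch (the label arrays below are toy placeholders, not our actual clustering results):

```python
from sklearn import metrics

# Hypothetical labelings: pseudo ground truth (e.g. from HDBSCAN) vs. our DBSCAN.
labels_true = [0, 0, 1, 1, 2, 2]
labels_pred = [0, 0, 1, 2, 2, 2]

print(metrics.v_measure_score(labels_true, labels_pred))
print(metrics.adjusted_rand_score(labels_true, labels_pred))
print(metrics.adjusted_mutual_info_score(labels_true, labels_pred))

# The Silhouette Coefficient needs the points themselves rather than a
# reference labeling: metrics.silhouette_score(points, labels_pred)
```

All three label-comparison scores are permutation-invariant, so it does not matter that the two runs number their clusters differently.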
We could also use mAP and mIoU for evaluation. Currently, we don't detect the cluster class (such as person, car, etc.). Compared to sklearn's DBSCAN, our implementation is slower. Potential improvements: 1) run DBSCAN on depth only to speed it up; 2) use a Ball Tree for the neighbor queries.
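For the Ball Tree idea, `sklearn.neighbors.BallTree` offers a drop-in replacement for the KD-tree region query; a minimal sketch with toy points:

```python
import numpy as np
from sklearn.neighbors import BallTree

points = np.array([[0.0, 0.0, 0.0],
                   [0.0, 0.0, 0.05],
                   [1.0, 1.0, 1.0]])

tree = BallTree(points)
# Indices of all points within radius r of the first point —
# the same primitive DBSCAN's region query needs.
neighbors = tree.query_radius(points[:1], r=0.1)[0]
```

Ball Trees tend to scale better than KD-trees in higher dimensions; for 3D points the difference is modest, so whether this actually speeds things up should be benchmarked.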
- Evaluate the algorithm with mIoU, Panoptic Quality (PQ), and Segmentation Quality (SQ) with the better pseudo Instance Segmentation ground truth generated by the pre-trained Cylinder3D model.
- Detect the 3D object classes (such as person, car, etc.)
- Ignore ground points
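One common way to ignore ground before clustering is to fit a ground plane with RANSAC and drop its inliers. A pure-NumPy sketch (the function name and thresholds are illustrative assumptions, not part of this repository):

```python
import numpy as np

def ransac_ground_plane(points, n_iters=100, dist_thresh=0.2, seed=0):
    """Fit a plane n·p + d = 0 by RANSAC; return a boolean ground-inlier mask."""
    rng = np.random.default_rng(seed)
    best_mask = np.zeros(len(points), dtype=bool)
    for _ in range(n_iters):
        # Sample 3 points and build the plane through them.
        p1, p2, p3 = points[rng.choice(len(points), 3, replace=False)]
        normal = np.cross(p2 - p1, p3 - p1)
        norm = np.linalg.norm(normal)
        if norm < 1e-9:
            continue                    # degenerate (collinear) sample
        normal /= norm
        d = -normal.dot(p1)
        # Keep the plane with the most inliers within dist_thresh.
        mask = np.abs(points @ normal + d) < dist_thresh
        if mask.sum() > best_mask.sum():
            best_mask = mask
    return best_mask

# Then cluster only the non-ground points:
# non_ground = points[~ransac_ground_plane(points)]
```

Since ground usually dominates a LiDAR sweep, the largest-inlier plane is almost always the ground; a refinement would be to constrain the plane normal to be near-vertical.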
The SOTA 3D instance segmentation algorithms are based on deep learning, so we could generate better pseudo ground truth with a SOTA pre-trained model. For 3D instance and panoptic segmentation, there are two main application scenarios:
- Indoor Scene Understanding: ScanNet is one of the popular benchmark datasets and PointNet++ is one of the SOTA models.
- Outdoor Driving: SemanticKITTI is one of the popular benchmark datasets and Cylinder3D is one of the SOTA outdoor point cloud panoptic segmentation models.
We use Cylinder3D in MMDetection3D to generate the pseudo ground truth:
- Convert the original `[[x1,y1,z1], [x2,y2,z2], ...]` pointcloud_data.npy to SemanticKITTI's `[[x1,y1,z1,intensity1], [x2,y2,z2,intensity2], ...]` binary format pointcloud_data.bin:
```shell
python ./tools/export_xyz_to_bin.py
```
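A minimal sketch of what such a conversion might look like (a hypothetical helper, not necessarily what the repository's `export_xyz_to_bin.py` does): load the `.npy`, append a zero intensity column, and dump as float32, the flat binary layout SemanticKITTI-style tools expect:

```python
import numpy as np

def xyz_npy_to_kitti_bin(npy_path, bin_path):
    """Convert an Nx3 xyz .npy array to a SemanticKITTI-style Nx4 float32 .bin."""
    xyz = np.load(npy_path).astype(np.float32)
    # No real intensity is available, so pad with zeros.
    intensity = np.zeros((xyz.shape[0], 1), dtype=np.float32)
    np.hstack([xyz, intensity]).tofile(bin_path)
```

Reading it back is `np.fromfile(bin_path, dtype=np.float32).reshape(-1, 4)`. Padding intensity with zeros is a simplification; a model trained on real intensity values may degrade somewhat on such input.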
```shell
# switch to the mmdetection3d repository folder after the installation
cd ../mmdetection3d
wget https://download.openmmlab.com/mmdetection3d/v1.1.0_models/cylinder3d/cylinder3d_8xb2-amp-laser-polar-mix-3x_semantickitti_20230425_144950-372cdf69.pth
```
- Infer 3D Semantic Segmentation (Currently, MMDetection3D Cylinder3D only supports Semantic Segmentation)
```shell
python demo/pcd_seg_demo.py pointcloud_data_4d_downsample1.bin ../point_cloud_instance_segmentation/data/cylinder3d_8xb2-laser-polar-mix-3x_semantickitti.py cylinder3d_8xb2-amp-laser-polar-mix-3x_semantickitti_20230425_144950-372cdf69.pth --out-dir outputs
# you can find the output at outputs/preds/pointcloud_data_4d_downsample1.json
```
Cylinder3D Semantic Segmentation Output:
Comparison of Semantic Segmentation, Instance Segmentation, and Panoptic Segmentation:
| Task | Definition | Example |
|---|---|---|
| Semantic Segmentation | Divides an image into segments based on categories or classes (without differentiating individual instances) | Separating pixels that belong to different objects/classes in an image |
| Instance Segmentation | Identifies individual objects within an image and assigns unique labels to each object (ignoring other objects and background). | Distinguishing between different instances of the same object in an image |
| Panoptic Segmentation | Combines semantic segmentation and instance segmentation to provide a holistic understanding of an image (segmenting all pixels and putting into different categories including background). | Provides both semantic segmentation for scene understanding and instance segmentation for object-level understanding in an image |


