PointNet

1. 개요

https://adioshun.gitbooks.io/paper-3d-object-detection-and-tracking/content/2017-pointnet-deep-learning-on-point-sets-for-3d-classification-and-segmentation.html

2. 설치

활용 데이터넷

  • 학습 : Semantic3D

  • 추론 : Semantic3D + KITTI

During both training and inference, PointNet++ is fed with fix-sized cropped point clouds within boxes, we set the box size to be 60m x 20m x Inf, with the Z-axis allowing all values. During inference with KITTI, we set the region of interest to be 30m in front and behind the car, 10m to the left and right of the car center to fit the box size. This allows the PointNet++ model to only predict one sample per frame.

차별점 : In PointNet++’s set abstraction layer, the original points are subsampled, and features of the subsampled points must be propagated to all of the original points by interpolation

  • 기존 : This is achieved by 3-nearest neighbors search (called ThreeNN)---

  • 변경 : Open3D uses FLANN to build KDTrees for fast retrieval of nearest neighbors, which can be used to accelerate the ThreeNN op.

1. 수행 절차

1.1 Post processing: accelerating label interpolation

Inference on sparse pointcloud (KITTI)

Inference results after interpolation

The sparse labels need to be interpolated to generate labels for all input points. This interpolation can be achieved with nearest neighbor search using open3d.KDTreeFlann and majority voting, similar to what we did above in the ThreeNN op.

보간(interpolation)작업은 전체 소요 시간의 90%를 차지하고, 1FPS의 속도를 보인다. 해결을 위해 custom TensorFlow C++ op InterploateLabel를 적용하여 10+FPS속도 향상을 보였다.

2. Install

권장 환경 : ubuntu 16.04, cuda 9.0, python3, tensorflow 1.2 (1.8권장), open3d 0.6+

wget http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_9.0.176-1_amd64.deb
sudo apt-key adv --fetch-keys http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/7fa2af80.pub
sudo dpkg -i cuda-repo-ubuntu1604_9.0.176-1_amd64.deb 
sudo apt-get update
sudo apt-get install cuda-9-0

2.1 Docker

2.2 Code

build TF ops. You’ll need CUDA and CMake 3.8+.

cd tf_ops
mkdir build
cd build
cmake -DCMAKE_C_COMPILER=/usr/bin/gcc ..
make
cd ./dataset
ln -s /media/adioshun/data/datasets/semantic3D/ ./semantic_raw

Last updated