To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

Chai, Yuning; Sun, Pei; Ngiam, Jiquan; Wang, Weiyue; Caine, Benjamin; Vasudevan, Vijay; Zhang, Xiao; Anguelov, Dragomir

Computer Science > Computer Vision and Pattern Recognition

arXiv:2106.13381 (cs)

[Submitted on 25 Jun 2021]

Title:To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

Authors:Yuning Chai, Pei Sun, Jiquan Ngiam, Weiyue Wang, Benjamin Caine, Vijay Vasudevan, Xiao Zhang, Dragomir Anguelov

View PDF

Abstract:3D object detection is vital for many robotics applications. For tasks where a 2D perspective range image exists, we propose to learn a 3D representation directly from this range image view. To this end, we designed a 2D convolutional network architecture that carries the 3D spherical coordinates of each pixel throughout the network. Its layers can consume any arbitrary convolution kernel in place of the default inner product kernel and exploit the underlying local geometry around each pixel. We outline four such kernels: a dense kernel according to the bag-of-words paradigm, and three graph kernels inspired by recent graph neural network advances: the Transformer, the PointNet, and the Edge Convolution. We also explore cross-modality fusion with the camera image, facilitated by operating in the perspective range image view. Our method performs competitively on the Waymo Open Dataset and improves the state-of-the-art AP for pedestrian detection from 69.7% to 75.5%. It is also efficient in that our smallest model, which still outperforms the popular PointPillars in quality, requires 180 times fewer FLOPS and model parameters

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.13381 [cs.CV]
	(or arXiv:2106.13381v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2106.13381
Journal reference:	CVPR 2021

Submission history

From: Pei Sun [view email]
[v1] Fri, 25 Jun 2021 01:27:26 UTC (3,680 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yuning Chai
Pei Sun
Jiquan Ngiam
Weiyue Wang
Vijay Vasudevan

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators