MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices

Chen, Bo; Ghiasi, Golnaz; Liu, Hanxiao; Lin, Tsung-Yi; Kalenichenko, Dmitry; Adams, Hartwig; Le, Quoc V.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.01106 (cs)

[Submitted on 2 Dec 2019 (v1), last revised 30 Jul 2020 (this version, v2)]

Title:MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices

Authors:Bo Chen, Golnaz Ghiasi, Hanxiao Liu, Tsung-Yi Lin, Dmitry Kalenichenko, Hartwig Adams, Quoc V. Le

View PDF

Abstract:Despite the blooming success of architecture search for vision tasks in resource-constrained environments, the design of on-device object detection architectures have mostly been manual. The few automated search efforts are either centered around non-mobile-friendly search spaces or not guided by on-device latency. We propose MnasFPN, a mobile-friendly search space for the detection head, and combine it with latency-aware architecture search to produce efficient object detection models. The learned MnasFPN head, when paired with MobileNetV2 body, outperforms MobileNetV3+SSDLite by 1.8 mAP at similar latency on Pixel. It is also both 1.0 mAP more accurate and 10% faster than NAS-FPNLite. Ablation studies show that the majority of the performance gain comes from innovations in the search space. Further explorations reveal an interesting coupling between the search space design and the search algorithm, and that the complexity of MnasFPN search space may be at a local optimum.

Comments:	10 pages, 7 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.01106 [cs.CV]
	(or arXiv:1912.01106v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.01106

Submission history

From: Bo Chen [view email]
[v1] Mon, 2 Dec 2019 22:42:43 UTC (593 KB)
[v2] Thu, 30 Jul 2020 18:22:02 UTC (1,213 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Bo Chen
Golnaz Ghiasi
Hanxiao Liu
Tsung-Yi Lin
Dmitry Kalenichenko

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators