An Analysis of Scale Invariance in Object Detection - SNIP

Singh, Bharat; Davis, Larry S.

arXiv:1711.08189 (cs)

[Submitted on 22 Nov 2017 (v1), last revised 25 May 2018 (this version, v2)]

Abstract: An analysis of different techniques for recognizing and detecting objects under extreme scale variation is presented. Scale-specific and scale-invariant detector designs are compared by training them with different configurations of input data. By evaluating the performance of different network architectures for classifying small objects on ImageNet, we show that CNNs are not robust to changes in scale. Based on this analysis, we propose to train and test detectors on the same scales of an image pyramid. Since small and large objects are difficult to recognize at smaller and larger scales respectively, we present a novel training scheme called Scale Normalization for Image Pyramids (SNIP), which selectively back-propagates the gradients of object instances of different sizes as a function of the image scale. On the COCO dataset, our single model achieves an mAP of 45.7% and an ensemble of 3 networks obtains an mAP of 48.3%. We use off-the-shelf ImageNet-1000 pre-trained models and train with bounding box supervision only. Our submission won the Best Student Entry in the COCO 2017 challenge. Code will be made available at \url{this http URL}.
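
The selective back-propagation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (snip_valid_mask, masked_loss), the pyramid scales, and the valid size ranges below are all hypothetical values chosen for illustration, and object size is assumed to be measured as the square root of the box area after resizing. The idea shown is only that, at each pyramid scale, instances whose resized size falls outside a chosen range are excluded from the loss and therefore contribute no gradient.

```python
import math
from typing import List, Sequence, Tuple

# (x1, y1, x2, y2) in original-image pixels.
Box = Tuple[float, float, float, float]


def snip_valid_mask(
    boxes: Sequence[Box], scale: float, valid_range: Tuple[float, float]
) -> List[bool]:
    """Mark boxes whose size after resizing by `scale` lies in `valid_range`.

    Size is taken as sqrt(width * height) of the resized box (an assumption
    made for this sketch).
    """
    lo, hi = valid_range
    mask = []
    for x1, y1, x2, y2 in boxes:
        width = max(x2 - x1, 0.0)
        height = max(y2 - y1, 0.0)
        size = math.sqrt(width * height) * scale
        mask.append(lo <= size <= hi)
    return mask


def masked_loss(per_box_losses: Sequence[float], mask: Sequence[bool]) -> float:
    """Average the loss over valid boxes only.

    Boxes with mask == False are dropped, so in a real training loop they
    would contribute no gradient at this pyramid scale.
    """
    kept = [loss for loss, keep in zip(per_box_losses, mask) if keep]
    return sum(kept) / len(kept) if kept else 0.0


# Hypothetical pyramid: (resize factor, valid size range in resized pixels).
# These numbers are illustrative only and are not taken from the paper.
PYRAMID = [
    (0.5, (60.0, float("inf"))),  # low resolution: keep only large objects
    (1.0, (40.0, 160.0)),         # medium resolution: keep mid-sized objects
    (2.0, (0.0, 80.0)),           # high resolution: keep only small objects
]

# One small, one medium, and one large box.
boxes = [(0, 0, 20, 20), (0, 0, 100, 100), (0, 0, 400, 400)]
masks = [snip_valid_mask(boxes, scale, rng) for scale, rng in PYRAMID]

for (scale, _), mask in zip(PYRAMID, masks):
    print(scale, mask)
```

With these made-up ranges, each object is trained on at exactly one pyramid level, the one at which its resized size is moderate. Real ranges would normally overlap and would be tuned per scale.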

Comments:	CVPR 2018, camera ready version
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1711.08189 [cs.CV]
	(or arXiv:1711.08189v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1711.08189

Submission history

From: Bharat Singh
[v1] Wed, 22 Nov 2017 09:30:06 UTC (2,203 KB)
[v2] Fri, 25 May 2018 12:47:23 UTC (1,240 KB)
