Abstract

Many modern approaches for object detection are two-staged pipelines. The first stage identifies regions of interest which are then classified in the second stage. Faster R-CNN is such an approach for object detection which combines both stages into a single pipeline. In this paper we apply Faster R-CNN to the task of company logo detection. Motivated by its weak performance on small object instances, we examine in detail both the proposal and the classification stage with respect to a wide range of object sizes. We investigate the influence of feature map resolution on the performance of those stages. Based on theoretical considerations, we introduce an improved scheme for generating anchor proposals and propose a modification to Faster R-CNN which leverages higher-resolution feature maps for small objects. We evaluate our approach on the FlickrLogos dataset improving the RPN performance from 0.52 to 0.71 (MABO) and the detection performance from 0.52 to 0.67 (mAP).

Comment: 8 Pages, ICMR 2017


Original document

The different versions of the original document can be found in:

http://dx.doi.org/10.1145/3078971.3078990 under the license http://www.acm.org/publications/policies/copyright_policy#Background
https://arxiv.org/pdf/1704.08881.pdf,
https://arxiv.org/abs/1704.08881,
https://dl.acm.org/citation.cfm?id=3078990,
https://export.arxiv.org/pdf/1704.08881,
https://ui.adsabs.harvard.edu/abs/2017arXiv170408881E/abstract,
https://doi.acm.org/10.1145/3078971.3078990,
https://export.arxiv.org/abs/1704.08881,
https://core.ac.uk/display/83853850,
https://academic.microsoft.com/#/detail/2610420510
https://doi.org/10.1145/3078971.3078990
Back to Top

Document information

Published on 01/01/2017

Volume 2017, 2017
DOI: 10.1145/3078971.3078990
Licence: Other

Document Score

0

Views 4
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?