Authors:
Luis Gustavo Lorgus Decker
1
;
Allan da Silva Pinto
1
;
Jose Luis Flores Campana
1
;
Manuel Cordova Neira
1
;
Andreza A. dos Santos
1
;
Jhonatas S. Conceição
1
;
Marcus A. Angeloni
2
;
Lin Tzy Li
2
and
Ricardo da S. Torres
3
Affiliations:
1
RECOD Lab., Institute of Computing, University of Campinas, 13083-852, Brazil
;
2
AI R&D Lab, Samsung R&D Institute Brazil, 13097-160, Brazil
;
3
Department of ICT and Natural Sciences, Norwegian University of Science and Technology (NTNU), Ålesund, Norway
Keyword(s):
Scene Text Detection, Mobile Devices, Object Detector Networks, MobileNetV2, Single Shot Detector.
Abstract:
Multiple research initiatives have been reported to yield highly effective results for the text detection problem. However, most of those solutions are very costly, which hamper their use in several applications that rely on the use of devices with restrictive processing power, like smartwatches and mobile phones. In this paper, we address this issue by investigating the use of efficient object detection networks for this problem. We propose the combination of two light architectures, MobileNetV2 and Single Shot Detector (SSD), for the text detection problem. Experimental results in the ICDAR’11 and ICDAR’13 datasets demonstrate that our solution yields the best trade-off between effectiveness and efficiency and also achieved the state-of-the-art results in the ICDAR’11 dataset with an f-measure of 96.09%.