GAPF: Curve Text Detection based on Generative Adversarial Networks and Pixel Fluctuations

Jun Yang, Zhaogong Zhang, Xuexia Wang

Abstract

Scene text detection has witnessed rapid progress especially with the recent development of convolutional neural networks. However, curved text detection is still a difficult problem that has not been addressed sufficiently. Presently, the most advanced method is based on segmentation to detect curved text. However, most segmentation algorithms based on convolutional neural networks have the problem of inaccurate segmentation results. In order to improve the effect of image segmentation, we propose a semantic segmentation network model based on generative adversarial networks and pixel fluctuations, denoted as GAPF; which is able to effectively improve the accuracy of text segmentation. The model consists of two parts: the generative model and the discriminative model. The main function of the generative model is to generate semantic segmentation graph, and then the discriminative model and generative model perform adversarial learning, which optimize the generative model to make the generated image closer to the ground truth. In this paper, the information about pixel fluctuations numbers is input into the generative network as the segmentation condition to enhance the invariance of translation and rotation. Finally, a text boundary generation algorithm for text is designed, and the final detection result is obtained from the segmentation result. Experimental results on CTW1500, Total-Text, ICDAR 2015 and MSRA-TD500 demonstrate the effectiveness of our work.

Download


Paper Citation


in Harvard Style

Yang J., Zhang Z. and Wang X. (2021). GAPF: Curve Text Detection based on Generative Adversarial Networks and Pixel Fluctuations.In Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, ISBN 978-989-758-488-6, pages 545-552. DOI: 10.5220/0010298905450552


in Bibtex Style

@conference{visapp21,
author={Jun Yang and Zhaogong Zhang and Xuexia Wang},
title={GAPF: Curve Text Detection based on Generative Adversarial Networks and Pixel Fluctuations},
booktitle={Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP,},
year={2021},
pages={545-552},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010298905450552},
isbn={978-989-758-488-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP,
TI - GAPF: Curve Text Detection based on Generative Adversarial Networks and Pixel Fluctuations
SN - 978-989-758-488-6
AU - Yang J.
AU - Zhang Z.
AU - Wang X.
PY - 2021
SP - 545
EP - 552
DO - 10.5220/0010298905450552