Content area

Abstract

Since convolutional neural networks(CNNs) were applied to scene text detection, the accuracy of text detection has been improved a lot. However, limited by the receptive fields of regular CNNs and due to the large scale variations of texts in images, current text detection methods may fail to detect some texts well when dealing with more challenging text instances, such as arbitrarily shaped texts and extremely small texts. In this paper, we propose a new segmentation based scene text detector, which is equipped with deformable convolution and global channel attention. In order to detect texts of arbitrary shapes, our method replaces traditional convolutions with deformable convolutions, the sampling locations of deformable convolutions are deformed with augmented offsets so that it can better adapt to any shapes of texts, especially curved texts. To get more representative features for texts, an Adaptive Feature Selection module is introduced to better exploit text content through global channel attention. Meanwhile, a scale-aware loss, which adjusts the weights of text instances with different sizes, is formulated to solve the text scale variation problem. Experiments on several standard benchmarks, including ICDAR2015, SCUT-CTW1500, ICDAR2017-MLT and MSRA-TD500 verify the superiority of the proposed method.

Details

Title
Scene text detection by adaptive feature selection with text scale-aware loss
Author
Wu, Qin 1   VIAFID ORCID Logo  ; Luo Wenli 2 ; Chai Zhilei 1 ; Guo Guodong 3 

 Jiangnan University, Department of Computer Science, Wuxi, China (GRID:grid.258151.a) (ISNI:0000 0001 0708 1323); Jiangnan University, Jiangsu Provincial Engineerinig Laboratory of Pattern Recognition and Computational Intelligence, Wuxi, China (GRID:grid.258151.a) (ISNI:0000 0001 0708 1323) 
 Jiangnan University, Department of Computer Science, Wuxi, China (GRID:grid.258151.a) (ISNI:0000 0001 0708 1323) 
 West Virginia University, Department of Computer Science and Electrical Engineering, Morgantown, USA (GRID:grid.268154.c) (ISNI:0000 0001 2156 6140) 
Pages
514-529
Publication year
2022
Publication date
Jan 2022
Publisher
Springer Nature B.V.
ISSN
0924669X
e-ISSN
1573-7497
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2619611832
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021.