计算机科学
最小边界框
旋转(数学)
文本检测
跳跃式监视
人工智能
联营
分类器(UML)
方向(向量空间)
计算机视觉
建筑
模式识别(心理学)
图像(数学)
几何学
数学
艺术
视觉艺术
作者
Jianqi Ma,Weiyuan Shao,Yue Hao,Li Wang,Hong Wang,Yingbin Zheng,Xiangyang Xue
标识
DOI:10.1109/tmm.2018.2818020
摘要
This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI