分割
人工智能
模式识别(心理学)
计算机科学
不变(物理)
比例(比率)
相似性(几何)
方向(向量空间)
数字图像
计算机视觉
数学
图像(数学)
图像处理
几何学
地图学
地理
数学物理
作者
Huan Yang,Shiqian Wu,Chenwei Deng,Weisi Lin
出处
期刊:IEEE transactions on cybernetics
[Institute of Electrical and Electronics Engineers]
日期:2014-06-27
卷期号:45 (3): 519-533
被引量:33
标识
DOI:10.1109/tcyb.2014.2330657
摘要
Many recent applications require text segmentation for born-digital compound images. To this end, we propose a coarse-to-fine framework for segmenting texts of arbitrary scales and orientations in born-digital compound images. In the coarse stage, the local image activity measure is designed based upon the variation distribution of characters, to highlight the difference between textual and pictorial regions. This stage outputs a coarse textual layer including textual regions as well as a few pictorial regions with high activity. In the fine stage, a textual connected component (TCC) based refinement is proposed to eliminate the survived pictorial regions. In particular, a scale and orientation invariant grouping algorithm is proposed to adaptively generate TCCs with uniform statistical features. The minimum average distance and morphological operations are employed to assist the formation of candidate TCCs. Then, three string-level features (i.e., shapeness, color similarity, and mean activity level) are designed to distinguish the true TCCs from the false positive ones that are formed by connecting the high activity pictorial components. Extensive experiments show that the proposed framework can segment textual regions precisely from born-digital compound images, while preserving the integrity of texts with varied scales and orientations, and avoiding over-connection of textual regions.
科研通智能强力驱动
Strongly Powered by AbleSci AI