计算机科学
人工智能
深度学习
模式识别(心理学)
文本检测
文本识别
特征提取
计算机视觉
语音识别
图像(数学)
作者
Sridhar Gujjeti,M. S. Sriram,V. Ganesan
标识
DOI:10.1109/iitcee59897.2024.10467856
摘要
The method of scene text detection and recognition is useful in a variety of applications ranging from multimedia retrieval, traffic monitoring, and navigation for visually impaired individuals to semantic natural scene understanding. Many researchers have worked hard to create algorithms for detecting and recognizing text in photos and movies. The three major areas of scene text recognition are detection, recognition, and end-to-end text recognition systems. Scene text detection and recognition systems have significant challenges due to scene complexity, text diversity, and real-time constraints. The traditional optical character recognition method focuses on recognizing words from image sources but fails to identify multi-oriented and multi-lingual languages. Proposed deep learning approach combined with neural network based scene text detection and recognition method achieves competitive performance. It is critical in complicated scene and large scale dataset detection and identification, multilingual and multi-oriented detection and recognition, real time detection and recognition, and visual understanding. Integrated approach helps to localize, detect and recognize the text in different orientation and multi-language. Also it will reduce the recognition error by employing the advantage of dictionary search method. Proposed approach will provide reasonable performance for curve text, real time video and image files. On several standard text detection benchmarks, it will obtain competitive results in terms of speed and accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI