计算机科学
水准点(测量)
人工智能
循环神经网络
光学字符识别
卷积神经网络
模式识别(心理学)
深度学习
像素
图像(数学)
语音识别
人工神经网络
大地测量学
地理
作者
Aditya Yadav,Shauryan Singh,M.I. Siddique,Nileshkumar Mehta,Archana Kotangale
标识
DOI:10.1109/incet57972.2023.10170436
摘要
Optical Character Recognition (OCR) is a widely used technology that converts image text or handwritten text into digital form. However, recognizing handwritten text, printed text, and image text poses a significant challenge due to variations in writing styles and the complexity of characters. This paper proposes a novel approach for OCR using Convolutional Recurrent Neural Network (CRNN) that combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The proposed CRNN architecture can automatically learn and extract features from raw image pixels and recognize sequential patterns of characters. This research paper presents a robust OCR system using CRNN architecture with 7 convolutional layers and 2 LSTM layers for recognizing text in images with complex backgrounds and varying fonts. The proposed system achieved state-of-the-art performance on several benchmark datasets, demonstrating the effectiveness of the proposed approach. Our experimental results demonstrate that the proposed CRNN approach is better than other methods and achieves higher accuracy with less latency in recognizing text from an image. We also analyze the impact of different parameters, such as the number of layers, filter sizes, and hidden units, on the performance of the CRNN model. This paper provides a comprehensive study on OCR using CRNN and its potential to improve the accuracy and efficiency of recognizing text.
科研通智能强力驱动
Strongly Powered by AbleSci AI