Multi-font Devanagari text recognition using LSTM neural networks

Kundaikar, T.; Pawar, J.D.

dc.contributor.author	Kundaikar, T.
dc.contributor.author	Pawar, J.D.
dc.date.accessioned	2019-11-08T05:44:22Z
dc.date.available	2019-11-08T05:44:22Z
dc.date.issued	2019
dc.identifier.citation	Advances in Intelligent Systems and Computing. First International Conference on Sustainable Technologies for Computational Intelligence, Ed. by: Luhach, A.; Kosa, J.; Poonia, R.; Gao, Xiao-Zhi; Singh, Dharm. 1045; 2019; 495-506.	en_US
dc.identifier.uri	https://doi.org/10.1007/978-981-15-0029-9_39
dc.identifier.uri	http://irgu.unigoa.ac.in/drs/handle/unigoa/5884
dc.description.abstract	Current research in OCR focuses on the effect of multi-font and multi-size text on OCR accuracy. To the best of our knowledge, no study has examined the effect of multi-font and multi-size text on the accuracy of Devanagari OCRs. The most popular Devanagari OCRs on the market today are Tesseract OCR, Indsenz OCR and eAksharayan OCR. In this work, we study the effect of five font styles, namely Nakula, Baloo, Dekko, Biryani and Aparajita, on these three OCRs. We observe that the accuracy of the Devanagari OCRs depends on the font style used in the text document images. Hence, we propose a multi-font Devanagari OCR (MFD_OCR), a text line recognition model using long short-term memory (LSTM) neural networks. We created a training dataset, Multi_Font_Train, which consists of text document images and their corresponding text files. It contains each text line in the five font styles listed above. The test datasets were created using the text from the benchmark dataset [1] for each of these font styles, and are named BMT_Nakula, BMT_Baloo, BMT_Dekko, BMT_Biryani and BMT_Aparajita. When all OCRs were evaluated, MFD_OCR showed consistent accuracy across all these test datasets. It obtained comparatively good accuracy on the BMT_Dekko and BMT_Biryani test datasets. Detailed error analysis showed that, compared to the other Devanagari OCRs, MFD_OCR has consistent rates of insertion and deletion errors across the test datasets for all font styles. The deletion errors are negligible, ranging from 0.8 to 1.4 percent.	en_US
dc.publisher	Springer	en_US
dc.subject	Computer Science and Technology	en_US
dc.title	Multi-font Devanagari text recognition using LSTM neural networks	en_US
dc.type	Conference article	en_US
dc.identifier.impf	cs