RATNet: A deep learning model for Bengali handwritten characters recognition

Md Shafiqul Islam; Md Moklesur Rahman; Md Hafizur Rahman; M.W. Rivolta; M. Aktaruzzaman

doi:10.1007/s11042-022-12070-4

The Bengali language is written with a set of symbols for basic characters, modifiers, compound characters, and numerals. Recognition rates for handwritten basic characters and numerals are very high, but those for compound characters and modifiers remain poor. This may be due to the large number of classes, the wide variety of writing styles, the strong similarity between characters, and the lack of sufficient data for deep learning. Moreover, some compound characters appear very rarely in practice. Selecting only the frequently used characters may reduce the number of classes and hence improve accuracy. In this study, we computed statistics on the frequency of compound characters, developed two datasets for modifiers and compound characters, and proposed a heterogeneous deep learning model (RATNet) for character recognition. The statistics were computed on two daily Bengali newspapers, and characters with frequency >= 5% were selected, which yielded 87 out of 107 compound characters. Handwriting samples of the selected characters were collected from 130 writers of different ages and professions. The performance of RATNet was evaluated on the proposed datasets and on three existing datasets (ISI, CMATERdb, and BanglaLekha-Isolated), and it was compared with the LeNet-5, VGG-16, ResNet-50, and DenseNet-121 models. RATNet outperforms the other models, achieving 99.66%, 99.27%, 98.78%, and 97.70% accuracy for the recognition of numerals, basic characters, modifiers, and compound characters, respectively, on the CMATERdb dataset, while keeping the number of parameters relatively low, likely due to layer heterogeneity.
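The frequency-based selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not say how frequency was measured or normalized, so the sketch assumes a relative share among all compound-character occurrences in a text. The regular expression is a rough approximation of a Bengali conjunct (a consonant followed by one or more hasanta-plus-consonant pairs), and the names `CONJUNCT` and `select_frequent` are hypothetical.

```python
import re
from collections import Counter

# Rough approximation of a Bengali conjunct: one consonant followed by one or
# more (hasanta U+09CD + consonant) pairs. Assumption: the range U+0995..U+09B9
# is close enough to "consonant" for illustration purposes.
CONJUNCT = re.compile(r"[\u0995-\u09B9](?:\u09CD[\u0995-\u09B9])+")


def select_frequent(text, threshold=0.05):
    """Return {conjunct: share} for conjuncts whose share is >= threshold.

    Assumption: "share" means occurrences of one conjunct divided by the
    occurrences of all conjuncts found in `text`.
    """
    counts = Counter(CONJUNCT.findall(text))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {c: n / total for c, n in counts.items() if n / total >= threshold}
```

Calling `select_frequent(corpus_text)` on the concatenated newspaper text would return the retained conjuncts with their shares. Under a different definition of frequency, only the normalization line would need to change.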

RATNet: A deep learning model for Bengali handwritten characters recognition / M. Shafiqul Islam, M. Moklesur Rahman, M. Hafizur Rahman, M.W. Rivolta, M. Aktaruzzaman. - In: MULTIMEDIA TOOLS AND APPLICATIONS. - ISSN 1380-7501. - 81:8(2022 Mar), pp. 10631-10651. [10.1007/s11042-022-12070-4]



Keywords: Bengali; Handwritten character recognition; Dataset; Convolutional neural network; Residual attention; Deep learning
Scientific-disciplinary sector of the article (display only): INF/01 - Computer Science
Publication date: March 2022
Ahead-of-print or print date: 16 February 2022
Journal in ANCE: MULTIMEDIA TOOLS AND APPLICATIONS
DOI: https://dx.doi.org/10.1007/s11042-022-12070-4
Replaced by record: hdl:2434/937745
Type: Article (author)
Appears in types: 01 - Journal article

Files in this item:

File	Size	Format
J23_RATNet_PostPrint.pdf (open access from 02/03/2023; type: post-print, accepted manuscript, etc., i.e. the version accepted by the publisher)	4.21 MB	Adobe PDF
s11042-022-12070-4(1).pdf (restricted access; type: publisher's version/PDF)	1.66 MB	Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/937745
