Data-Driven Container Marking Detection and Recognition System With an Open Large-Scale Scene Text Dataset

Ying, X.; Liang, Z.; Liang, Y.; Xinru, L.; Pan, W.; You, J.; Long, Z.; Zhai, Y.; Genovese, A.; Piuri, V.; Scotti, F.

doi:10.1109/tetci.2024.3377680

With the widespread use of containers, the demand for Container Marking Detection and Recognition (CMDR) is gradually increasing. The use of deep learning algorithms can greatly improve the efficiency of marking detection and recognition. However, there is still a lack of research on CMDR in both academia and industry, resulting in the current task being completed manually and inefficiently. In this paper, we probe into the importance of data-driven and task paradigms for CMDR tasks. Firstly, we constructed an open large scale container surface marking text dataset called ContainerText. This dataset consists of 12 k high-resolution images and provides two types of annotation information: bounding box used for detection and text for recognition tasks. In addition, we also propose an efficient semi-automatic annotation method based on deep learning, which reduces the cost of manual annotation. Subsequently, we have innovatively proposed a CMDR method combining Scene Text Recognition (STR) with CMDR tasks. The method based on STR can locate and recognize container marking from a fine-grained level. We conducted a comprehensive series of experiments on the ContainerText dataset using state-of-the-art (SOTA) scene text detection and scene text recognition models. The experimental results demonstrate that the CMDR method, based on STR, exhibits exceptional adaptability and feasibility. All experimental results obtained from the ContainerText dataset will act as performance benchmarks for future researchers. Finally, an automated Container Marking Image Acquisition Mechanism (CMIAM) are construucted, which can effectively avoid complex lighting in the workshop environment and achieve high-quality and automated image acquisition. We have conducted extensive experiments to measure the distance, resolution, and field of view required for clearly capturing container markings. Our research providing reference for future CMDR research from task solution and hardware selection.

Data-Driven Container Marking Detection and Recognition System With an Open Large-Scale Scene Text Dataset / Y. Xu, Z. Liang, Y. Liang, X. Li, W. Pan, J. You, Z. Long, Y. Zhai, A. Genovese, V. Piuri, F. Scotti. - In: IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE. - ISSN 2471-285X. - 8:5(2024 Oct), pp. 3368-3381. [10.1109/tetci.2024.3377680]

Data-Driven Container Marking Detection and Recognition System With an Open Large-Scale Scene Text Dataset

Xu, Ying;Liang, Zhangzhao;Liang, Yanyang;Li, Xinru;Pan, Wenfeng;You, Jie;Long, Zhihao;Zhai, Yikui;A. Genovese;V. Piuri^Penultimo;F. Scotti^Ultimo

2024

Abstract

With the widespread use of containers, the demand for Container Marking Detection and Recognition (CMDR) is gradually increasing. The use of deep learning algorithms can greatly improve the efficiency of marking detection and recognition. However, there is still a lack of research on CMDR in both academia and industry, resulting in the current task being completed manually and inefficiently. In this paper, we probe into the importance of data-driven and task paradigms for CMDR tasks. Firstly, we constructed an open large scale container surface marking text dataset called ContainerText. This dataset consists of 12 k high-resolution images and provides two types of annotation information: bounding box used for detection and text for recognition tasks. In addition, we also propose an efficient semi-automatic annotation method based on deep learning, which reduces the cost of manual annotation. Subsequently, we have innovatively proposed a CMDR method combining Scene Text Recognition (STR) with CMDR tasks. The method based on STR can locate and recognize container marking from a fine-grained level. We conducted a comprehensive series of experiments on the ContainerText dataset using state-of-the-art (SOTA) scene text detection and scene text recognition models. The experimental results demonstrate that the CMDR method, based on STR, exhibits exceptional adaptability and feasibility. All experimental results obtained from the ContainerText dataset will act as performance benchmarks for future researchers. Finally, an automated Container Marking Image Acquisition Mechanism (CMIAM) are construucted, which can effectively avoid complex lighting in the workshop environment and achieve high-quality and automated image acquisition. We have conducted extensive experiments to measure the distance, resolution, and field of view required for clearly capturing container markings. Our research providing reference for future CMDR research from task solution and hardware selection.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Annotations; Character recognition; Container marking detection and recognition; Containers; Deep learning; deep learning; Image resolution; scene text dataset; scene text recognition; Task analysis; Text recognition;
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Data di pubblicazione
	
				ott-2024
			
	Data ahead of print o data di stampa
	
				mar-2024
			
	Rivista in ANCE
	
				IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE
			
	DOI
	
				https://dx.doi.org/10.1109/tetci.2024.3377680
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
Data-Driven_Container_Marking_Detection_and_Recognition_System_With_an_Open_Large-Scale_Scene_Text_Dataset.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 7.05 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	7.05 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
PREPRINT_Data-Driven_Container_Marking_Detection_and_Recognition_System_With_an_Open_Large-Scale_Scene_Text_Dataset 1.pdf accesso aperto Tipologia: Pre-print (manoscritto inviato all'editore) Dimensione 7.17 MB Formato Adobe PDF Visualizza/Apri	7.17 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1043072

Citazioni

ND

3

2

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca