Compression strategies and space-conscious representations for deep neural networks

Marino, G.C.; Ghidoli, G.; Frasca, M.; Malchiodi, D.

doi:10.1109/ICPR48806.2021.9412209

Recent advances in deep learning have made available large, powerful convolutional neural networks (CNN) with state-of-the-art performance in several real-world applications. Unfortunately, these large-sized models have millions of parameters, thus they are not deployable on resource-limited platforms (e.g., where RAM is limited). Compression of CNNs becomes therefore a critical problem to achieve memory-efficient and possibly computationally faster model representations. In this paper, we investigate the impact of lossy compression of CNNs by weight pruning and quantization, and lossless weight matrix representations based on source coding. We tested several combinations of these techniques on four benchmark datasets for classification and regression problems, achieving compression rates up to 165 times, while preserving or improving the model performance.

Compression strategies and space-conscious representations for deep neural networks / G.C. Marino, G. Ghidoli, M. Frasca, D. Malchiodi (INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION). - In: 2020 25th International Conference on Pattern Recognition (ICPR)[s.l] : IEEE, 2020. - ISBN 978-1-7281-8808-9. - pp. 9835-9842 (( Intervento presentato al 25. convegno International Conference on Pattern Recognition, ICPR 2020 tenutosi a Milano nel 2021 [10.1109/ICPR48806.2021.9412209].

Compression strategies and space-conscious representations for deep neural networks

Marino G. C.;Ghidoli G.;M. Frasca;D. Malchiodi

2020

Abstract

Recent advances in deep learning have made available large, powerful convolutional neural networks (CNN) with state-of-the-art performance in several real-world applications. Unfortunately, these large-sized models have millions of parameters, thus they are not deployable on resource-limited platforms (e.g., where RAM is limited). Compression of CNNs becomes therefore a critical problem to achieve memory-efficient and possibly computationally faster model representations. In this paper, we investigate the impact of lossy compression of CNNs by weight pruning and quantization, and lossless weight matrix representations based on source coding. We tested several combinations of these techniques on four benchmark datasets for classification and regression problems, achieving compression rates up to 165 times, while preserving or improving the model performance.

Scheda breve

Scheda completa

Scheda completa (DC)

	Presenza di coautori internazionali
	
				No
			
	Lingua del contributo
	
				English
			
	Parole chiave
	
				CNN compression; Drug-target prediction; Entropy coding; Probabilistic quantization; Weight pruning
			
	Settori scientifico-disciplinari del contributo (sola visualizzazione)
	
				Settore INF/01 - Informatica
			
	Tipo
	
				Intervento a convegno
			
	Revisione (peer review)
	
				Esperti anonimi
			
	Classificazione in base al tipo di ricerca
	
				Ricerca di base
			
	Classificazione della pubblicazione
	
				Pubblicazione scientifica
			
	Titolo del progetto
	
	Titolo Progetto
	
									Multi-criteria optimized data structures: from compressed indexes to learned indexes, and beyond
								
	Nome finanziatore
	
										MINISTERO DELL'ISTRUZIONE E DEL MERITO
									
	N. Contratto
	
									2017WR7SHH_004
								
	Titolo del volume
	
				2020 25th International Conference on Pattern Recognition (ICPR)
			
	Editore
	
				IEEE
			
	Data di pubblicazione
	
				2020
			
	Pagina iniziale
	
				9835
			
	Pagina finale
	
				9842
			
	Numero di pagine
	
				8
			
	ISBN
	
				978-1-7281-8808-9
			
	Collana
	
				INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION
			
	Tipo di volume
	
				Volume a diffusione internazionale
			
	Contributo pubblicato in Open Access GOLD o DIAMOND
	
				No
			
	Nome del convegno
	
				International Conference on Pattern Recognition, ICPR 2020
			
	Luogo del convegno
	
				Milano
			
	Anno del convegno
	
				2021
			
	Numero del convegno
	
				25
			
	Tipo di convegno
	
				Convegno internazionale
			
	Sezione
	
				Intervento inviato
			
	DOI
	
				https://dx.doi.org/10.1109/ICPR48806.2021.9412209
			
	Banca dati sorgente
	
				scopus
crossref
wos
			
	Identificativo ISI
	
				WOS:000681331402046
			
	Identificativo SCOPUS
	
				2-s2.0-85110507378
			
	Adesione alla policy Open Access di Ateneo
	
				Aderisco
			
	Tutti gli autori
	
						G.C. Marino, G. Ghidoli, M. Frasca, D. Malchiodi
					
	Tipologia
	
				Book Part (author)
			
	Fulltext
	
				reserved
			
	Tipologia sito docente
	
				273
			
	Citazione
	
				Compression strategies and space-conscious representations for deep neural networks / G.C. Marino, G. Ghidoli, M. Frasca, D. Malchiodi (INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION). - In: 2020 25th International Conference on Pattern Recognition (ICPR)[s.l] : IEEE, 2020. - ISBN 978-1-7281-8808-9. - pp. 9835-9842 (( Intervento presentato al 25. convegno International Conference on Pattern Recognition, ICPR 2020 tenutosi a Milano nel 2021 [10.1109/ICPR48806.2021.9412209].
			
	Tipologia
	
				info:eu-repo/semantics/bookPart
			
	Numero autori
	
				4
			
	Tipologia
	
				Prodotti della ricerca::03 - Contributo in volume
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
ICPR-published.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 337.88 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	337.88 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/876038

Citazioni

ND

9

9

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca