Applications and limits of image-to-image translation models

Coscia, P.; Genovese, A.; Scotti, F.; Piuri, V.

doi:10.1109/DSP58604.2023.10167879

Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security, and medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not allow collecting a sufficient number of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require careful design and further discourage deployment on edge devices. Finally, I2I models require intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks limit their large-scale applicability. In this work, we examine the main solutions adopted to overcome the above issues and their impact on performance. We also investigate several approaches for deploying these models on low-powered devices, as well as weight-sharing techniques that reduce the number of parameters and the resources used.

Applications and limits of image-to-image translation models / P. Coscia, A. Genovese, F. Scotti, V. Piuri. - (International Conference on Digital Signal Processing Proceedings). - In: 2023 24th International Conference on Digital Signal Processing (DSP). - [s.l.] : IEEE, 2023 Jun 11. - ISBN 979-8-3503-3959-8. - pp. 1-5. (24th International Conference on Digital Signal Processing, Rhodes, 2023) [10.1109/DSP58604.2023.10167879].

Keywords: Image-to-image translation; GAN; cyclic loss
Scientific-disciplinary sectors of the contribution (display only):
	Sector INF/01 - Computer Science
	Sector ING-INF/05 - Information Processing Systems
Scientific-disciplinary sectors of the contribution (valid from 9 May 2024):
	Sector INFO-01/A - Computer Science
	Sector IINF-05/A - Information Processing Systems
Projects:
	Project title: Edge AI Technologies for Optimised Performance Embedded Processing (EdgeAI)
	Acronym: EdgeAI
	Funder: Ministero dello Sviluppo Economico
	Contract No.: 101097300

	Project title: SEcurity and RIghts in the CyberSpace (SERICS)
	Acronym: SERICS
	Funder: Ministero dell'Università e della Ricerca
	Contract No.: identification code PE00000014
Publication date: 11 June 2023
Organizations associated with the conference: Institute of Electrical and Electronics Engineers (IEEE)
DOI: https://dx.doi.org/10.1109/DSP58604.2023.10167879
Type: Book Part (author)
Appears in types: 03 - Book contribution

Files in this item:

File	Access	Type	License	Size	Format
Applications_and_Limits_of_Image-to-Image_Translation_Models.pdf	Restricted access	Publisher's version/PDF	No license	2.57 MB	Adobe PDF
dsp23.pdf	Open access	Post-print, accepted manuscript, etc. (version accepted by the publisher)	Creative Commons	1.97 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/967179

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca