Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.

Applications and limits of image-to-image translation models / P. Coscia, A. Genovese, F. Scotti, V. Piuri (INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS). - In: DSP[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2023 Jun 11. - ISBN 979-8-3503-3959-8. - pp. 1-5 (( Intervento presentato al 24. convegno International Conference on Digital Signal Processing : 11 through 13 June tenutosi a Rhodes nel 2023 [10.1109/DSP58604.2023.10167879].

Applications and limits of image-to-image translation models

P. Coscia
Primo
;
A. Genovese
Secondo
;
F. Scotti
Penultimo
;
V. Piuri
Ultimo
2023

Abstract

Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.
Image-to-image translation; GAN; cyclic loss
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
   Edge AI Technologies for Optimised Performance Embedded Processing (EdgeAI)
   EdgeAI
   MINISTERO DELLO SVILUPPO ECONOMICO
   101097300

   SEcurity and RIghts in the CyberSpace (SERICS)
   SERICS
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   codice identificativo PE00000014
11-giu-2023
Institute of Electrical and Electronics Engineers (IEEE)
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
dsp23.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 1.99 MB
Formato Adobe PDF
1.99 MB Adobe PDF Visualizza/Apri
Applications_and_Limits_of_Image-to-Image_Translation_Models.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 2.57 MB
Formato Adobe PDF
2.57 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/967179
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact