Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.

Applications and limits of image-to-image translation models / P. Coscia, A. Genovese, F. Scotti, V. Piuri (INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS). - In: 2023 24th International Conference on Digital Signal Processing (DSP)[s.l] : IEEE, 2023 Jun 11. - ISBN 979-8-3503-3959-8. - pp. 1-5 (( 24. International Conference on Digital Signal Processing Rhodes 2023 [10.1109/DSP58604.2023.10167879].

Applications and limits of image-to-image translation models

P. Coscia
Primo
;
A. Genovese
Secondo
;
F. Scotti
Penultimo
;
V. Piuri
Ultimo
2023

Abstract

Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.
Image-to-image translation; GAN; cyclic loss
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
Settore INFO-01/A - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
   Edge AI Technologies for Optimised Performance Embedded Processing (EdgeAI)
   EdgeAI
   MINISTERO DELLO SVILUPPO ECONOMICO
   101097300

   SEcurity and RIghts in the CyberSpace (SERICS)
   SERICS
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   codice identificativo PE00000014
11-giu-2023
Institute of Electrical and Electronics Engineers (IEEE)
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Applications_and_Limits_of_Image-to-Image_Translation_Models.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 2.57 MB
Formato Adobe PDF
2.57 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
dsp23.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza: Creative commons
Dimensione 1.97 MB
Formato Adobe PDF
1.97 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/967179
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact