Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.
Applications and limits of image-to-image translation models / P. Coscia, A. Genovese, F. Scotti, V. Piuri (INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS). - In: DSP[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2023 Jun 11. - ISBN 979-8-3503-3959-8. - pp. 1-5 (( Intervento presentato al 24. convegno International Conference on Digital Signal Processing : 11 through 13 June tenutosi a Rhodes nel 2023 [10.1109/DSP58604.2023.10167879].
Applications and limits of image-to-image translation models
P. CosciaPrimo
;A. GenoveseSecondo
;F. ScottiPenultimo
;V. PiuriUltimo
2023
Abstract
Image-to-image (I2I) translation models are widely employed in several fields, e.g., computer vision, security or medicine. Their goal is to map images from a source domain to a target domain while preserving content information. Despite their success, these models suffer from multiple weaknesses. For example, many practical scenarios do not consent to collect a sufficient amount of images, leading to imbalanced domains. Furthermore, mode collapse and training instability require a careful design and further discourage their deployment on edge devices. Finally, I2I models need an intensive computation to learn conditional probability distributions and are difficult to adapt to different contexts. These drawbacks mainly limit their large scale applicability. In this work, we want to shed light on the main solutions adopted to overcome the above issues and their impact on the performance. We also investigate several approaches to deploy these models on low-powered devices and weight sharing techniques to reduce the number of parameters and resources used.File | Dimensione | Formato | |
---|---|---|---|
dsp23.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
1.99 MB
Formato
Adobe PDF
|
1.99 MB | Adobe PDF | Visualizza/Apri |
Applications_and_Limits_of_Image-to-Image_Translation_Models.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
2.57 MB
Formato
Adobe PDF
|
2.57 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.