The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.

Enhancing deep learning algorithm accuracy and stability using multicriteria optimization: an application to distributed learning with MNIST digits / D. La Torre, D. Liuzzi, M. Repetto, M. Rocca. - In: ANNALS OF OPERATIONS RESEARCH. - ISSN 0254-5330. - (2022), pp. 1-21. [Epub ahead of print] [10.1007/s10479-022-04833-x]

Enhancing deep learning algorithm accuracy and stability using multicriteria optimization: an application to distributed learning with MNIST digits

D. La Torre
Primo
;
D. Liuzzi;
2022

Abstract

The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.
Artificial intelligence; Deep learning; Machine learning; Multicriteria optimization; Classification; MINST data
Settore SECS-S/06 - Metodi mat. dell'economia e Scienze Attuariali e Finanziarie
2022
11-lug-2022
Article (author)
File in questo prodotto:
File Dimensione Formato  
DeepLearningMulticriteriaOptimizationLaTorreLiuzziRepettoRocca_ANOR.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 944.66 kB
Formato Adobe PDF
944.66 kB Adobe PDF Visualizza/Apri
s10479-022-04833-x.pdf

accesso riservato

Descrizione: online first
Tipologia: Publisher's version/PDF
Dimensione 1.11 MB
Formato Adobe PDF
1.11 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/961137
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
social impact