
Certified Machine-Learning Models / E. Damiani, C. Ardagna (Lecture Notes in Computer Science). In: SOFSEM 2020: Theory and Practice of Computer Science / edited by A. Chatzigeorgiou, R. Dondi, H. Herodotou, C. Kapoutsis, Y. Manolopoulos, G.A. Papadopoulos, F. Sikora. Springer, 2020. ISBN 9783030389185. pp. 3-15. Presented at the 46th International Conference on Current Trends in Theory and Practice of Informatics, Limassol, 2020.

Certified Machine-Learning Models

E. Damiani;C. Ardagna
2020

Abstract

The massive adoption of Machine Learning (ML) has deeply changed the internal structure, the design, and the operation of software systems. ML has shifted the focus from code to data, especially in application areas where it is easier to collect samples embodying correct solutions to individual instances of a problem than to design and code a deterministic algorithm that solves it for all instances. There is an increasing awareness of the need to verify key non-functional properties of ML-based software applications, such as fairness and privacy. However, the traditional approach of verifying these properties by code inspection is pointless, since ML models' behavior mostly depends on the data and parameters used to train them. Classic software certification techniques cannot solve the issue either. The Artificial Intelligence (AI) community has been working on the idea of preventing undesired behavior by controlling a priori the ML models' training sets and parameters. In this paper, we take a different, online approach to ML verification, where novel behavioral monitoring techniques based on statistical testing support a dynamic certification framework enforcing the desired properties on black-box ML models in operation. Our aim is to deliver a novel framework suitable for practical certification of distributed ML-powered applications in heavily regulated domains like transport, energy, and healthcare, even when the certifying authority is not privy to the model training. To achieve this goal, we rely on three key ideas: (i) use test suites to define desired non-functional properties of ML models, (ii) use statistical monitoring of ML models' behavior at inference time to check that the desired behavioral properties are achieved, and (iii) compose monitors' outcomes into dynamic, virtual certificates for composite software applications.
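To make idea (ii) concrete, the following is a purely illustrative sketch, not taken from the paper: a monitor that observes a black-box model's predictions at inference time and uses a statistical test to decide whether a non-functional property holds. The property chosen here (demographic parity between two groups), the function names, and the thresholds `epsilon` and `z` are all assumptions made for the example.

```python
from math import sqrt

def demographic_parity_monitor(predict, stream, epsilon=0.1, z=2.58):
    """Observe a black-box model on a stream of (input, group) pairs and
    decide, via a one-sided z-test on the difference of proportions,
    whether the demographic-parity gap between the two groups stays
    below epsilon with high confidence."""
    counts = {0: [0, 0], 1: [0, 0]}  # group -> [positive predictions, total]
    for x, group in stream:
        counts[group][0] += int(predict(x) == 1)
        counts[group][1] += 1
    (p0, n0), (p1, n1) = counts[0], counts[1]
    if min(n0, n1) == 0:
        return None  # not enough evidence about one of the groups
    r0, r1 = p0 / n0, p1 / n1  # observed positive-prediction rates
    gap = abs(r0 - r1)
    # normal-approximation standard error of the rate difference
    se = sqrt(r0 * (1 - r0) / n0 + r1 * (1 - r1) / n1)
    # certify the property only if even the upper confidence bound
    # on the gap stays below the tolerated threshold
    return bool(gap + z * se <= epsilon)
```

In the framework described by the abstract, the boolean verdict of such a monitor would be one input to a dynamic, virtual certificate; note that the monitor needs only query access to `predict`, matching the black-box setting where the certifier is not privy to training.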
Certification; Intelligent systems; Machine Learning
Field: INF/01 - Informatica (Computer Science)
Funding: Cyber security cOmpeteNce fOr Research anD Innovation (CONCORDIA), European Commission, H2020, grant no. 830927
Type: Book Part (author), 2020
Files in this record:

paper_CR.pdf
  Type: Post-print, accepted manuscript (version accepted by the publisher)
  Size: 423.16 kB
  Format: Adobe PDF
  Access: restricted

Damiani-Ardagna2020_Chapter_CertifiedMachine-LearningModel.pdf
  Type: Publisher's version/PDF
  Size: 851.54 kB
  Format: Adobe PDF
  Access: restricted

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/714671
Citations
  • PMC: not available
  • Scopus: 5
  • Web of Science: 5