In the context of deterministic scientific simulations, formal validity requirements have typically been defined to help us reason about the relationship between the mathematical model underlying the target system and the computational model used to simulate it. With machine learning simulations entering the picture, we argue that these formal requirements need to be reviewed, as the objects to which they apply have significantly changed. This is due to several reasons: the target system is no longer an available system object of investigation; the probabilistic mathematical model is abstracted from the target system through the mediation of the computational model, which however remains largely opaque to us due to its high complexity. For these reasons, we formulate weaker, probabilistic versions of the traditional relations of Simulation, Bisimulation, and Approximate Simulation. Accordingly, we define three corresponding validity criteria that capture a range of cases, from the strongest to the weakest, depending on the extent to which the machine learning model can be assumed to correctly represent its target system.

Defining Formal Validity Criteria for Machine Learning Models / C. Manganini, G. Primiero (SYNTHÈSE LIBRARY). - In: Philosophy of Science for Machine Learning : Core Issues and New Perspectives / [a cura di] J.M. Durán, G. Pozzi. - [s.l] : Springer Cham, 2026. - ISBN 9783032030825. - pp. 295-312 [10.1007/978-3-032-03083-2_14]

Defining Formal Validity Criteria for Machine Learning Models

C. Manganini;G. Primiero
2026

Abstract

In the context of deterministic scientific simulations, formal validity requirements have typically been defined to help us reason about the relationship between the mathematical model underlying the target system and the computational model used to simulate it. With machine learning simulations entering the picture, we argue that these formal requirements need to be reviewed, as the objects to which they apply have significantly changed. This is due to several reasons: the target system is no longer an available system object of investigation; the probabilistic mathematical model is abstracted from the target system through the mediation of the computational model, which however remains largely opaque to us due to its high complexity. For these reasons, we formulate weaker, probabilistic versions of the traditional relations of Simulation, Bisimulation, and Approximate Simulation. Accordingly, we define three corresponding validity criteria that capture a range of cases, from the strongest to the weakest, depending on the extent to which the machine learning model can be assumed to correctly represent its target system.
Non-deterministic computation; Probabilistic simulation; Validity; Modesl; Markov semantics; Process algebra
Settore PHIL-02/A - Logica e filosofia della scienza
2026
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
978-3-032-03083-2_14.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Licenza: Creative commons
Dimensione 198.48 kB
Formato Adobe PDF
198.48 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1205671
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact