In the context of deterministic scientific simulations, formal validity requirements have typically been defined to help us reason about the relationship between the mathematical model underlying the target system and the computational model used to simulate it. With machine learning simulations entering the picture, we argue that these formal requirements need to be reviewed, as the objects to which they apply have significantly changed. This is due to several reasons: the target system is no longer an available system object of investigation; the probabilistic mathematical model is abstracted from the target system through the mediation of the computational model, which however remains largely opaque to us due to its high complexity. For these reasons, we formulate weaker, probabilistic versions of the traditional relations of Simulation, Bisimulation, and Approximate Simulation. Accordingly, we define three corresponding validity criteria that capture a range of cases, from the strongest to the weakest, depending on the extent to which the machine learning model can be assumed to correctly represent its target system.
Defining Formal Validity Criteria for Machine Learning Models / C. Manganini, G. Primiero (SYNTHÈSE LIBRARY). - In: Philosophy of Science for Machine Learning : Core Issues and New Perspectives / [a cura di] J.M. Durán, G. Pozzi. - [s.l] : Springer Cham, 2026. - ISBN 9783032030825. - pp. 295-312 [10.1007/978-3-032-03083-2_14]
Defining Formal Validity Criteria for Machine Learning Models
C. Manganini;G. Primiero
2026
Abstract
In the context of deterministic scientific simulations, formal validity requirements have typically been defined to help us reason about the relationship between the mathematical model underlying the target system and the computational model used to simulate it. With machine learning simulations entering the picture, we argue that these formal requirements need to be reviewed, as the objects to which they apply have significantly changed. This is due to several reasons: the target system is no longer an available system object of investigation; the probabilistic mathematical model is abstracted from the target system through the mediation of the computational model, which however remains largely opaque to us due to its high complexity. For these reasons, we formulate weaker, probabilistic versions of the traditional relations of Simulation, Bisimulation, and Approximate Simulation. Accordingly, we define three corresponding validity criteria that capture a range of cases, from the strongest to the weakest, depending on the extent to which the machine learning model can be assumed to correctly represent its target system.| File | Dimensione | Formato | |
|---|---|---|---|
|
978-3-032-03083-2_14.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
198.48 kB
Formato
Adobe PDF
|
198.48 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




