Graph Machine Learning for Fast Product Development from Formulation Trials / M. Dileo, R. Olmeda, M. Pindaro, M. Zignani (Lecture Notes in Artificial Intelligence). - In: Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD / edited by A. Bifet, T. Krilavicius, I. Miliou, S. Nowaczyk. - [s.l.] : Springer, 2024 Aug 22. - ISBN 978-3-031-70377-5. - pp. 303-318 (Conference: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD, held in Vilnius, 9-13 September 2024) [10.1007/978-3-031-70378-2_19].
Graph Machine Learning for Fast Product Development from Formulation Trials
M. Dileo (first author); M. Zignani (last author)
2024
Abstract
Product development is the process of creating a new or improved product and bringing it to market. Formulation trials are a crucial stage of this process and often involve exploring numerous variables and product properties. Traditional formulation trials rely on time-consuming experimentation, trial and error, and iterative processes. In recent years, machine learning (ML) has emerged as a promising way to streamline this process by enhancing efficiency, innovation, and customization. A major challenge for ML in product development is the models’ lack of interpretability and explainability, which limits user trust, complicates regulatory compliance, and obscures the rationale behind ML-driven decisions. Moreover, formulation trials involve exploring relationships and similarities among previous preparations, yet formulation data are typically stored in tables rather than in a network-like structure. To address these challenges, we propose a general methodology for fast product development that leverages graph ML models, explainability techniques, and data visualization tools. Starting from tabular formulation trials, our model simultaneously learns a latent graph between items and a downstream task, i.e., predicting consumer-appealing properties of a formulation. Subsequently, explainability techniques based on graphs, perturbation, and sensitivity analysis support the R&D department in identifying new recipes that reach a desired property. We evaluate our model on two datasets derived from a food design case study and on a standard benchmark from the healthcare domain. Results show the effectiveness of our model in predicting the outcome of new formulations.
Thanks to our solution, the company has drastically reduced labor-intensive laboratory experiments and material waste.

| File | Type | Size | Format | Access |
|---|---|---|---|---|
| 978-3-031-70378-2_19.pdf | Publisher's version/PDF | 815.57 kB | Adobe PDF | Restricted access |
| 2024_IntellicoAdsTrack_ECML-12.pdf | Post-print, accepted manuscript, etc. (version accepted by the publisher) | 916.83 kB | Adobe PDF | Open Access from 24/08/2025 |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.




