Graph Machine Learning for Fast Product Development from Formulation Trials / M. Dileo, R. Olmeda, M. Pindaro, M. Zignani (Lecture Notes in Artificial Intelligence). - In: Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD / edited by A. Bifet, T. Krilavicius, I. Miliou, S. Nowaczyk. - [s.l.] : Springer, 2024 Aug 22. - ISBN 978-3-031-70377-5. - pp. 303-318 (European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD, held in Vilnius, September 9-13, 2024) [10.1007/978-3-031-70378-2_19].

Graph Machine Learning for Fast Product Development from Formulation Trials

M. Dileo (first author); M. Zignani (last author)
2024

Abstract

Product development is the process of creating and bringing a new or improved product to market. Formulation trials constitute a crucial stage in product development, often involving the exploration of numerous variables and product properties. Traditional methods of formulation trials involve time-consuming experimentation, trial and error, and iterative processes. In recent years, machine learning (ML) has emerged as a promising avenue to streamline this complex journey by enhancing efficiency, innovation, and customization. One of the paramount challenges in ML for product development is the models’ lack of interpretability and explainability. This challenge poses significant limitations in gaining user trust, meeting regulatory requirements, and understanding the rationale behind ML-driven decisions. Moreover, formulation trials involve the exploration of relationships and similarities among previous preparations; however, data related to formulation are typically stored in tables and not in a network-like manner. To cope with the above challenges, we propose a general methodology for fast product development leveraging graph ML models, explainability techniques, and powerful data visualization tools. Starting from tabular formulation trials, our model simultaneously learns a latent graph between items and a downstream task, i.e. predicting consumer-appealing properties of a formulation. Subsequently, explainability techniques based on graphs, perturbation, and sensitivity analysis effectively support the R&D department in identifying new recipes for reaching a desired property. We evaluate our model on two datasets derived from a case study based on food design plus a standard benchmark from the healthcare domain. Results show the effectiveness of our model in predicting the outcome of new formulations. Thanks to our solution, the company has drastically reduced the labor-intensive experiments in real laboratories and the waste of materials.
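The abstract names two ingredients that a small example can make concrete: a graph over tabular items, and perturbation-based sensitivity analysis. The sketch below is only an illustration under stated assumptions, not the authors' model. A fixed k-nearest-neighbour graph built from cosine similarity stands in for the latent graph that the paper learns jointly with the task, and a central finite difference stands in for the paper's explainability techniques. The names `knn_graph`, `sensitivity`, and `predict`, as well as the toy data, are hypothetical.

```python
import numpy as np

def knn_graph(X, k=2):
    """Row-normalised adjacency linking each row of X to its k most
    cosine-similar rows. Assumes no row of X is all zeros."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    S = Xn @ Xn.T
    np.fill_diagonal(S, -np.inf)             # exclude self-similarity
    idx = np.argsort(-S, axis=1)[:, :k]      # k nearest rows per item
    A = np.zeros_like(S)
    A[np.arange(len(X))[:, None], idx] = 1.0
    return A / A.sum(axis=1, keepdims=True)

def sensitivity(predict, x, eps=1e-4):
    """Central finite-difference estimate of how the prediction changes
    when each feature of x is perturbed in turn."""
    grads = np.zeros(x.size)
    for j in range(x.size):
        step = np.zeros(x.size)
        step[j] = eps
        grads[j] = (predict(x + step) - predict(x - step)) / (2 * eps)
    return grads

# Toy, made-up formulations: rows are trials, columns are ingredient shares.
X = np.array([[0.6, 0.3, 0.1],
              [0.5, 0.4, 0.1],
              [0.1, 0.2, 0.7],
              [0.2, 0.1, 0.7]])
A = knn_graph(X, k=1)
H = A @ X                                    # one round of neighbour averaging

# Hypothetical predictor of a target property (a fixed linear score).
w = np.array([1.5, -0.5, 0.2])
predict = lambda v: float(v @ w)
print(sensitivity(predict, X[0]))
```

In this reading, a large positive sensitivity for an ingredient suggests that increasing it moves the predicted property upward, which is the kind of signal an R&D team could use when proposing a new recipe.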
Product Development; Structure Learning; XAI for tabular data
Sector INFO-01/A - Computer Science
22 Aug 2024
Artificial Intelligence Association Lithuania
Vilniaus universitetas
Book Part (author)
Files in this item:

File: 978-3-031-70378-2_19.pdf
Access: restricted
Type: Publisher's version/PDF
Size: 815.57 kB
Format: Adobe PDF

File: 2024_IntellicoAdsTrack_ECML-12.pdf
Access: Open Access from 24/08/2025
Type: Post-print, accepted manuscript, etc. (version accepted by the publisher)
Size: 916.83 kB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/1122020