Sex estimation is an essential task in forensic anthropology. It is not only crucial for the identification of individuals from skeletal remains, but it is also essential for improving the reliability of other methods of biological profile estimation, such as age and stature, some of which perform better when sex is taken into account. This study investigates the application of machine learning (ML) techniques to sex estimation, with a particular focus on interpretability to address the "black box" challenge inherent in AI models. Using a diverse dataset of long bone measurements from 2,969 individuals, 12 different ML algorithms were evaluated. Missing data were handled using iterative regression imputation, though challenges arising from incomplete datasets underscored the need for improved data handling strategies. Linear Discriminant Analysis (LDA) emerged as the most accurate approach, achieving 95.2% accuracy. A key feature of this study is the integration of SHapley Additive exPlanations (SHAP) values, which provide individualized insights into the factors influencing each prediction. This interpretability framework ensures transparency and addresses legal and scientific concerns about the admissibility of AI-generated evidence in court. Indeed, misclassifications possibilities highlight the importance of clear, understandable models in forensic applications. The study emphasizes the significance of individualized prediction, illustrated by the probability of male or female classification for each individual, as well as the impact of missing values on prediction accuracy. This research demonstrates that ML models can effectively balance accuracy with interpretability, offering personalized, actionable insights for forensic investigations. It paves the way for AI-driven methods that meet both scientific rigor and legal standards, transforming sex estimation in forensic science by providing individualized, defensible evidence suitable for court.

Interpretable machine learning for individualized sex estimation from long bones / S. Knecht, P. Morandini, L. Biehler-Gomez, Y. Ardagna, M. Perrin, C. Cattaneo, C. Roman, P. Adalian. - In: INTERNATIONAL JOURNAL OF LEGAL MEDICINE. - ISSN 0937-9827. - (2025 Oct). [Epub ahead of print] [10.1007/s00414-025-03635-7]

Interpretable machine learning for individualized sex estimation from long bones

L. Biehler-Gomez;C. Cattaneo;
2025

Abstract

Sex estimation is an essential task in forensic anthropology. It is not only crucial for the identification of individuals from skeletal remains, but it is also essential for improving the reliability of other methods of biological profile estimation, such as age and stature, some of which perform better when sex is taken into account. This study investigates the application of machine learning (ML) techniques to sex estimation, with a particular focus on interpretability to address the "black box" challenge inherent in AI models. Using a diverse dataset of long bone measurements from 2,969 individuals, 12 different ML algorithms were evaluated. Missing data were handled using iterative regression imputation, though challenges arising from incomplete datasets underscored the need for improved data handling strategies. Linear Discriminant Analysis (LDA) emerged as the most accurate approach, achieving 95.2% accuracy. A key feature of this study is the integration of SHapley Additive exPlanations (SHAP) values, which provide individualized insights into the factors influencing each prediction. This interpretability framework ensures transparency and addresses legal and scientific concerns about the admissibility of AI-generated evidence in court. Indeed, misclassifications possibilities highlight the importance of clear, understandable models in forensic applications. The study emphasizes the significance of individualized prediction, illustrated by the probability of male or female classification for each individual, as well as the impact of missing values on prediction accuracy. This research demonstrates that ML models can effectively balance accuracy with interpretability, offering personalized, actionable insights for forensic investigations. It paves the way for AI-driven methods that meet both scientific rigor and legal standards, transforming sex estimation in forensic science by providing individualized, defensible evidence suitable for court.
Forensic anthropology; Long bones; Machine learning; Sex estimation; Sexual dimorphism
Settore BIOS-03/B - Antropologia
Settore MEDS-25/A - Medicina legale
ott-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
Interpretable_machine_learning_2025.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 1.71 MB
Formato Adobe PDF
1.71 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1190620
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact