Purpose: To perform a pilot study aiming at evaluating whether machine learning could be a useful model to evaluate semen analysis, improving the diagnostic work-up of male partner of infertile couples. Materials and Methods: A retrospective observational study was conducted using real-world data on male evaluated in routine andrological clinical practice at two Italian tertiary centers. The study utilized two distinct datasets: the first (UNIROMA) encompassed three distinct variables, including semen analysis, sex hormones, and testicular ultrasound parameters. The second dataset (UNIMORE) was constructed incorporating semen analysis, sex hormones, biochemical examinations, and parameters related to environmental pollution. The XGBoost analysis, as part of machine learning techniques, was applied separately to each dataset, as the two datasets did not share a significant overlap in terms of variables. Results: The UNIROMA dataset comprised 2,334 male subjects. The XGBoost analysis exhibited the highest accuracy (area under the curve [AUC], 0.987) in predicting patients with azoospermia compared to other categories. Remarkably, our analysis revealed that among the most influential predictive variables, follicle-stimulating hormone serum levels (F-score=492.0), inhibin B serum levels (F-score=261), and bitesticular volume (F-score=253.0) stood out. The UNIMORE dataset consisted of 11,981 records. The XGBoost analysis demonstrated a good predictive accuracy (AUC, 0.668), especially for identifying the azoospermia group. Notably, the most crucial predictive variables were environmental pollution parameters (PM10, F-score=361; NO2, F-score=299) and biochemical data (white blood cells, F-score=326; red blood cells, F-score=299). Conclusions: This pilot study applies machine learning to two extensive datasets, suggesting that changes in semen analysis may be linked to other variables, such as testicular ultrasound characteristics, red blood cell count, and environmental pollution.

Machine Learning Evaluation of Semen Analysis Could Reveal New Infertility-Related Markers: A Pilot Study / D. Santi, C. Pozza, G. Spaggiari, D. Gianfrilli, E. Sbardella, D. Paoli, L. Roli, M.C. De Santis, M. Bonomi, T. Trenti, A.M. Isidori, M. Simoni. - In: THE WORLD JOURNAL OF MEN'S HEALTH. - ISSN 2287-4208. - 43:(2025), pp. e82.1-e82.17. [10.5534/wjmh.250096]

Machine Learning Evaluation of Semen Analysis Could Reveal New Infertility-Related Markers: A Pilot Study

M. Bonomi;
2025

Abstract

Purpose: To perform a pilot study aiming at evaluating whether machine learning could be a useful model to evaluate semen analysis, improving the diagnostic work-up of male partner of infertile couples. Materials and Methods: A retrospective observational study was conducted using real-world data on male evaluated in routine andrological clinical practice at two Italian tertiary centers. The study utilized two distinct datasets: the first (UNIROMA) encompassed three distinct variables, including semen analysis, sex hormones, and testicular ultrasound parameters. The second dataset (UNIMORE) was constructed incorporating semen analysis, sex hormones, biochemical examinations, and parameters related to environmental pollution. The XGBoost analysis, as part of machine learning techniques, was applied separately to each dataset, as the two datasets did not share a significant overlap in terms of variables. Results: The UNIROMA dataset comprised 2,334 male subjects. The XGBoost analysis exhibited the highest accuracy (area under the curve [AUC], 0.987) in predicting patients with azoospermia compared to other categories. Remarkably, our analysis revealed that among the most influential predictive variables, follicle-stimulating hormone serum levels (F-score=492.0), inhibin B serum levels (F-score=261), and bitesticular volume (F-score=253.0) stood out. The UNIMORE dataset consisted of 11,981 records. The XGBoost analysis demonstrated a good predictive accuracy (AUC, 0.668), especially for identifying the azoospermia group. Notably, the most crucial predictive variables were environmental pollution parameters (PM10, F-score=361; NO2, F-score=299) and biochemical data (white blood cells, F-score=326; red blood cells, F-score=299). Conclusions: This pilot study applies machine learning to two extensive datasets, suggesting that changes in semen analysis may be linked to other variables, such as testicular ultrasound characteristics, red blood cell count, and environmental pollution.
Infertility; male; Semen analysis; Testis; Ultrasonography;
Settore MEDS-08/A - Endocrinologia
2025
23-ott-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
Santi_D_WJMH_2025.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Licenza: Creative commons
Dimensione 5.08 MB
Formato Adobe PDF
5.08 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1202835
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact