Purpose: To perform a pilot study aiming at evaluating whether machine learning could be a useful model to evaluate semen analysis, improving the diagnostic work-up of male partner of infertile couples. Materials and Methods: A retrospective observational study was conducted using real-world data on male evaluated in routine andrological clinical practice at two Italian tertiary centers. The study utilized two distinct datasets: the first (UNIROMA) encompassed three distinct variables, including semen analysis, sex hormones, and testicular ultrasound parameters. The second dataset (UNIMORE) was constructed incorporating semen analysis, sex hormones, biochemical examinations, and parameters related to environmental pollution. The XGBoost analysis, as part of machine learning techniques, was applied separately to each dataset, as the two datasets did not share a significant overlap in terms of variables. Results: The UNIROMA dataset comprised 2,334 male subjects. The XGBoost analysis exhibited the highest accuracy (area under the curve [AUC], 0.987) in predicting patients with azoospermia compared to other categories. Remarkably, our analysis revealed that among the most influential predictive variables, follicle-stimulating hormone serum levels (F-score=492.0), inhibin B serum levels (F-score=261), and bitesticular volume (F-score=253.0) stood out. The UNIMORE dataset consisted of 11,981 records. The XGBoost analysis demonstrated a good predictive accuracy (AUC, 0.668), especially for identifying the azoospermia group. Notably, the most crucial predictive variables were environmental pollution parameters (PM10, F-score=361; NO2, F-score=299) and biochemical data (white blood cells, F-score=326; red blood cells, F-score=299). Conclusions: This pilot study applies machine learning to two extensive datasets, suggesting that changes in semen analysis may be linked to other variables, such as testicular ultrasound characteristics, red blood cell count, and environmental pollution.
Machine Learning Evaluation of Semen Analysis Could Reveal New Infertility-Related Markers: A Pilot Study / D. Santi, C. Pozza, G. Spaggiari, D. Gianfrilli, E. Sbardella, D. Paoli, L. Roli, M.C. De Santis, M. Bonomi, T. Trenti, A.M. Isidori, M. Simoni. - In: THE WORLD JOURNAL OF MEN'S HEALTH. - ISSN 2287-4208. - 43:(2025), pp. e82.1-e82.17. [10.5534/wjmh.250096]
Machine Learning Evaluation of Semen Analysis Could Reveal New Infertility-Related Markers: A Pilot Study
M. Bonomi;
2025
Abstract
Purpose: To perform a pilot study aiming at evaluating whether machine learning could be a useful model to evaluate semen analysis, improving the diagnostic work-up of male partner of infertile couples. Materials and Methods: A retrospective observational study was conducted using real-world data on male evaluated in routine andrological clinical practice at two Italian tertiary centers. The study utilized two distinct datasets: the first (UNIROMA) encompassed three distinct variables, including semen analysis, sex hormones, and testicular ultrasound parameters. The second dataset (UNIMORE) was constructed incorporating semen analysis, sex hormones, biochemical examinations, and parameters related to environmental pollution. The XGBoost analysis, as part of machine learning techniques, was applied separately to each dataset, as the two datasets did not share a significant overlap in terms of variables. Results: The UNIROMA dataset comprised 2,334 male subjects. The XGBoost analysis exhibited the highest accuracy (area under the curve [AUC], 0.987) in predicting patients with azoospermia compared to other categories. Remarkably, our analysis revealed that among the most influential predictive variables, follicle-stimulating hormone serum levels (F-score=492.0), inhibin B serum levels (F-score=261), and bitesticular volume (F-score=253.0) stood out. The UNIMORE dataset consisted of 11,981 records. The XGBoost analysis demonstrated a good predictive accuracy (AUC, 0.668), especially for identifying the azoospermia group. Notably, the most crucial predictive variables were environmental pollution parameters (PM10, F-score=361; NO2, F-score=299) and biochemical data (white blood cells, F-score=326; red blood cells, F-score=299). Conclusions: This pilot study applies machine learning to two extensive datasets, suggesting that changes in semen analysis may be linked to other variables, such as testicular ultrasound characteristics, red blood cell count, and environmental pollution.| File | Dimensione | Formato | |
|---|---|---|---|
|
Santi_D_WJMH_2025.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
5.08 MB
Formato
Adobe PDF
|
5.08 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




