The significant impact of road traffic accidents on public health requires clear and effec- tive policies to combat them. However, public action can only be truly effective when supported by robust monitoring tools. This project aims to evaluate the effectiveness of a set of machine learning algorithms in predicting road accidents in Switzerland, utiliz- ing open-access Confederation drive crash databases combined with environmental and socio-economic factors. Three different algorithms are tested: Logistic Regression Model (LRM), Random Forest with Ranger (RF), and Artificial Neural Network (ANN) with Keras. Among the predictive factors, road types are shown to be of high importance in all models. Regarding model performance, all the applied algorithms show a high level of accuracy, with all models achieving over 90%. The Random Forest algorithm, optimised using the Ranger application, exhibited the best performance, particularly in terms of specificity (0.88 compared to 0.34 and 0.40 for LRM and Keras, respectively) and negative predictive value (0.96 compared to 0.65 for LRM and 0.68 for Keras). These results suggest that this approach could support public policy for traffic management, if data collection and sharing activities are constantly carried out.

Assessing the Impact of Infrastructure and Social Environment Predictors on Road Accidents in Switzerland Using Machine Learning Algorithms and Open Large-Scale Dataset / A. Auzzas, F. Capra Gian, A. Ganga. - In: URBAN SCIENCE. - ISSN 2413-8851. - 9:(2025 Aug 29), pp. 343.1-343.20. [10.3390/urbansci9090343]

Assessing the Impact of Infrastructure and Social Environment Predictors on Road Accidents in Switzerland Using Machine Learning Algorithms and Open Large-Scale Dataset

A. Ganga
Ultimo
2025

Abstract

The significant impact of road traffic accidents on public health requires clear and effec- tive policies to combat them. However, public action can only be truly effective when supported by robust monitoring tools. This project aims to evaluate the effectiveness of a set of machine learning algorithms in predicting road accidents in Switzerland, utiliz- ing open-access Confederation drive crash databases combined with environmental and socio-economic factors. Three different algorithms are tested: Logistic Regression Model (LRM), Random Forest with Ranger (RF), and Artificial Neural Network (ANN) with Keras. Among the predictive factors, road types are shown to be of high importance in all models. Regarding model performance, all the applied algorithms show a high level of accuracy, with all models achieving over 90%. The Random Forest algorithm, optimised using the Ranger application, exhibited the best performance, particularly in terms of specificity (0.88 compared to 0.34 and 0.40 for LRM and Keras, respectively) and negative predictive value (0.96 compared to 0.65 for LRM and 0.68 for Keras). These results suggest that this approach could support public policy for traffic management, if data collection and sharing activities are constantly carried out.
drive crashes; logistic regression; Keras; Ranger; health policies
Settore CEAR-04/A - Geomatica
29-ago-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
urbansci-09-00343-v4.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Licenza: Creative commons
Dimensione 5.14 MB
Formato Adobe PDF
5.14 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1232105
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact