Generation and validation of a classification model to diagnose familial hypercholesterolaemia in adults

Albuquerque, J.; Medeiros, A.M.; Alves, A.C.; Jannes, C.E.; Mancina, R.M.; Pavanello, C.; Chora, J.R.; Mombelli, G.; Calabresi, L.; Pereira, A.D.C.; Krieger, J.E.; Romeo, S.; Bourbon, M.; Antunes, M.

doi:10.1016/j.atherosclerosis.2023.117314

Background and aims: The early diagnosis of familial hypercholesterolaemia is associated with a significant reduction in cardiovascular disease (CVD) risk. While the recent use of statistical and machine learning algorithms has shown promising results in comparison with traditional clinical criteria, when applied to screening of potential FH cases in large cohorts, most studies in this field are developed using a single cohort of patients, which may hamper the application of such algorithms to other populations. In the current study, a logistic regression (LR) based algorithm was developed combining observations from three different national FH cohorts, from Portugal, Brazil and Sweden. Independent samples from these cohorts were then used to test the model, as well as an external dataset from Italy. Methods: The area under the receiver operating characteristics (AUROC) and precision-recall (AUPRC) curves was used to assess the discriminatory ability among the different samples. Comparisons between the LR model and Dutch Lipid Clinic Network (DLCN) clinical criteria were performed by means of McNemar tests, and by the calculation of several operating characteristics. Results: AUROC and AUPRC values were generally higher for all testing sets when compared to the training set. Compared with DLCN criteria, a significantly higher number of correctly classified observations were identified for the Brazilian (p < 0.01), Swedish (p < 0.01), and Italian testing sets (p < 0.01). Higher accuracy (Acc), G mean and F1 score values were also observed for all testing sets. Conclusions: Compared to DLCN criteria, the LR model revealed improved ability to correctly classify observations, and was able to retain a similar number of FH cases, with less false positive retention. Generalization of the LR model was very good across all testing samples, suggesting it can be an effective screening tool if applied to different populations.

Generation and validation of a classification model to diagnose familial hypercholesterolaemia in adults / J. Albuquerque, A.M. Medeiros, A.C. Alves, C.E. Jannes, R.M. Mancina, C. Pavanello, J.R. Chora, G. Mombelli, L. Calabresi, A.D.C. Pereira, J.E. Krieger, S. Romeo, M. Bourbon, M. Antunes. - In: ATHEROSCLEROSIS. - ISSN 1879-1484. - 383:(2023 Oct), pp. 117314.1-117314.8. [10.1016/j.atherosclerosis.2023.117314]

Generation and validation of a classification model to diagnose familial hypercholesterolaemia in adults

Albuquerque, João;Medeiros, Ana Margarida;Alves, Ana Catarina;Jannes, Cinthia Elim;Mancina, Rosellina M;C. Pavanello;Chora, Joana Rita;Mombelli, Giuliana;L. Calabresi;Pereira, Alexandre da Costa;Krieger, José Eduardo;Romeo, Stefano;Bourbon, Mafalda;Antunes, Marília

2023

Abstract

Background and aims: The early diagnosis of familial hypercholesterolaemia is associated with a significant reduction in cardiovascular disease (CVD) risk. While the recent use of statistical and machine learning algorithms has shown promising results in comparison with traditional clinical criteria, when applied to screening of potential FH cases in large cohorts, most studies in this field are developed using a single cohort of patients, which may hamper the application of such algorithms to other populations. In the current study, a logistic regression (LR) based algorithm was developed combining observations from three different national FH cohorts, from Portugal, Brazil and Sweden. Independent samples from these cohorts were then used to test the model, as well as an external dataset from Italy. Methods: The area under the receiver operating characteristics (AUROC) and precision-recall (AUPRC) curves was used to assess the discriminatory ability among the different samples. Comparisons between the LR model and Dutch Lipid Clinic Network (DLCN) clinical criteria were performed by means of McNemar tests, and by the calculation of several operating characteristics. Results: AUROC and AUPRC values were generally higher for all testing sets when compared to the training set. Compared with DLCN criteria, a significantly higher number of correctly classified observations were identified for the Brazilian (p < 0.01), Swedish (p < 0.01), and Italian testing sets (p < 0.01). Higher accuracy (Acc), G mean and F1 score values were also observed for all testing sets. Conclusions: Compared to DLCN criteria, the LR model revealed improved ability to correctly classify observations, and was able to retain a similar number of FH cases, with less false positive retention. Generalization of the LR model was very good across all testing samples, suggesting it can be an effective screening tool if applied to different populations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Dutch lipid clinic network criteria; Familial hypercholesterolaemia; Logistic regression; Validation;
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore BIO/14 - Farmacologia
Settore MED/03 - Genetica Medica
			
	Data di pubblicazione
	
				ott-2023
			
	Data ahead of print o data di stampa
	
				28-set-2023
			
	Rivista in ANCE
	
				ATHEROSCLEROSIS
			
	DOI
	
				https://dx.doi.org/10.1016/j.atherosclerosis.2023.117314
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0021915023052358-main.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 1.73 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.73 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1008033

Citazioni

1

4

2

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca