Deep Learning (DL) models for automatic ECG interpretation became widely investigated in recent years. However, their performance varies highly across models and datasets. One of the main reasons is the possibility that the DL model might learn spurious correlations present in a dataset between inputs and outcomes. In this study, we proposed a novel training strategy potentially able to force the domain knowledge into a DL model, by complementing, only during training, an end-to-end approach with features known to be relevant for the outcome. We tested the approach for the creation of a DL model tuned to identify myocardial infarction (MI)from the standard 12-lead electrocardiograms (ECGs). Two models were trained: one with standard backpropagation (full model) and the second one (split model) with the proposed approach, on the PTB Diagnostic ECG Database. An explainable AI technique was then used to identify which ECG leads were considered relevant by both models for each MI site, and were compared with guidelines for MI site identification. The validation accuracy was 0.85 and 0.69 for full and split models, respectively. Despite the lower performance achieved with the proposed approach, the number of relevant leads was higher (10 vs 4), suggesting that the domain knowledge was likely percolated into the DL model, made it more robust and capable of better generalization on other dataset.
Injecting Domain Knowledge in Deep Learning Models for Automatic Identification of Myocardial Infarction from Electrocardiograms / S. Ibrahimi, M.W. Rivolta, R. Sassi - In: 2023 Computing in Cardiology (CinC)[s.l] : IEEE, 2023. - ISBN 979-8-3503-8252-5. - pp. 1-4 (( Intervento presentato al 50. convegno Computing in Cardiology tenutosi a Atlanta nel 2023 [10.22489/CinC.2023.388].
Injecting Domain Knowledge in Deep Learning Models for Automatic Identification of Myocardial Infarction from Electrocardiograms
S. Ibrahimi
;M.W. Rivolta;R. Sassi
2023
Abstract
Deep Learning (DL) models for automatic ECG interpretation became widely investigated in recent years. However, their performance varies highly across models and datasets. One of the main reasons is the possibility that the DL model might learn spurious correlations present in a dataset between inputs and outcomes. In this study, we proposed a novel training strategy potentially able to force the domain knowledge into a DL model, by complementing, only during training, an end-to-end approach with features known to be relevant for the outcome. We tested the approach for the creation of a DL model tuned to identify myocardial infarction (MI)from the standard 12-lead electrocardiograms (ECGs). Two models were trained: one with standard backpropagation (full model) and the second one (split model) with the proposed approach, on the PTB Diagnostic ECG Database. An explainable AI technique was then used to identify which ECG leads were considered relevant by both models for each MI site, and were compared with guidelines for MI site identification. The validation accuracy was 0.85 and 0.69 for full and split models, respectively. Despite the lower performance achieved with the proposed approach, the number of relevant leads was higher (10 vs 4), suggesting that the domain knowledge was likely percolated into the DL model, made it more robust and capable of better generalization on other dataset.File | Dimensione | Formato | |
---|---|---|---|
cinc2023_SI_MR_RS.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
227.82 kB
Formato
Adobe PDF
|
227.82 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.