IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

In this paper, we study human–AI collaboration protocols, a design-oriented construct aimed at establishing and evaluating how humans and AI can collaborate in cognitive tasks. We applied this construct in two user studies involving 12 specialist radiologists (the knee MRI study) and 44 ECG readers of varying expertise (the ECG study), who evaluated 240 and 20 cases, respectively, in different collaboration configurations. We confirm the utility of AI support but find that XAI can be associated with a “white-box paradox”, producing a null or detrimental effect. We also find that the order of presentation matters: AI-first protocols are associated with higher diagnostic accuracy than human-first protocols, and with higher accuracy than both humans and AI alone. Our findings identify the best conditions for AI to augment human diagnostic skills, rather than trigger dysfunctional responses and cognitive biases that can undermine decision effectiveness.

Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis / F. Cabitza, A. Campagner, L. Ronzio, M. Cameli, G.E. Mandoli, M.C. Pastore, L.M. Sconfienza, D. Folgado, M. Barandas, H. Gamboa. - In: ARTIFICIAL INTELLIGENCE IN MEDICINE. - ISSN 0933-3657. - 138:(2023 Apr), pp. 102506.1-102506.13. [10.1016/j.artmed.2023.102506]

Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis

Cabitza, Federico;Campagner, Andrea;Ronzio, Luca;Cameli, Matteo;Mandoli, Giulia Elena;Pastore, Maria Concetta;L.M. Sconfienza;Folgado, Duarte;Barandas, Marília;Gamboa, Hugo

2023

Abstract

In this paper, we study human–AI collaboration protocols, a design-oriented construct aimed at establishing and evaluating how humans and AI can collaborate in cognitive tasks. We applied this construct in two user studies involving 12 specialist radiologists (the knee MRI study) and 44 ECG readers of varying expertise (the ECG study), who evaluated 240 and 20 cases, respectively, in different collaboration configurations. We confirm the utility of AI support but find that XAI can be associated with a “white-box paradox”, producing a null or detrimental effect. We also find that the order of presentation matters: AI-first protocols are associated with higher diagnostic accuracy than human-first protocols, and with higher accuracy than both humans and AI alone. Our findings identify the best conditions for AI to augment human diagnostic skills, rather than trigger dysfunctional responses and cognitive biases that can undermine decision effectiveness.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Artificial intelligence; Automation bias; Cognitive biases; Explainable AI; Human–AI collaboration protocols;
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore MED/36 - Diagnostica per Immagini e Radioterapia
Settore INF/01 - Informatica
			
	Data di pubblicazione
	
				apr-2023
			
	Rivista in ANCE
	
				ARTIFICIAL INTELLIGENCE IN MEDICINE
			
	DOI
	
				https://dx.doi.org/10.1016/j.artmed.2023.102506
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0933365723000209-main.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 2.63 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.63 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/954451

Citazioni

1

37

29

social impact