Examining the Performance of ChatGPT 3.5 and Microsoft Copilot in Otolaryngology: A Comparative Study with Otolaryngologists’ Evaluation / M. Mayo-Yáñez, J.R. Lechien, A. Maria-Saibene, L.A. Vaira, A. Maniaci, C.M. Chiesa-Estomba. - In: INDIAN JOURNAL OF OTOLARYNGOLOGY AND HEAD AND NECK SURGERY. - ISSN 2231-3796. - (2024), pp. 1-5. [Epub ahead of print] [10.1007/s12070-024-04729-1]
Examining the Performance of ChatGPT 3.5 and Microsoft Copilot in Otolaryngology: A Comparative Study with Otolaryngologists’ Evaluation
A. Maria-Saibene;
2024
Abstract
To evaluate the response capabilities of ChatGPT 3.5 and an internet-connected GPT-4 engine (Microsoft Copilot) on a public healthcare system otolaryngology job competition examination, using the real scores of otolaryngology specialists as the control group. In September 2023, 135 questions, divided into theoretical and practical parts, were input into ChatGPT 3.5 and the internet-connected GPT-4. The accuracy of the AI responses was compared with the official results of the otolaryngologists who took the exam, and statistical analysis was conducted using Stata 14.2. Copilot (GPT-4) outperformed ChatGPT 3.5, scoring 88.5 points versus ChatGPT's 60 points. The incorrect answers of the two AIs showed discrepancies. Despite ChatGPT's proficiency, Copilot displayed superior performance, achieving the second-best score among the 108 otolaryngologists who took the exam, whereas ChatGPT ranked 83rd. A chat powered by GPT-4 with internet access (Copilot) demonstrates superior performance in answering multiple-choice medical questions compared with ChatGPT 3.5.

| File | Description | Type | Size | Format |
|---|---|---|---|---|
| chat gpt vs copilot (2024).pdf (restricted access) | Original Article | Publisher's version/PDF | 531.26 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.