IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

Motivation: Biomedical Entity Linking (BEL) maps mentions in biomedical text to standardized identifiers, enabling structured data integration and downstream knowledge discovery. However, current BEL systems remain fundamentally constrained by the recall of the initial candidate pool, where suboptimal retrieval limits the overall effectiveness of the normalization pipeline. Results: We present the first systematic evaluation of Generative Relevance Feedback (GRF) for enhancing candidate retrieval in state-of-the-art BEL systems. GRF leverages large language models (LLMs) to enrich the expressiveness of the mention in a zero-shot fashion. We assess GRF’s impact under two scenarios—direct linking prediction and candidate generation in cascading normalization pipelines—and analyze its sensitivity to different LLMs, feedback types, and integration strategies. Experiments across eight corpora and four biomedical knowledge bases demonstrate that integrating GRF significantly improves both accuracy and recall, thereby increasing the upper bound on normalization performance. Our findings highlight GRF as an efficient, model-agnostic solution and underscore its potential as a key component for advancing BEL.

Improving biomedical entity linking with generative relevance feedback / D. Shlyk, L. Hunter. - In: BIOINFORMATICS. - ISSN 1367-4803. - 42:2(2026 Feb), pp. btag011.1-btag011.11. [10.1093/bioinformatics/btag011]

Improving biomedical entity linking with generative relevance feedback

D. Shlyk^Primo;Hunter, Lawrence^Secondo

2026

Abstract

Motivation: Biomedical Entity Linking (BEL) maps mentions in biomedical text to standardized identifiers, enabling structured data integration and downstream knowledge discovery. However, current BEL systems remain fundamentally constrained by the recall of the initial candidate pool, where suboptimal retrieval limits the overall effectiveness of the normalization pipeline. Results: We present the first systematic evaluation of Generative Relevance Feedback (GRF) for enhancing candidate retrieval in state-of-the-art BEL systems. GRF leverages large language models (LLMs) to enrich the expressiveness of the mention in a zero-shot fashion. We assess GRF’s impact under two scenarios—direct linking prediction and candidate generation in cascading normalization pipelines—and analyze its sensitivity to different LLMs, feedback types, and integration strategies. Experiments across eight corpora and four biomedical knowledge bases demonstrate that integrating GRF significantly improves both accuracy and recall, thereby increasing the upper bound on normalization performance. Our findings highlight GRF as an efficient, model-agnostic solution and underscore its potential as a key component for advancing BEL.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Biomedical Entity Linking, Concept Normalization, Large Language Models, Information Retrieval;
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Data di pubblicazione
	
				feb-2026
			
	Data ahead of print o data di stampa
	
				14-gen-2026
			
	Rivista in ANCE
	
				BIOINFORMATICS
			
	DOI
	
				https://dx.doi.org/10.1093/bioinformatics/btag011
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
3652862.pdf accesso aperto Tipologia: Publisher's version/PDF Licenza: Creative commons Dimensione 1.97 MB Formato Adobe PDF Visualizza/Apri	1.97 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1234155

Citazioni

1

0

0

0

social impact