Biomedical knowledge graphs (BioKGs) are widely applied in the biomedical field to represent biological entities and their relationships. Through their simple data model, they facilitate the integration of heterogeneous information and the development of downstream ML applications (e.g., node classification, link predictions, knowledge extraction). However, their construction by integrating structured data (e.g., relational data, json, csv) or extracting facts from scientific literature (e.g., PubMed articles, digital health records) requires many efforts, and intelligent tools supporting the user in this activity are still lacking. In this paper, we describe the activities that our group is carrying out at the University of Milan to support the construction of BioKGs by applying prompt engineering approaches in combination with general-purpose Large Language Models (LLMs) with the aim of reducing the generation of incorrect facts due to hallucinations that are not acceptable in sensitive areas like precision medicine.

Prompt Engineering Approaches for Working with Biomedical Knowledge Graphs through LLMs / M. Mesiti, E. Cavalleri, M. Castagna, P. Perlasca, D. Shlyk (CEUR WORKSHOP PROCEEDINGS). - In: Ital-IA-TW 2025 : Thematic Workshops at Ital-IA 2025 / [a cura di] L. Manzoni, L. Bortolussi, G. Cisotto, F. Anselmi. - [s.l] : CEUR-WS, 2025. - pp. 1-7 (( 5. National Conference on Artificial Intelligence, organized by CINI (Ital-IA 2025) Trieste 2025.

Prompt Engineering Approaches for Working with Biomedical Knowledge Graphs through LLMs

M. Mesiti
;
E. Cavalleri;M. Castagna;P. Perlasca;D. Shlyk
2025

Abstract

Biomedical knowledge graphs (BioKGs) are widely applied in the biomedical field to represent biological entities and their relationships. Through their simple data model, they facilitate the integration of heterogeneous information and the development of downstream ML applications (e.g., node classification, link predictions, knowledge extraction). However, their construction by integrating structured data (e.g., relational data, json, csv) or extracting facts from scientific literature (e.g., PubMed articles, digital health records) requires many efforts, and intelligent tools supporting the user in this activity are still lacking. In this paper, we describe the activities that our group is carrying out at the University of Milan to support the construction of BioKGs by applying prompt engineering approaches in combination with general-purpose Large Language Models (LLMs) with the aim of reducing the generation of incorrect facts due to hallucinations that are not acceptable in sensitive areas like precision medicine.
Biomedical Knowledge Graphs; Large Language Models; Knowledge Extraction; Prompt Engineering
Settore INFO-01/A - Informatica
2025
https://ceur-ws.org/Vol-4121/Ital-IA_2025_paper_104.pdf
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Ital-IA_2025_paper_104.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Licenza: Creative commons
Dimensione 1.19 MB
Formato Adobe PDF
1.19 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1204538
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact