Biomedical knowledge graphs (BioKGs) are widely applied in the biomedical field to represent biological entities and their relationships. Through their simple data model, they facilitate the integration of heterogeneous information and the development of downstream ML applications (e.g., node classification, link predictions, knowledge extraction). However, their construction by integrating structured data (e.g., relational data, json, csv) or extracting facts from scientific literature (e.g., PubMed articles, digital health records) requires many efforts, and intelligent tools supporting the user in this activity are still lacking. In this paper, we describe the activities that our group is carrying out at the University of Milan to support the construction of BioKGs by applying prompt engineering approaches in combination with general-purpose Large Language Models (LLMs) with the aim of reducing the generation of incorrect facts due to hallucinations that are not acceptable in sensitive areas like precision medicine.
Prompt Engineering Approaches for Working with Biomedical Knowledge Graphs through LLMs / M. Mesiti, E. Cavalleri, M. Castagna, P. Perlasca, D. Shlyk (CEUR WORKSHOP PROCEEDINGS). - In: Ital-IA-TW 2025 : Thematic Workshops at Ital-IA 2025 / [a cura di] L. Manzoni, L. Bortolussi, G. Cisotto, F. Anselmi. - [s.l] : CEUR-WS, 2025. - pp. 1-7 (( 5. National Conference on Artificial Intelligence, organized by CINI (Ital-IA 2025) Trieste 2025.
Prompt Engineering Approaches for Working with Biomedical Knowledge Graphs through LLMs
M. Mesiti
;E. Cavalleri;M. Castagna;P. Perlasca;D. Shlyk
2025
Abstract
Biomedical knowledge graphs (BioKGs) are widely applied in the biomedical field to represent biological entities and their relationships. Through their simple data model, they facilitate the integration of heterogeneous information and the development of downstream ML applications (e.g., node classification, link predictions, knowledge extraction). However, their construction by integrating structured data (e.g., relational data, json, csv) or extracting facts from scientific literature (e.g., PubMed articles, digital health records) requires many efforts, and intelligent tools supporting the user in this activity are still lacking. In this paper, we describe the activities that our group is carrying out at the University of Milan to support the construction of BioKGs by applying prompt engineering approaches in combination with general-purpose Large Language Models (LLMs) with the aim of reducing the generation of incorrect facts due to hallucinations that are not acceptable in sensitive areas like precision medicine.| File | Dimensione | Formato | |
|---|---|---|---|
|
Ital-IA_2025_paper_104.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
1.19 MB
Formato
Adobe PDF
|
1.19 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




