Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to automatically construct them. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoints and abstraction algorithms), and benchmarks (e.g., prebuilt KGs and embeddings). We evaluate the ecosystem by surveying open-source KG construction methods and analyzing its computational performance when constructing 12 large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.

An Open-Source Knowledge Graph Ecosystem for the Life Sciences / T.J. Callahan, I.J. Tripodi, A.L. Stefanski, L. Cappelletti, S.B. Taneja, J.M. Wyrwa, E. Casiraghi, N.A. Matentzoglu, J. Reese, J.C. Silverstein, C. Tapley Hoyt, R.D. Boyce, S.A. Malec, D.R. Unni, M.P. Joachimiak, P.N. Robinson, C.J. Mungall, E. Cavalleri, T. Fontana, G. Valentini, M. Mesiti, L.A. Gillenwater, B. Santangelo, N.A. Vasilevsky, R. Hoehndorf, T.D. Bennett, P.B. Ryan, G. Hripcsak, M.G. Kahn, M. Bada, W.A. Baumgartner Jr, L.E. Hunter. - (2023 Jul 11). [10.48550/arXiv.2307.05727]

An Open-Source Knowledge Graph Ecosystem for the Life Sciences

E. Casiraghi;E. Cavalleri;G. Valentini;M. Mesiti;
2023

Abstract

Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to automatically construct them. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoints and abstraction algorithms), and benchmarks (e.g., prebuilt KGs and embeddings). We evaluate the ecosystem by surveying open-source KG construction methods and analyzing its computational performance when constructing 12 large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.
Computer Science - Artificial Intelligence; Computer Science - Artificial Intelligence; Computer Science - Computational Engineering; Finance; Science
Settore INF/01 - Informatica
11-lug-2023
http://arxiv.org/abs/2307.05727v1
File in questo prodotto:
File Dimensione Formato  
2307.05727.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 12.14 MB
Formato Adobe PDF
12.14 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1022001
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact