Common data models solve many challenges of standardizing electronic health record (EHR) data but are unable to semantically integrate all of the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OBO ontologies requires significant manual curation and domain expertise. We introduce OMOP2OBO, an algorithm for mapping Observational Medical Outcomes Partnership (OMOP) vocabularies to OBO ontologies. Using OMOP2OBO, we produced mappings for 92,367 conditions, 8611 drug ingredients, and 10,673 measurement results, which covered 68-99% of concepts used in clinical practice when examined across 24 hospitals. When used to phenotype rare disease patients, the mappings helped systematically identify undiagnosed patients who might benefit from genetic testing. By aligning OMOP vocabularies to OBO ontologies our algorithm presents new opportunities to advance EHR-based deep phenotyping.

Ontologizing health systems data at scale: making translational discovery a reality / T.J. Callahan, A.L. Stefanski, J.M. Wyrwa, C. Zeng, A. Ostropolets, J.M. Banda, W.A. Baumgartner, R.D. Boyce, E. Casiraghi, B.D. Coleman, J.H. Collins, S.J. Deakyne Davies, J.A. Feinstein, A.Y. Lin, B. Martin, N.A. Matentzoglu, D. Meeker, J. Reese, J. Sinclair, S.B. Taneja, K.E. Trinkley, N.A. Vasilevsky, A.E. Williams, X.A. Zhang, J.C. Denny, P.B. Ryan, G. Hripcsak, T.D. Bennett, M.A. Haendel, P.N. Robinson, L.E. Hunter, M.G. Kahn. - In: NPJ DIGITAL MEDICINE. - ISSN 2398-6352. - 6:1(2023), pp. 89.1-89.18. [10.1038/s41746-023-00830-x]

Ontologizing health systems data at scale: making translational discovery a reality

E. Casiraghi
Conceptualization
;
2023

Abstract

Common data models solve many challenges of standardizing electronic health record (EHR) data but are unable to semantically integrate all of the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OBO ontologies requires significant manual curation and domain expertise. We introduce OMOP2OBO, an algorithm for mapping Observational Medical Outcomes Partnership (OMOP) vocabularies to OBO ontologies. Using OMOP2OBO, we produced mappings for 92,367 conditions, 8611 drug ingredients, and 10,673 measurement results, which covered 68-99% of concepts used in clinical practice when examined across 24 hospitals. When used to phenotype rare disease patients, the mappings helped systematically identify undiagnosed patients who might benefit from genetic testing. By aligning OMOP vocabularies to OBO ontologies our algorithm presents new opportunities to advance EHR-based deep phenotyping.
Settore INF/01 - Informatica
2023
https://www.nature.com/articles/s41746-023-00830-x#citeas
Article (author)
File in questo prodotto:
File Dimensione Formato  
npj_digital_medicine_s41746-023-00830-x.pdf

accesso aperto

Descrizione: Article
Tipologia: Publisher's version/PDF
Dimensione 2.21 MB
Formato Adobe PDF
2.21 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/970159
Citazioni
  • ???jsp.display-item.citation.pmc??? 2
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact