Triplet-based similarity score for fully multilabeled trees with poly-occurring labels / S. Ciccolella, G. Bernardini, L. Denti, P. Bonizzoni, M. Previtali, G. Della Vedova. - In: BIOINFORMATICS. - ISSN 1367-4803. - 37:2(2021), pp. 178-184. [10.1093/bioinformatics/btaa676]

Triplet-based similarity score for fully multilabeled trees with poly-occurring labels

G. Bernardini (second author); 2021

Abstract

Motivation: Recent advances in cancer sequencing, together with the availability of a wide range of methods for inferring the evolutionary history of tumors, have made it important to evaluate, reconcile and cluster different tumor phylogenies. Several notions of distance or similarity have recently been proposed in the literature, but none has emerged as the gold standard. Moreover, none of the known similarity measures can handle mutations that occur multiple times in the tree, a circumstance that often arises in real cases.

Results: To overcome these limitations, we propose MP3, the first similarity measure for tumor phylogenies that can effectively handle cases where multiple mutations occur at the same time and where mutations occur multiple times. A comparison of MP3 with other measures shows that it correctly classifies similar and dissimilar trees, on both simulated and real data.

Availability and implementation: An open-source implementation of MP3 is publicly available at https://github.com/AlgoLab/mp3treesim.
algorithm; evolution; phylogeny; sequence analysis; software; tree
Sector INFO-01/A - Computer Science
2021
30 Jul 2020
Article (author)
Files in this record:
btaa676.pdf (open access; Publisher's version/PDF; Adobe PDF, 3.07 MB)

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/1131839