Combining a set of phylogenetic trees into a single phylogenetic network that explains all of them is a fundamental challenge in evolutionary studies. In this paper, we apply the recently-introduced theoretical framework of cherry picking to design a class of heuristics that are guaranteed to produce a network containing each of the input trees, for practical-size datasets. The main contribution of this paper is the design and training of a machine learning model that captures essential information on the structure of the input trees and guides the algorithms towards better solutions. This is one of the first applications of machine learning to phylogenetic studies, and we show its promise with a proof-of-concept experimental study conducted on both simulated and real data consisting of binary trees with no missing taxa.

Reconstructing Phylogenetic Networks via Cherry Picking and Machine Learning / G. Bernardini, L. van Iersel, E. Julien, L. Stougie (LEIBNIZ INTERNATIONAL PROCEEDINGS IN INFORMATICS). - In: 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)[s.l] : Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, 2022. - ISBN 9783959772433. - pp. 16:1-16:22 (( Intervento presentato al 22. convegno International Workshop on Algorithms in Bioinformatics (WABI) tenutosi a Potsdam nel 2022 [10.4230/lipics.wabi.2022.16].

Reconstructing Phylogenetic Networks via Cherry Picking and Machine Learning

G. Bernardini
Primo
;
2022

Abstract

Combining a set of phylogenetic trees into a single phylogenetic network that explains all of them is a fundamental challenge in evolutionary studies. In this paper, we apply the recently-introduced theoretical framework of cherry picking to design a class of heuristics that are guaranteed to produce a network containing each of the input trees, for practical-size datasets. The main contribution of this paper is the design and training of a machine learning model that captures essential information on the structure of the input trees and guides the algorithms towards better solutions. This is one of the first applications of machine learning to phylogenetic studies, and we show its promise with a proof-of-concept experimental study conducted on both simulated and real data consisting of binary trees with no missing taxa.
Phylogenetics; Hybridization; Cherry Picking; Machine Learning; Heuristic
Settore INFO-01/A - Informatica
2022
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
LIPIcs.WABI.2022.16.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.3 MB
Formato Adobe PDF
1.3 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1131535
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact