Elastic Degenerate (ED) strings and Elastic Founder (EF) graphs are two versions of acyclic components of pangenomes. Both ED strings and EF graphs (which we collectively name variable strings) extend the well-known notion of indeterminate string. Recent work has extensively investigated algorithmic tasks over these structures, and over several other variable strings notions that they generalise. Among such tasks, the basic operation of matching a pattern into a text, which can serve as a toolkit for many pangenomic data analyses using these data structures, deserves special attention. In this paper we: (1) highlight a clear taxonomy within both ED strings and EF graphs ranging through variable strings of all types, from the linear string up to the most general one; (2) investigate the problem PvarT(X,Y) of matching a solid or variable pattern of type X into a variable text of type Y; (3) using as a reference the quadratic conditional lower bounds that are known for PvarT(solid,ED) and PvarT(solid,EF), for all possible types of variable strings X and Y we either prove the quadratic conditional lower bound for PvarT(X,Y), or provide non-trivial, often sub-quadratic, upper bounds, also exploiting the above-mentioned taxonomy.

A Unifying Taxonomy of Pattern Matching in Degenerate Strings and Founder Graphs / R. Ascone, G. Bernardini, A. Conte, M. Equi, E. Gabory, R. Grossi, N. Pisanti (LEIBNIZ INTERNATIONAL PROCEEDINGS IN INFORMATICS). - In: 24th International Workshop on Algorithms in Bioinformatics (WABI 2024) / [a cura di] S. P. Pissis, Wing-Kin Sung. - Wadern : Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2024. - ISBN 978-3-95977-340-9. - pp. 1-21 (( Intervento presentato al 24. convegno International Workshop on Algorithms in Bioinformatics, WABI tenutosi a London nel 2024 [10.4230/LIPIcs.WABI.2024.14].

A Unifying Taxonomy of Pattern Matching in Degenerate Strings and Founder Graphs

G. Bernardini
Secondo
;
2024

Abstract

Elastic Degenerate (ED) strings and Elastic Founder (EF) graphs are two versions of acyclic components of pangenomes. Both ED strings and EF graphs (which we collectively name variable strings) extend the well-known notion of indeterminate string. Recent work has extensively investigated algorithmic tasks over these structures, and over several other variable strings notions that they generalise. Among such tasks, the basic operation of matching a pattern into a text, which can serve as a toolkit for many pangenomic data analyses using these data structures, deserves special attention. In this paper we: (1) highlight a clear taxonomy within both ED strings and EF graphs ranging through variable strings of all types, from the linear string up to the most general one; (2) investigate the problem PvarT(X,Y) of matching a solid or variable pattern of type X into a variable text of type Y; (3) using as a reference the quadratic conditional lower bounds that are known for PvarT(solid,ED) and PvarT(solid,EF), for all possible types of variable strings X and Y we either prove the quadratic conditional lower bound for PvarT(X,Y), or provide non-trivial, often sub-quadratic, upper bounds, also exploiting the above-mentioned taxonomy.
degenerate string; fine-grained complexity; founder graph; Pangenomics; pattern matching
Settore INFO-01/A - Informatica
   Pan-genome Graph Algorithms and Data Integration
   PANGAIA
   European Commission
   Horizon 2020 Framework Programme
   872539

   ALgorithms for PAngenome Computational Analysis
   ALPACA
   European Commission
   Horizon 2020 Framework Programme
   956229
2024
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
LIPIcs.WABI.2024.14.pdf

accesso aperto

Descrizione: Article
Tipologia: Publisher's version/PDF
Dimensione 1.08 MB
Formato Adobe PDF
1.08 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1131438
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact