A Faster Algorithm for Motif Finding in Sequences from ChIP-Seq Data / F. Zambelli, G. Pavesi. - In: Computational Intelligence Methods for Bioinformatics and Biostatistics / edited by E. Biganzoli, A. Vellido, F. Ambrogi, R. Tagliaferri. - [s.l.] : Springer, 2012. - ISBN 978-3-642-35685-8. - pp. 201-212. Paper presented at the 8th CIBB conference, held in Gargnano del Garda in 2011 [10.1007/978-3-642-35686-5_17].

A Faster Algorithm for Motif Finding in Sequences from ChIP-Seq Data

F. Zambelli (first author); G. Pavesi (last author)
2012

Abstract

Motif finding in nucleotide sequences, aimed at discovering over-represented transcription factor binding sites, is a challenging problem from both the computational and the experimental points of view. Transcription factors recognize weakly conserved sequence elements, which in typical applications are hard to discriminate from random sequence similarities. Recent technologies such as ChIP-Seq generate better datasets, in which binding sites show a higher degree of conservation; on the other hand, the sheer size of these datasets poses new challenges for the design of efficient algorithms able to produce results in reasonable time. In this work we present an updated version of our algorithm Weeder, in which time and space requirements are significantly reduced and the accuracy of the results is notably improved.
Keywords: motif finding; ChIP-Seq; motif discovery
Academic discipline INF/01 - Computer Science
Academic discipline BIO/11 - Molecular Biology
Book Part (author)

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/219503