IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

A Faster Algorithm for Motif Finding in Sequences from ChIP-Seq Data / F. Zambelli, G. Pavesi - In: Computational Intelligence Methods for Bioinformatics and Biostatistics / edited by E. Biganzoli, A. Vellido, F. Ambrogi, R. Tagliaferri. - [s.l] : Springer, 2012. - ISBN 978-3-642-35685-8. - pp. 201-212 (Paper presented at the 8th CIBB 2011 conference, held in Gargnano del Garda in 2011) [10.1007/978-3-642-35686-5_17].

A Faster Algorithm for Motif Finding in Sequences from ChIP-Seq Data

F. Zambelli (first author); G. Pavesi (last author)

2012

Abstract

Motif finding in nucleotide sequences, aimed at discovering over-represented transcription factor binding sites, is a very challenging problem from both the computational and the experimental points of view. Transcription factors recognize very weakly conserved sequence elements, which in typical applications are very hard to discriminate from random sequence similarities. Recent technologies such as ChIP-Seq can generate better datasets, in which the degree of conservation of binding sites is higher; on the other hand, the sheer size of these datasets poses new challenges for the design of efficient algorithms able to produce results in reasonable time. In this work we present an updated version of our algorithm Weeder, in which time and space requirements are significantly reduced and the accuracy of the results is notably improved.

	Keywords
				motif finding ; chip-seq ; motif discovery
	Scientific-disciplinary sectors of the contribution (display only)
				Sector INF/01 - Computer Science
				Sector BIO/11 - Molecular Biology
	Publication date
				2012
	DOI
				https://dx.doi.org/10.1007/978-3-642-35686-5_17
	Type
				Book Part (author)
	Appears in types:
				03 - Book contribution

Files in this item:

File	Size	Format
10.1007_978-3-642-35686-5_17.pdf (restricted access; type: Publisher's version/PDF)	303.56 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/219503
