RNA sequencing (RNA-Seq) has become the experimental standard in transcriptome studies. While most of the bioinformatic pipelines for the analysis of RNA-Seq data and the identification of significant changes in transcript abundance are based on the comparison of two conditions, it is common practice to perform several experiments in parallel (e.g. from different individuals, developmental stages, tissues), for the identification of genes showing a significant variation of expression across all the conditions studied. In this work we present RNentropy, a methodology based on information theory devised for this task, which given expression estimates from any number of RNA-Seq samples and conditions identifies genes or transcripts with a significant variation of expression across all the conditions studied, together with the samples in which they are over- or under-expressed. To show the capabilities offered by our methodology, we applied it to different RNA-Seq datasets: 48 biological replicates of two different yeast conditions; samples extracted from six human tissues of three individuals; seven different mouse brain cell types; human liver samples from six individuals. Results, and their comparison to different state of the art bioinformatic methods, show that RNentropy can provide a quick and in depth analysis of significant changes in gene expression profiles over any number of conditions.

RNentropy : an entropy-based tool for the detection of significant variation of gene expression across multiple RNA-Seq experiments / F. Zambelli, F. Mastropasqua, E. Picardi, A.M. D'Erchia, G. Pesole, G. Pavesi. - In: NUCLEIC ACIDS RESEARCH. - ISSN 0305-1048. - (2018 Jan 30). [Epub ahead of print] [10.1093/nar/gky055]

RNentropy : an entropy-based tool for the detection of significant variation of gene expression across multiple RNA-Seq experiments

F. Zambelli;G. Pesole;G. Pavesi
2018

Abstract

RNA sequencing (RNA-Seq) has become the experimental standard in transcriptome studies. While most of the bioinformatic pipelines for the analysis of RNA-Seq data and the identification of significant changes in transcript abundance are based on the comparison of two conditions, it is common practice to perform several experiments in parallel (e.g. from different individuals, developmental stages, tissues), for the identification of genes showing a significant variation of expression across all the conditions studied. In this work we present RNentropy, a methodology based on information theory devised for this task, which given expression estimates from any number of RNA-Seq samples and conditions identifies genes or transcripts with a significant variation of expression across all the conditions studied, together with the samples in which they are over- or under-expressed. To show the capabilities offered by our methodology, we applied it to different RNA-Seq datasets: 48 biological replicates of two different yeast conditions; samples extracted from six human tissues of three individuals; seven different mouse brain cell types; human liver samples from six individuals. Results, and their comparison to different state of the art bioinformatic methods, show that RNentropy can provide a quick and in depth analysis of significant changes in gene expression profiles over any number of conditions.
Settore BIO/11 - Biologia Molecolare
Settore INF/01 - Informatica
30-gen-2018
30-gen-2018
Article (author)
File in questo prodotto:
File Dimensione Formato  
gky055.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 2.64 MB
Formato Adobe PDF
2.64 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/549695
Citazioni
  • ???jsp.display-item.citation.pmc??? 12
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 22
social impact