RNA sequencing (RNAseq) has become the method of choice for transcriptome analysis, yet no consensus exists as to the most appropriate pipeline for its analysis, with current benchmarks suffering important limitations. Here, we address these challenges through a rich benchmarking resource harnessing (i) two RNAseq datasets including ERCC ExFold spike-ins; (ii) Nanostring measurements of a panel of 150 genes on the same samples; (iii) a set of internal, genetically-determined controls; (iv) a reanalysis of the SEQC dataset; and (v) a focus on relative quantification (i.e. across-samples). We use this resource to compare different approaches to each step of RNAseq analysis, from alignment to differential expression testing. We show that methods providing the best absolute quantification do not necessarily provide good relative quantification across samples, that count-based methods are superior for gene-level relative quantification, and that the new generation of pseudo-alignment-based software performs as well as established methods, at a fraction of the computing time. We also assess the impact of library type and size on quantification and differential expression analysis. Finally, we have created a R package and a web platform to enable the simple and streamlined application of this resource to the benchmarking of future methods. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

RNAontheBENCH : computational and empirical resources for benchmarking RNAseq quantification and differential expression methods / G. Testa, V. Das, P. Laise, A. Adamo, A. Vitriolo, P. Germain. - In: NUCLEIC ACIDS RESEARCH. - ISSN 1362-4962. - 44:11(2016 Jun), pp. 5054-5067. [10.1093/nar/gkw448]

RNAontheBENCH : computational and empirical resources for benchmarking RNAseq quantification and differential expression methods

G. Testa
Primo
;
V. Das
Secondo
;
A. Adamo;A. Vitriolo
Penultimo
;
2016

Abstract

RNA sequencing (RNAseq) has become the method of choice for transcriptome analysis, yet no consensus exists as to the most appropriate pipeline for its analysis, with current benchmarks suffering important limitations. Here, we address these challenges through a rich benchmarking resource harnessing (i) two RNAseq datasets including ERCC ExFold spike-ins; (ii) Nanostring measurements of a panel of 150 genes on the same samples; (iii) a set of internal, genetically-determined controls; (iv) a reanalysis of the SEQC dataset; and (v) a focus on relative quantification (i.e. across-samples). We use this resource to compare different approaches to each step of RNAseq analysis, from alignment to differential expression testing. We show that methods providing the best absolute quantification do not necessarily provide good relative quantification across samples, that count-based methods are superior for gene-level relative quantification, and that the new generation of pseudo-alignment-based software performs as well as established methods, at a fraction of the computing time. We also assess the impact of library type and size on quantification and differential expression analysis. Finally, we have created a R package and a web platform to enable the simple and streamlined application of this resource to the benchmarking of future methods. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Settore INF/01 - Informatica
Settore BIO/11 - Biologia Molecolare
Settore BIO/18 - Genetica
Settore BIO/13 - Biologia Applicata
giu-2016
Article (author)
File in questo prodotto:
File Dimensione Formato  
Nucl. Acids Res.-2016-Germain-5054-67-2.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 4.59 MB
Formato Adobe PDF
4.59 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/426469
Citazioni
  • ???jsp.display-item.citation.pmc??? 21
  • Scopus 28
  • ???jsp.display-item.citation.isi??? 29
social impact