Heterogeneity is a fundamental feature of complex phenotypes. So far, genomic screenings have profiled thousands of samples providing insights into the transcriptome of the cell. However, disentangling the heterogeneity of these transcriptomic Big Data to identify defective biological processes remains challenging. Here we present GSECA, a method exploiting the bimodal behavior of RNA-sequencing gene expression profiles to identify altered gene sets in heterogeneous patient cohorts. Using simulated and experimental RNA-sequencing data sets, we show that GSECA provides higher performances than other available algorithms in detecting truly altered biological processes in large cohorts. Applied to 5941 samples from 14 different cancer types, GSECA correctly identified the alteration of the PI3K/AKT signaling pathway driven by the somatic loss of PTEN and verified the emerging role of PTEN in modulating immune-related processes. In particular, we showed that, in prostate cancer, PTEN loss appears to establish an immunosuppressive tumor microenvironment through the activation of STAT3, and low PTEN expression levels have a detrimental impact on patient disease-free survival. GSECA is available at https://github.com/matteocereda/GSECA.

Identification of altered biological processes in heterogeneous RNA-sequencing data by discretization of expression profiles / A. Lauria, S. Peirone, M.D. Giudice, F. Priante, P. Rajan, M. Caselle, S. Oliviero, M. Cereda. - In: NUCLEIC ACIDS RESEARCH. - ISSN 0305-1048. - 48:4(2020 Feb 28), pp. 1730-1747. [10.1093/nar/gkz1208]

Identification of altered biological processes in heterogeneous RNA-sequencing data by discretization of expression profiles

M. Cereda
Ultimo
2020

Abstract

Heterogeneity is a fundamental feature of complex phenotypes. So far, genomic screenings have profiled thousands of samples providing insights into the transcriptome of the cell. However, disentangling the heterogeneity of these transcriptomic Big Data to identify defective biological processes remains challenging. Here we present GSECA, a method exploiting the bimodal behavior of RNA-sequencing gene expression profiles to identify altered gene sets in heterogeneous patient cohorts. Using simulated and experimental RNA-sequencing data sets, we show that GSECA provides higher performances than other available algorithms in detecting truly altered biological processes in large cohorts. Applied to 5941 samples from 14 different cancer types, GSECA correctly identified the alteration of the PI3K/AKT signaling pathway driven by the somatic loss of PTEN and verified the emerging role of PTEN in modulating immune-related processes. In particular, we showed that, in prostate cancer, PTEN loss appears to establish an immunosuppressive tumor microenvironment through the activation of STAT3, and low PTEN expression levels have a detrimental impact on patient disease-free survival. GSECA is available at https://github.com/matteocereda/GSECA.
Cell Line, Tumor; Disease-Free Survival; Gene Expression Regulation; Humans; Internet; PTEN Phosphohydrolase; RNA; STAT3 Transcription Factor; Sequence Analysis, RNA; Signal Transduction; Software; Transcriptome; Tumor Microenvironment; Whole Exome Sequencing; Big Data
Settore BIO/11 - Biologia Molecolare
Settore MED/06 - Oncologia Medica
28-feb-2020
Article (author)
File in questo prodotto:
File Dimensione Formato  
gkz1208.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 7.95 MB
Formato Adobe PDF
7.95 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/898610
Citazioni
  • ???jsp.display-item.citation.pmc??? 6
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 8
  • OpenAlex ND
social impact