Gene set analysis (GSA) aims at identifying gene sets whose cumulative expression is altered in the phenotype of interest. However, heterogeneous expression changes of single genes across samples can result in small pathway expression differences that are difficult to detect with state-of-art GSA algorithms. We recently showed that exploiting the bimodal behavior of RNA-sequencing gene expression profiles through data discretization allows for the identification of the altered biological processes, especially in presence of heterogeneity. Recently, this issue has begun to be exploited with single cell technologies. However, single-cell RNA-sequencing data are marked by technical noise and sparsity. To overcome these issues, we applied our “less is more” paradigm to single cell transcriptomes for the detection of altered gene sets. Here we present single-cell GSECA (scGSECA), a single cell extension of our Gene Set Enrichment Class Analysis. Applied to ~22.000 cells from 12 published studies from normal and tumor tissues, scGSECA highlighted the previously proposed bimodal behavior of single cell expression profiles. scGSECA exploits a cell-centric rather than gene-centric GSA. Non-zero expression levels are modeled as a superposition of gaussian distributions and discretized before statistical testing. We applied our method to synthetic data with controlled alterations and variable cohort sizes. We show that scGSECA provides higher performances than other available algorithms in detecting truly altered biological processes in large cohorts.

scGSECA: identification of altered biological processes in single-cell RNA-sequencing data by discretization of expression profiles / S. Perrone, S. Peirone, A. Lauria, M. Caselle, S. Oliviero, M. Cereda. ((Intervento presentato al convegno Società Italiana di Biofisica e Biologia Molecolare (SIBBM) Annual Meeting tenutosi a Roma nel 2022.

scGSECA: identification of altered biological processes in single-cell RNA-sequencing data by discretization of expression profiles

S. Peirone;M. Cereda
Ultimo
2022

Abstract

Gene set analysis (GSA) aims at identifying gene sets whose cumulative expression is altered in the phenotype of interest. However, heterogeneous expression changes of single genes across samples can result in small pathway expression differences that are difficult to detect with state-of-art GSA algorithms. We recently showed that exploiting the bimodal behavior of RNA-sequencing gene expression profiles through data discretization allows for the identification of the altered biological processes, especially in presence of heterogeneity. Recently, this issue has begun to be exploited with single cell technologies. However, single-cell RNA-sequencing data are marked by technical noise and sparsity. To overcome these issues, we applied our “less is more” paradigm to single cell transcriptomes for the detection of altered gene sets. Here we present single-cell GSECA (scGSECA), a single cell extension of our Gene Set Enrichment Class Analysis. Applied to ~22.000 cells from 12 published studies from normal and tumor tissues, scGSECA highlighted the previously proposed bimodal behavior of single cell expression profiles. scGSECA exploits a cell-centric rather than gene-centric GSA. Non-zero expression levels are modeled as a superposition of gaussian distributions and discretized before statistical testing. We applied our method to synthetic data with controlled alterations and variable cohort sizes. We show that scGSECA provides higher performances than other available algorithms in detecting truly altered biological processes in large cohorts.
English
20-giu-2022
Settore BIO/11 - Biologia Molecolare
Poster
Intervento inviato
Comitato scientifico
Pubblicazione scientifica
Società Italiana di Biofisica e Biologia Molecolare (SIBBM) Annual Meeting
Roma
2022
Convegno nazionale
manual
S. Perrone, S. Peirone, A. Lauria, M. Caselle, S. Oliviero, M. Cereda
scGSECA: identification of altered biological processes in single-cell RNA-sequencing data by discretization of expression profiles / S. Perrone, S. Peirone, A. Lauria, M. Caselle, S. Oliviero, M. Cereda. ((Intervento presentato al convegno Società Italiana di Biofisica e Biologia Molecolare (SIBBM) Annual Meeting tenutosi a Roma nel 2022.
Prodotti della ricerca::14 - Intervento a convegno non pubblicato
info:eu-repo/semantics/conferenceObject
none
Conference Object
6
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1050069
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact