Transposable elements (TEs) are mobile DNA repeats known to shape the evolution of eukaryotic genomes. In complex organisms, they exhibit tissue-specific transcription. However, understanding their role in cellular diversity across most tissues remains a challenge, when employing single-cell RNA sequencing (scRNA-seq), due to their widespread presence and genetic similarity. To address this, we present IRescue (Interspersed Repeats single-cell quantifier), a software capable of estimating the expression of TE subfamilies at the single-cell level. IRescue incorporates a unique UMI deduplication algorithm to rectify sequencing errors and employs an Expectation-Maximization procedure to effectively redistribute the counts of multi-mapping reads. Our study showcases the precision of IRescue through analysis of both simulated and real single cell and nuclei RNA-seq data from human colorectal cancer, brain, skin aging, and PBMCs during SARS-CoV-2 infection and recovery. By linking the expression patterns of TE signatures to specific conditions and biological contexts, we unveil insights into their potential roles in cellular heterogeneity and disease progression.

IRescue: uncertainty-aware quantification of transposable elements expression at single cell level / B. Polimeni, F. Marasca, V. Ranzani, B. Bodega. - In: NUCLEIC ACIDS RESEARCH. - ISSN 0305-1048. - 52:19(2024 Oct 28), pp. e93.1-e93.15. [10.1093/nar/gkae793]

IRescue: uncertainty-aware quantification of transposable elements expression at single cell level

B. Polimeni
Primo
;
F. Marasca;B. Bodega
Ultimo
2024

Abstract

Transposable elements (TEs) are mobile DNA repeats known to shape the evolution of eukaryotic genomes. In complex organisms, they exhibit tissue-specific transcription. However, understanding their role in cellular diversity across most tissues remains a challenge, when employing single-cell RNA sequencing (scRNA-seq), due to their widespread presence and genetic similarity. To address this, we present IRescue (Interspersed Repeats single-cell quantifier), a software capable of estimating the expression of TE subfamilies at the single-cell level. IRescue incorporates a unique UMI deduplication algorithm to rectify sequencing errors and employs an Expectation-Maximization procedure to effectively redistribute the counts of multi-mapping reads. Our study showcases the precision of IRescue through analysis of both simulated and real single cell and nuclei RNA-seq data from human colorectal cancer, brain, skin aging, and PBMCs during SARS-CoV-2 infection and recovery. By linking the expression patterns of TE signatures to specific conditions and biological contexts, we unveil insights into their potential roles in cellular heterogeneity and disease progression.
Settore BIOS-08/A - Biologia molecolare
   One Health Basic and Translational Research Actions addressing Unmet Need on Emerging Infectious Diseases (INF-ACT)
   INF-ACT
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   PE00000007

   Reverting the T-cell exhaustion mediated by LINE1 in tumor microenvironment (1° anno)
   FONDAZIONE AIRC PER LA RICERCA SUL CANCRO ETS
   IG 2022 ID 27066
28-ott-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
gkae793.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 2.59 MB
Formato Adobe PDF
2.59 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1116081
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact