Spinal Muscular Atrophy (SMA) is among the most common genetic neurological diseases that cause infant mortality. SMA is caused by deletion or mutations in the survival motor neuron 1 gene (SMN1), which are expected to generate alterations in RNA transcription, or splicing and most importantly reductions in mRNA transport within the axons of motor neurons (MNs). SMA ultimately results in the selective degeneration of MNs in spinal cord, but the underlying reason is still not clear entirely. The aim of this study is to investigate splicing abnormalities in SMA, and to identify genes presenting differential splicing possibly involved in the pathogenesis of SMA at genome-wide level. We performed RNA-Sequencing data analysis on 2 SMA patients and 2 controls, with 2 biological replicates each sample, derived from their induced Pluripotent Stem Cells-differentiated-MNs. Three types of analyses were executed. Firstly, differential expression analysis was performed to identify possibly mis-regulated genes using Cufflinks. Secondly, alternative splicing analysis was conducted to find differentially-used exons (DUEs; using DEXSeq) as splicing patterns are known to be altered in MNs by the suboptimal levels of SMN protein. Thirdly, we did RNA-binding protein (RBP) - motif discovery for the set of identified alternative cassette-DUEs, to pinpoint possible mechanisms of such alterations, specific to MNs. The gene ontology enrichment analysis of significant DEGs and alternative cassette-DUEs revealed various interesting terms including axon-guidance, muscle-contraction, microtubule-based transport, axon-cargo transport, synapse etc. which suggests their involvement in SMA. Further, promising results were obtained from motif analysis which has identified 22 RBPs out of which 7 RBPs namely, PABPC1, PABPC3, PABPC4, PABPC5, PABPN1, SART3 and KHDRBS1 are known for mRNAs stabilization and mRNA transport across MN-axon. Five RBPs from PABP family are known to interact directly with SMN protein that enhance mRNA transport in MNs. To validate our results specific wet-lab experiments are required, involving precise recognition of RNA-binding sites correspondent with our findings. Our work has provided a promising set of putative targets which might offer potential therapeutic role towards treating SMA. During the course of our study, we have observed that current methods for an effective understanding of differential splicing events within the transcriptomic landscape at high resolution are insufficient. To address this problem, we developed a computational model which has a potential to precisely estimate the “transcript expression levels” within a given gene locus by disentangling mature and nascent transcription contributions for each transcript at per base resolution. We modeled exonic and intronic read coverages by applying a non-linear computational model and estimated expression for each transcript, which best approximated the observed expression in total RNA-Seq data. The performance of our model was good in terms of computational processing time and memory usage. The application of our model is in the detection of differential splicing events. At exon level, differences in the ratio of the sum of mature and the sum of nascent transcripts over all the transcripts in a gene locus gives an indication of differential splicing. We have implemented our model in R-statistical language.

COMPUTATIONAL APPROACHES IN THE ESTIMATION AND ANALYSIS OF TRANSCRIPTS DIFFERENTIAL EXPRESSION AND SPLICING: APPLICATION TO SPINAL MUSCULAR ATROPHY / S. Vashisht ; added supervisor: F. BIANCHI - He has provided very useful suggestions and recommendations to improve my research work that I have presented in the form of yearly reports during my PhD project. In every discussion or during work presentations he appreciated my work and motivated me to keep on going with more good work external supervisor: ULE, JERNEJ - Due to similar area of study, he helped more specifically throughout my PhD project by analyzing my yearly work reports very deeply. Every discussion with him was very fruitful and helped me to draw strong conclusions from my results ; supervisor: U. Pozzoli. UNIVERSITA' DEGLI STUDI DI MILANO, 2017 Mar 02. 28. ciclo, Anno Accademico 2016. [10.13130/s-vashisht_phd2017-03-02].

COMPUTATIONAL APPROACHES IN THE ESTIMATION AND ANALYSIS OF TRANSCRIPTS DIFFERENTIAL EXPRESSION AND SPLICING: APPLICATION TO SPINAL MUSCULAR ATROPHY

S. Vashisht
2017

Abstract

Spinal Muscular Atrophy (SMA) is among the most common genetic neurological diseases that cause infant mortality. SMA is caused by deletion or mutations in the survival motor neuron 1 gene (SMN1), which are expected to generate alterations in RNA transcription, or splicing and most importantly reductions in mRNA transport within the axons of motor neurons (MNs). SMA ultimately results in the selective degeneration of MNs in spinal cord, but the underlying reason is still not clear entirely. The aim of this study is to investigate splicing abnormalities in SMA, and to identify genes presenting differential splicing possibly involved in the pathogenesis of SMA at genome-wide level. We performed RNA-Sequencing data analysis on 2 SMA patients and 2 controls, with 2 biological replicates each sample, derived from their induced Pluripotent Stem Cells-differentiated-MNs. Three types of analyses were executed. Firstly, differential expression analysis was performed to identify possibly mis-regulated genes using Cufflinks. Secondly, alternative splicing analysis was conducted to find differentially-used exons (DUEs; using DEXSeq) as splicing patterns are known to be altered in MNs by the suboptimal levels of SMN protein. Thirdly, we did RNA-binding protein (RBP) - motif discovery for the set of identified alternative cassette-DUEs, to pinpoint possible mechanisms of such alterations, specific to MNs. The gene ontology enrichment analysis of significant DEGs and alternative cassette-DUEs revealed various interesting terms including axon-guidance, muscle-contraction, microtubule-based transport, axon-cargo transport, synapse etc. which suggests their involvement in SMA. Further, promising results were obtained from motif analysis which has identified 22 RBPs out of which 7 RBPs namely, PABPC1, PABPC3, PABPC4, PABPC5, PABPN1, SART3 and KHDRBS1 are known for mRNAs stabilization and mRNA transport across MN-axon. Five RBPs from PABP family are known to interact directly with SMN protein that enhance mRNA transport in MNs. To validate our results specific wet-lab experiments are required, involving precise recognition of RNA-binding sites correspondent with our findings. Our work has provided a promising set of putative targets which might offer potential therapeutic role towards treating SMA. During the course of our study, we have observed that current methods for an effective understanding of differential splicing events within the transcriptomic landscape at high resolution are insufficient. To address this problem, we developed a computational model which has a potential to precisely estimate the “transcript expression levels” within a given gene locus by disentangling mature and nascent transcription contributions for each transcript at per base resolution. We modeled exonic and intronic read coverages by applying a non-linear computational model and estimated expression for each transcript, which best approximated the observed expression in total RNA-Seq data. The performance of our model was good in terms of computational processing time and memory usage. The application of our model is in the detection of differential splicing events. At exon level, differences in the ratio of the sum of mature and the sum of nascent transcripts over all the transcripts in a gene locus gives an indication of differential splicing. We have implemented our model in R-statistical language.
2-mar-2017
Settore MED/04 - Patologia Generale
RNA transcription; mRNA splicing; Alternative splicing; Alternative cassette exons; Neurodegenerative genetic disorder; Spinal Muscular Atrophy; Survival Motor Neuron 1 gene; RNA-Sequencing; Biological replicates; induced Pluripotent Stem Cells; Motor neurons; Differentially expressed genes; Differentially-used exon; Differential splicing; Gene Ontology enrichment; Motif; RNA-Binding Proteins; Non-linear computational model; Mature mRNA; Nascent mRNA; Exonic read coverage; Intronic read coverage
POZZOLI, UBERTO
BIANCHI, FABRIZIO
Doctoral Thesis
COMPUTATIONAL APPROACHES IN THE ESTIMATION AND ANALYSIS OF TRANSCRIPTS DIFFERENTIAL EXPRESSION AND SPLICING: APPLICATION TO SPINAL MUSCULAR ATROPHY / S. Vashisht ; added supervisor: F. BIANCHI - He has provided very useful suggestions and recommendations to improve my research work that I have presented in the form of yearly reports during my PhD project. In every discussion or during work presentations he appreciated my work and motivated me to keep on going with more good work external supervisor: ULE, JERNEJ - Due to similar area of study, he helped more specifically throughout my PhD project by analyzing my yearly work reports very deeply. Every discussion with him was very fruitful and helped me to draw strong conclusions from my results ; supervisor: U. Pozzoli. UNIVERSITA' DEGLI STUDI DI MILANO, 2017 Mar 02. 28. ciclo, Anno Accademico 2016. [10.13130/s-vashisht_phd2017-03-02].
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R10339.pdf

Open Access dal 26/07/2018

Descrizione: Tesi Dottorato
Tipologia: Tesi di dottorato completa
Dimensione 14.63 MB
Formato Adobe PDF
14.63 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/470076
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact