Recent advances in genome sequencing technologies provided unexpected opportunities to characterize individual genomic landscape and identify mutations relevant for diagnosis and therapy in clinics. Specifically, whole-exome sequencing for complex disease (such as tumor/normal matched sample) and target resequencing for Mendelian disease, using next-generation sequencing (NGS) technologies, are gaining popularity in the human genetics community due to the moderate costs and the huge quantity of information provided by each experiment. However, NGS data analysis still remains the crucial bottleneck in this approach because of the great amount of data containing millions of potential disease-causing variants and difficulties involving the integration of different sources. Here, we describe the application of a bioinformatics analysis pipeline for NGS data in two case studies about rare cardiac diseases. The case 1 is focused on the target sequencing of 158 candidate genes in 91 patients affected by Brugada Syndrome (BrS). To date the clinical phenotype is associated with mutations in the SCN5A gene but explain only the 30% of the BrS cases. Therefore we selected a panel of genes previously associated to cases of arrhythmogenic disorders in literature and we analysed them in a cohort of BrS patients, which were negative for known SCN5A mutations. We found 98 novel genetic variations and 60 clinical rs belonging to 70 genes. In particular we found 13 genes significantly mutated in our cohort compared to healthy controls in 1000 Genomes data that were not previously associated to BrS phenotype. In case study 2 we performed a whole-exome sequencing experiment of trio family where the child is affected by a severe cardiac disease with unclear diagnosis. We developed a specific bioinformatics pipeline to filter out the germline mutations. We found six genes with novel deleterious mutations that were homozygous in the affected child, and heterozygous in both parents. Among the six mutated genes the TRDN and UNC45A genes were already associated to cardiac dysfunctions in literature. Functional studies will be performed to evaluate the involvement of the mutated genes in the disease onset. In conclusion, we developed an automatic and versatile pipeline to analyse NGS data coming from whole-exome sequencing and target sequencing strategies. In addition, we integrated in the pipeline several public variation databases to evaluate and interpret the candidate mutations. The mutations found were validated by Sanger sequencing to evaluate the strength of the pipeline filters.

NEXT-GENERATION SEQUENCING APPROACH FOR IDENTIFICATION OF CANDIDATE GENES IN ARRHYTHMOGENIC DISEASES / A. Pietrelli ; tutore: C. Battaglia ; co-tutor: G. De Bellis ; direttore del dottorato: M. Clerici. DIPARTIMENTO DI BIOTECNOLOGIE MEDICHE E MEDICINA TRASLAZIONALE, 2014 Feb 04. 26. ciclo, Anno Accademico 2013. [10.13130/pietrelli-alessandro_phd2014-02-04].

NEXT-GENERATION SEQUENCING APPROACH FOR IDENTIFICATION OF CANDIDATE GENES IN ARRHYTHMOGENIC DISEASES

A. Pietrelli
2014

Abstract

Recent advances in genome sequencing technologies provided unexpected opportunities to characterize individual genomic landscape and identify mutations relevant for diagnosis and therapy in clinics. Specifically, whole-exome sequencing for complex disease (such as tumor/normal matched sample) and target resequencing for Mendelian disease, using next-generation sequencing (NGS) technologies, are gaining popularity in the human genetics community due to the moderate costs and the huge quantity of information provided by each experiment. However, NGS data analysis still remains the crucial bottleneck in this approach because of the great amount of data containing millions of potential disease-causing variants and difficulties involving the integration of different sources. Here, we describe the application of a bioinformatics analysis pipeline for NGS data in two case studies about rare cardiac diseases. The case 1 is focused on the target sequencing of 158 candidate genes in 91 patients affected by Brugada Syndrome (BrS). To date the clinical phenotype is associated with mutations in the SCN5A gene but explain only the 30% of the BrS cases. Therefore we selected a panel of genes previously associated to cases of arrhythmogenic disorders in literature and we analysed them in a cohort of BrS patients, which were negative for known SCN5A mutations. We found 98 novel genetic variations and 60 clinical rs belonging to 70 genes. In particular we found 13 genes significantly mutated in our cohort compared to healthy controls in 1000 Genomes data that were not previously associated to BrS phenotype. In case study 2 we performed a whole-exome sequencing experiment of trio family where the child is affected by a severe cardiac disease with unclear diagnosis. We developed a specific bioinformatics pipeline to filter out the germline mutations. We found six genes with novel deleterious mutations that were homozygous in the affected child, and heterozygous in both parents. Among the six mutated genes the TRDN and UNC45A genes were already associated to cardiac dysfunctions in literature. Functional studies will be performed to evaluate the involvement of the mutated genes in the disease onset. In conclusion, we developed an automatic and versatile pipeline to analyse NGS data coming from whole-exome sequencing and target sequencing strategies. In addition, we integrated in the pipeline several public variation databases to evaluate and interpret the candidate mutations. The mutations found were validated by Sanger sequencing to evaluate the strength of the pipeline filters.
4-feb-2014
Settore BIO/10 - Biochimica
bioinformatics; genomics ; Brugada syndrome ; NGS ; target sequencing
BATTAGLIA, CRISTINA
CLERICI, MARIO SALVATORE
Doctoral Thesis
NEXT-GENERATION SEQUENCING APPROACH FOR IDENTIFICATION OF CANDIDATE GENES IN ARRHYTHMOGENIC DISEASES / A. Pietrelli ; tutore: C. Battaglia ; co-tutor: G. De Bellis ; direttore del dottorato: M. Clerici. DIPARTIMENTO DI BIOTECNOLOGIE MEDICHE E MEDICINA TRASLAZIONALE, 2014 Feb 04. 26. ciclo, Anno Accademico 2013. [10.13130/pietrelli-alessandro_phd2014-02-04].
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R09193.pdf

Open Access dal 16/08/2015

Tipologia: Tesi di dottorato completa
Dimensione 4.21 MB
Formato Adobe PDF
4.21 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/231158
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact