Several bioinformatics methods have been proposed for the detection and characterization of genomic structural variation from ultra-high throughput genome resequencing data. Although some of these methods demonstrate reasonably high specificity, the sensitivity of available approaches is rather low. We propose a novel method for the identification of genomic structural variation from high throughput paired end genome resequencing data. While utilizing deviations from expected library insert sizes, our approach employs additional information from local patterns of read mapping and supervised learning to predict the position and nature of structural variants. We show that our method shows notably increased sensitivity at no cost in specificity with respect to existing insert size-based tools in the identification of structural variants in the human genome. Furthermore, we show that the additional information incorporated in our approach allow us to make reliable predictions of very short insertions and deletions that are otherwise only recovered by approaches based on the split mapping of resequencing reads.

Characterizing Structural Variation in Genomes (from humans to crops) / M. Chiara, D.S. Horner. ((Intervento presentato al 6. convegno European Workshop Genomics for research and molecular diagnostic tenutosi a Lodi nel 2011.

Characterizing Structural Variation in Genomes (from humans to crops)

M. Chiara
Primo
;
D.S. Horner
Ultimo
2011-10-13

Abstract

Several bioinformatics methods have been proposed for the detection and characterization of genomic structural variation from ultra-high throughput genome resequencing data. Although some of these methods demonstrate reasonably high specificity, the sensitivity of available approaches is rather low. We propose a novel method for the identification of genomic structural variation from high throughput paired end genome resequencing data. While utilizing deviations from expected library insert sizes, our approach employs additional information from local patterns of read mapping and supervised learning to predict the position and nature of structural variants. We show that our method shows notably increased sensitivity at no cost in specificity with respect to existing insert size-based tools in the identification of structural variants in the human genome. Furthermore, we show that the additional information incorporated in our approach allow us to make reliable predictions of very short insertions and deletions that are otherwise only recovered by approaches based on the split mapping of resequencing reads.
Settore BIO/11 - Biologia Molecolare
Characterizing Structural Variation in Genomes (from humans to crops) / M. Chiara, D.S. Horner. ((Intervento presentato al 6. convegno European Workshop Genomics for research and molecular diagnostic tenutosi a Lodi nel 2011.
Conference Object
File in questo prodotto:
File Dimensione Formato  
Lodi_SV_talk-2.odp

accesso solo dalla rete interna

Tipologia: Altro
Dimensione 368.37 kB
Formato OpenDocument Presentation
368.37 kB OpenDocument Presentation   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/2434/172593
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact