Genome analysis based on next generation sequencing (NGS) technologies provides a novel approach for surveying molecular diversity among individuals, which in turn can generate tools for linkage and association mapping, gene cloning, molecular breeding, population genetics, germplasm management, and crop systematics and evolution. 'De novo' assembly of short reads is challenging in general and even more so as the size and complexity of genomes increase. A high quality and well annotated reference genome sequence can help solve most of the conflicts. Yet, the identification of several structural variants, such as the movement of transposable elements, large insertions/deletions, segmental duplications, inversions and other genomic features is still a challenge to algorithms and automatic procedures. We sequenced 14 Prunus accessions that include ten peach cultivars, two wild peach-related species, one almond and one apricot accession using the NGS Illumina platform. We produced 64 to 109 bp long single reads as well as paired ends from approx. 300-500 bp long fragments. The coverage varied from approximately 16 to 75 genome equivalents. Individual genomes were aligned using the reference sequence of the doubled haploid peach cultivar 'Lovell', recently released by the International Peach Genome Initiative (IPGI) (http://www.rosaceae. org/peach/genome). In this paper we present a repertoire of molecular variants that can be mined, namely SNPs (Single Nucleotide Polymorphisms), DIPs (Deletion/Insertion Polymorphisms), larger structural variations, which include movement of transposable elements, the so called copy-number variations, segmental duplications and others. Some of these variants, such as SNPs, are easily detectable and much commercial and open-access software can perform the search. Others variants, such as the large structural variations, still need analytical approaches to be implemented or improved. For several variants, theoretical and methodological approaches are presented and discussed and, when available, preliminary results are reported.

A catalog of molecular diversity of Prunus germplasm gathered from aligning NGS reads to the peach reference sequence : bioinformatic approaches and challenges / S. Scalabrin, A. Policriti, F. Nadalin, S. Pinosio, F. Cattonaro, E. Vendramin, V. Aramini, I. Verde, D. Bassi, R. Pirona, L. Rossini, G. Cipriani, R. Testolin, M. Morgante. - In: ACTA HORTICULTURAE. - ISSN 0567-7572. - 976:976(2013), pp. 169-176. ((Intervento presentato al 13. convegno Eucarpia Symposium on Fruit Breeding and Genetics tenutosi a Warsaw, Poland nel 2011.

A catalog of molecular diversity of Prunus germplasm gathered from aligning NGS reads to the peach reference sequence : bioinformatic approaches and challenges

D. Bassi;L. Rossini;
2013

Abstract

Genome analysis based on next generation sequencing (NGS) technologies provides a novel approach for surveying molecular diversity among individuals, which in turn can generate tools for linkage and association mapping, gene cloning, molecular breeding, population genetics, germplasm management, and crop systematics and evolution. 'De novo' assembly of short reads is challenging in general and even more so as the size and complexity of genomes increase. A high quality and well annotated reference genome sequence can help solve most of the conflicts. Yet, the identification of several structural variants, such as the movement of transposable elements, large insertions/deletions, segmental duplications, inversions and other genomic features is still a challenge to algorithms and automatic procedures. We sequenced 14 Prunus accessions that include ten peach cultivars, two wild peach-related species, one almond and one apricot accession using the NGS Illumina platform. We produced 64 to 109 bp long single reads as well as paired ends from approx. 300-500 bp long fragments. The coverage varied from approximately 16 to 75 genome equivalents. Individual genomes were aligned using the reference sequence of the doubled haploid peach cultivar 'Lovell', recently released by the International Peach Genome Initiative (IPGI) (http://www.rosaceae. org/peach/genome). In this paper we present a repertoire of molecular variants that can be mined, namely SNPs (Single Nucleotide Polymorphisms), DIPs (Deletion/Insertion Polymorphisms), larger structural variations, which include movement of transposable elements, the so called copy-number variations, segmental duplications and others. Some of these variants, such as SNPs, are easily detectable and much commercial and open-access software can perform the search. Others variants, such as the large structural variations, still need analytical approaches to be implemented or improved. For several variants, theoretical and methodological approaches are presented and discussed and, when available, preliminary results are reported.
Marker-assisted selection; Next generation sequencing; Single nucleotide polymorphism; SNP DNA arrays; Structural variants
Settore AGR/07 - Genetica Agraria
Settore AGR/03 - Arboricoltura Generale e Coltivazioni Arboree
2013
Article (author)
File in questo prodotto:
File Dimensione Formato  
acta 976.pdf

accesso solo dalla rete interna

Tipologia: Publisher's version/PDF
Dimensione 310.81 kB
Formato Adobe PDF
310.81 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/220084
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
social impact