High-quality, genome-wide SNP genotypic data for pedigreed germplasm of the diploid outbreeding species apple, peach, and sweet cherry through a common workflow / S. Vanderzande, N.P. Howard, L. Cai, C. Da Silva Linge, L. Antanaviciute, M.C.A.M. Bink, J.W. Kruisselbrink, N. Bassil, K. Gasic, A. Iezzoni, E. Van de Weg, C. Peace. - In: PLOS ONE. - ISSN 1932-6203. - 14:6(2018), pp. e0210928.1-e0210928.33. [10.1371/journal.pone.0210928]

High-quality, genome-wide SNP genotypic data for pedigreed germplasm of the diploid outbreeding species apple, peach, and sweet cherry through a common workflow

C. Da Silva Linge;
2018

Abstract

High-quality genotypic data are a requirement for many genetic analyses. For any crop, errors in genotype calls, marker phasing, linkage maps, and pedigree records, as well as unnoticed variation in ploidy levels, can lead to spurious marker-locus-trait associations and incorrect assignment of allele origin to individuals. High-throughput genotyping requires automated scoring, as manual inspection of thousands of scored loci is too time-consuming. However, automated SNP scoring can introduce errors that should be corrected so that the recorded genotypic data are accurate and downstream genetic analyses can be trusted. To enable quick identification of errors in a large genotypic data set, we have developed a comprehensive workflow. This multi-step workflow is based on inheritance principles and on the removal of markers and individuals that do not follow these principles, as demonstrated here for apple, peach, and sweet cherry. Genotypic data were obtained on pedigreed germplasm using 6-9K SNP arrays for each crop, and a subset of well-performing SNPs was created using ASSIsT. Use of correct (and corrected) pedigree records readily identified violations of simple inheritance principles in the genotypic data, a step streamlined with FlexQTL software. Retained SNPs were grouped into haploblocks to increase the information content of single alleles and reduce the computational power needed in downstream genetic analyses. Haploblock borders were defined by recombination locations detected in ancestral generations of cultivars and selections. A second round of inheritance checking was then conducted on the haploblock alleles (i.e., haplotypes). High-quality genotypic data sets were created using this workflow for pedigreed collections representing the U.S. breeding germplasm of apple, peach, and sweet cherry evaluated within the RosBREED project. These data sets contain 3855, 4005, and 1617 SNPs spread over 932, 103, and 196 haploblocks in apple, peach, and sweet cherry, respectively.
The highly curated phased SNP and haplotype data sets, as well as the raw iScan data, of germplasm in the apple, peach, and sweet cherry Crop Reference Sets are available through the Genome Database for Rosaceae.
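
The inheritance-checking idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' workflow and not how ASSIsT or FlexQTL work internally; it only shows the underlying principle for a diploid, biallelic SNP: a child's genotype must be obtainable by taking one allele from each recorded parent. The function names, the two-letter genotype encoding, and the toy individuals below are all hypothetical.

```python
from itertools import product


def is_mendelian_consistent(child, parent1, parent2):
    """Return True if a diploid child genotype can arise from the two parents.

    Genotypes are two-character allele strings such as "AB" (an assumed
    encoding). A missing call (None) cannot be tested and is treated as
    consistent.
    """
    if child is None or parent1 is None or parent2 is None:
        return True
    # Every genotype obtainable from one allele of each parent, order-free.
    possible = {tuple(sorted(pair)) for pair in product(parent1, parent2)}
    return tuple(sorted(child)) in possible


def count_violations(genotypes, pedigree):
    """Count inheritance violations per SNP across all parent-child trios.

    genotypes: {individual: {snp: genotype}}
    pedigree:  {child: (parent1, parent2)}
    """
    counts = {}
    for child, (p1, p2) in pedigree.items():
        for snp, call in genotypes[child].items():
            ok = is_mendelian_consistent(
                call, genotypes[p1].get(snp), genotypes[p2].get(snp)
            )
            if not ok:
                counts[snp] = counts.get(snp, 0) + 1
    return counts


# Hypothetical toy data: two parents and two offspring scored at two SNPs.
genotypes = {
    "P1": {"snp1": "AA", "snp2": "AB"},
    "P2": {"snp1": "BB", "snp2": "AB"},
    "C1": {"snp1": "AB", "snp2": "BB"},
    "C2": {"snp1": "AA", "snp2": "AA"},  # snp1 "AA" is impossible from AA x BB
}
pedigree = {"C1": ("P1", "P2"), "C2": ("P1", "P2")}

violations = count_violations(genotypes, pedigree)
print(violations)
```

In a sketch like this, a SNP with violations in many trios points to a problematic marker, whereas one individual with violations across many SNPs points to an incorrect pedigree record. That distinction is the kind of marker-versus-individual decision the abstract describes.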
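Language: English
Scientific sector: AGR/07 - Agricultural Genetics
Type: Article
Peer review: Anonymous experts
Category: Scientific publication
Year: 2018
Publisher: Public Library of Science
Volume: 14
Issue: 6
Article number: e0210928
Pages: 1-33 (33 pages)
Status: Published
Journal relevance: International
Databases: Scopus, PubMed, Crossref, WoS, DataCite
Resource type: info:eu-repo/semantics/article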
Files in this record:
8. Vanderzande et al. 2019.pdf (open access)
Description: Research Article
Type: Publisher's version/PDF
Size: 1.28 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/972089
Citations
  • PubMed Central 29
  • Scopus 59
  • Web of Science 54