Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Bibliographic Details
Main authors: Troggio, Michela, Šurbanovski, Nada, Bianco, Luca, Moretto, Marco, Giongo, Lara, Banchi, Elisa, Viola, Roberto, Fernández, Felicidad Fernández, Costa, Fabrizio, Velasco, Riccardo, Cestaro, Alessandro, Sargent, Daniel James
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694884/
https://www.ncbi.nlm.nih.gov/pubmed/23826289
http://dx.doi.org/10.1371/journal.pone.0067407
_version_ 1782274907873214464
author Troggio, Michela
Šurbanovski, Nada
Bianco, Luca
Moretto, Marco
Giongo, Lara
Banchi, Elisa
Viola, Roberto
Fernández, Felicidad Fernández
Costa, Fabrizio
Velasco, Riccardo
Cestaro, Alessandro
Sargent, Daniel James
author_facet Troggio, Michela
Šurbanovski, Nada
Bianco, Luca
Moretto, Marco
Giongo, Lara
Banchi, Elisa
Viola, Roberto
Fernández, Felicidad Fernández
Costa, Fabrizio
Velasco, Riccardo
Cestaro, Alessandro
Sargent, Daniel James
author_sort Troggio, Michela
collection PubMed
description High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
format Online
Article
Text
id pubmed-3694884
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36948842013-07-03 Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin Troggio, Michela Šurbanovski, Nada Bianco, Luca Moretto, Marco Giongo, Lara Banchi, Elisa Viola, Roberto Fernández, Felicdad Fernández Costa, Fabrizio Velasco, Riccardo Cestaro, Alessandro Sargent, Daniel James PLoS One Research Article High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. 
An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. Public Library of Science 2013-06-27 /pmc/articles/PMC3694884/ /pubmed/23826289 http://dx.doi.org/10.1371/journal.pone.0067407 Text en © 2013 Troggio et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Troggio, Michela
Šurbanovski, Nada
Bianco, Luca
Moretto, Marco
Giongo, Lara
Banchi, Elisa
Viola, Roberto
Fernández, Felicidad Fernández
Costa, Fabrizio
Velasco, Riccardo
Cestaro, Alessandro
Sargent, Daniel James
Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title_full Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title_fullStr Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title_full_unstemmed Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title_short Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
title_sort evaluation of snp data from the malus infinium array identifies challenges for genetic analysis of complex genomes of polyploid origin
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694884/
https://www.ncbi.nlm.nih.gov/pubmed/23826289
http://dx.doi.org/10.1371/journal.pone.0067407
work_keys_str_mv AT troggiomichela evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT surbanovskinada evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT biancoluca evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT morettomarco evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT giongolara evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT banchielisa evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT violaroberto evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT fernandezfelicdadfernandez evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT costafabrizio evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT velascoriccardo evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT cestaroalessandro evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin
AT sargentdanieljames evaluationofsnpdatafromthemalusinfiniumarrayidentifieschallengesforgeneticanalysisofcomplexgenomesofpolyploidorigin