Cargando…

Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species

Norway spruce (Picea abies L. Karst) is one of the most important forest tree species with significant economic and ecological impact in Europe. For decades, genomic and genetic studies on Norway spruce have been challenging due to the large and repetitive genome (19.6 Gb with more than 70% being re...

Descripción completa

Detalles Bibliográficos
Autores principales: Bernhardsson, Carolina, Zan, Yanjun, Chen, Zhiqiang, Ingvarsson, Pär K., Wu, Harry X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7984398/
https://www.ncbi.nlm.nih.gov/pubmed/33179386
http://dx.doi.org/10.1111/1755-0998.13292
_version_ 1783668059065548800
author Bernhardsson, Carolina
Zan, Yanjun
Chen, Zhiqiang
Ingvarsson, Pär K.
Wu, Harry X.
author_facet Bernhardsson, Carolina
Zan, Yanjun
Chen, Zhiqiang
Ingvarsson, Pär K.
Wu, Harry X.
author_sort Bernhardsson, Carolina
collection PubMed
description Norway spruce (Picea abies L. Karst) is one of the most important forest tree species with significant economic and ecological impact in Europe. For decades, genomic and genetic studies on Norway spruce have been challenging due to the large and repetitive genome (19.6 Gb with more than 70% being repetitive). To accelerate genomic studies, including population genetics, genome‐wide association studies (GWAS) and genomic selection (GS), in Norway spruce and related species, we here report on the design and performance of a 50K single nucleotide polymorphism (SNP) genotyping array for Norway spruce. The array is developed based on whole genome resequencing (WGS), making it the first WGS‐based SNP array in any conifer species so far. After identifying SNPs using genome resequencing data from 29 trees collected in northern Europe, we adopted a two‐step approach to design the array. First, we built a 450K screening array and used this to genotype a population of 480 trees sampled from both natural and breeding populations across the Norway spruce distribution range. These samples were then used to select high‐confidence probes that were put on the final 50K array. The SNPs selected are distributed over 45,552 scaffolds from the P. abies version 1.0 genome assembly and target 19,954 unique gene models with an even coverage of the 12 linkage groups in Norway spruce. We show that the array has a 99.5% probe specificity, >98% Mendelian allelic inheritance concordance, an average sample call rate of 96.30% and an SNP call rate of 98.90% in family trios and haploid tissues. We also observed that 23,797 probes (50%) could be identified with high confidence in three other spruce species (white spruce [Picea glauca], black spruce [P. mariana] and Sitka spruce [P. sitchensis]). The high‐quality genotyping array will be a valuable resource for genetic and genomic studies in Norway spruce as well as in other conifer species of the same genus.
format Online
Article
Text
id pubmed-7984398
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-79843982021-03-25 Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species Bernhardsson, Carolina Zan, Yanjun Chen, Zhiqiang Ingvarsson, Pär K. Wu, Harry X. Mol Ecol Resour RESOURCE ARTICLES Norway spruce (Picea abies L. Karst) is one of the most important forest tree species with significant economic and ecological impact in Europe. For decades, genomic and genetic studies on Norway spruce have been challenging due to the large and repetitive genome (19.6 Gb with more than 70% being repetitive). To accelerate genomic studies, including population genetics, genome‐wide association studies (GWAS) and genomic selection (GS), in Norway spruce and related species, we here report on the design and performance of a 50K single nucleotide polymorphism (SNP) genotyping array for Norway spruce. The array is developed based on whole genome resequencing (WGS), making it the first WGS‐based SNP array in any conifer species so far. After identifying SNPs using genome resequencing data from 29 trees collected in northern Europe, we adopted a two‐step approach to design the array. First, we built a 450K screening array and used this to genotype a population of 480 trees sampled from both natural and breeding populations across the Norway spruce distribution range. These samples were then used to select high‐confidence probes that were put on the final 50K array. The SNPs selected are distributed over 45,552 scaffolds from the P. abies version 1.0 genome assembly and target 19,954 unique gene models with an even coverage of the 12 linkage groups in Norway spruce. We show that the array has a 99.5% probe specificity, >98% Mendelian allelic inheritance concordance, an average sample call rate of 96.30% and an SNP call rate of 98.90% in family trios and haploid tissues. We also observed that 23,797 probes (50%) could be identified with high confidence in three other spruce species (white spruce [Picea glauca], black spruce [P. mariana] and Sitka spruce [P. sitchensis]). The high‐quality genotyping array will be a valuable resource for genetic and genomic studies in Norway spruce as well as in other conifer species of the same genus. John Wiley and Sons Inc. 2020-12-02 2021-04 /pmc/articles/PMC7984398/ /pubmed/33179386 http://dx.doi.org/10.1111/1755-0998.13292 Text en © 2020 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle RESOURCE ARTICLES
Bernhardsson, Carolina
Zan, Yanjun
Chen, Zhiqiang
Ingvarsson, Pär K.
Wu, Harry X.
Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title_full Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title_fullStr Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title_full_unstemmed Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title_short Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species
title_sort development of a highly efficient 50k single nucleotide polymorphism genotyping array for the large and complex genome of norway spruce (picea abies l. karst) by whole genome resequencing and its transferability to other spruce species
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7984398/
https://www.ncbi.nlm.nih.gov/pubmed/33179386
http://dx.doi.org/10.1111/1755-0998.13292
work_keys_str_mv AT bernhardssoncarolina developmentofahighlyefficient50ksinglenucleotidepolymorphismgenotypingarrayforthelargeandcomplexgenomeofnorwaysprucepiceaabieslkarstbywholegenomeresequencinganditstransferabilitytoothersprucespecies
AT zanyanjun developmentofahighlyefficient50ksinglenucleotidepolymorphismgenotypingarrayforthelargeandcomplexgenomeofnorwaysprucepiceaabieslkarstbywholegenomeresequencinganditstransferabilitytoothersprucespecies
AT chenzhiqiang developmentofahighlyefficient50ksinglenucleotidepolymorphismgenotypingarrayforthelargeandcomplexgenomeofnorwaysprucepiceaabieslkarstbywholegenomeresequencinganditstransferabilitytoothersprucespecies
AT ingvarssonpark developmentofahighlyefficient50ksinglenucleotidepolymorphismgenotypingarrayforthelargeandcomplexgenomeofnorwaysprucepiceaabieslkarstbywholegenomeresequencinganditstransferabilitytoothersprucespecies
AT wuharryx developmentofahighlyefficient50ksinglenucleotidepolymorphismgenotypingarrayforthelargeandcomplexgenomeofnorwaysprucepiceaabieslkarstbywholegenomeresequencinganditstransferabilitytoothersprucespecies