Cargando…

Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references

Genotyping-by-sequencing (GBS) provides high SNP coverage and has recently emerged as a popular technology for genetic and breeding applications in bread wheat (Triticum aestivum L.) and many other plant species. Although GBS can discover millions of SNPs, a high rate of missing data is a major conc...

Descripción completa

Detalles Bibliográficos
Autores principales: Alipour, Hadi, Bai, Guihua, Zhang, Guorong, Bihamta, Mohammad Reza, Mohammadi, Valiollah, Peyghambari, Seyed Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322752/
https://www.ncbi.nlm.nih.gov/pubmed/30615624
http://dx.doi.org/10.1371/journal.pone.0208614
_version_ 1783385648507387904
author Alipour, Hadi
Bai, Guihua
Zhang, Guorong
Bihamta, Mohammad Reza
Mohammadi, Valiollah
Peyghambari, Seyed Ali
author_facet Alipour, Hadi
Bai, Guihua
Zhang, Guorong
Bihamta, Mohammad Reza
Mohammadi, Valiollah
Peyghambari, Seyed Ali
author_sort Alipour, Hadi
collection PubMed
description Genotyping-by-sequencing (GBS) provides high SNP coverage and has recently emerged as a popular technology for genetic and breeding applications in bread wheat (Triticum aestivum L.) and many other plant species. Although GBS can discover millions of SNPs, a high rate of missing data is a major concern for many applications. Accurate imputation of those missing data can significantly improve the utility of GBS data. This study compared imputation accuracies among four genome references including three wheat references (Chinese Spring survey sequence, W7984, and IWGSC RefSeq v1.0) and one barley reference genome by comparing imputed data derived from low-depth sequencing to actual data from high-depth sequencing. After imputation, the average number of imputed data points was the highest in the B genome (~48.99%). The D genome had the lowest imputed data points (~15.02%) but the highest imputation accuracy. Among the four reference genomes, IWGSC RefSeq v1.0 reference provided the most imputed data points, but the lowest imputation accuracy for the SNPs with < 10% minor allele frequency (MAF). The W7984 reference, however, provided the highest imputation accuracy for the SNPs with < 10% MAF.
format Online
Article
Text
id pubmed-6322752
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63227522019-01-19 Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references Alipour, Hadi Bai, Guihua Zhang, Guorong Bihamta, Mohammad Reza Mohammadi, Valiollah Peyghambari, Seyed Ali PLoS One Research Article Genotyping-by-sequencing (GBS) provides high SNP coverage and has recently emerged as a popular technology for genetic and breeding applications in bread wheat (Triticum aestivum L.) and many other plant species. Although GBS can discover millions of SNPs, a high rate of missing data is a major concern for many applications. Accurate imputation of those missing data can significantly improve the utility of GBS data. This study compared imputation accuracies among four genome references including three wheat references (Chinese Spring survey sequence, W7984, and IWGSC RefSeq v1.0) and one barley reference genome by comparing imputed data derived from low-depth sequencing to actual data from high-depth sequencing. After imputation, the average number of imputed data points was the highest in the B genome (~48.99%). The D genome had the lowest imputed data points (~15.02%) but the highest imputation accuracy. Among the four reference genomes, IWGSC RefSeq v1.0 reference provided the most imputed data points, but the lowest imputation accuracy for the SNPs with < 10% minor allele frequency (MAF). The W7984 reference, however, provided the highest imputation accuracy for the SNPs with < 10% MAF. Public Library of Science 2019-01-07 /pmc/articles/PMC6322752/ /pubmed/30615624 http://dx.doi.org/10.1371/journal.pone.0208614 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Alipour, Hadi
Bai, Guihua
Zhang, Guorong
Bihamta, Mohammad Reza
Mohammadi, Valiollah
Peyghambari, Seyed Ali
Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title_full Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title_fullStr Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title_full_unstemmed Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title_short Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references
title_sort imputation accuracy of wheat genotyping-by-sequencing (gbs) data using barley and wheat genome references
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322752/
https://www.ncbi.nlm.nih.gov/pubmed/30615624
http://dx.doi.org/10.1371/journal.pone.0208614
work_keys_str_mv AT alipourhadi imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences
AT baiguihua imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences
AT zhangguorong imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences
AT bihamtamohammadreza imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences
AT mohammadivaliollah imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences
AT peyghambariseyedali imputationaccuracyofwheatgenotypingbysequencinggbsdatausingbarleyandwheatgenomereferences