Cargando…

Genotype imputation for soybean nested association mapping population to improve precision of QTL detection

KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase m...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Linfeng, Yang, Shouping, Araya, Susan, Quigley, Charles, Taliercio, Earl, Mian, Rouf, Specht, James E., Diers, Brian W., Song, Qijian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110473/
https://www.ncbi.nlm.nih.gov/pubmed/35275252
http://dx.doi.org/10.1007/s00122-022-04070-7
_version_ 1784709111996219392
author Chen, Linfeng
Yang, Shouping
Araya, Susan
Quigley, Charles
Taliercio, Earl
Mian, Rouf
Specht, James E.
Diers, Brian W.
Song, Qijian
author_facet Chen, Linfeng
Yang, Shouping
Araya, Susan
Quigley, Charles
Taliercio, Earl
Mian, Rouf
Specht, James E.
Diers, Brian W.
Song, Qijian
author_sort Chen, Linfeng
collection PubMed
description KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase marker density of existing datasets without additional genotyping. We compared imputation performance of software BEAGLE 5.0, IMPUTE 5 and AlphaPlantImpute and tested software parameters that may help to improve imputation accuracy in soybean populations. Several factors including marker density, extent of linkage disequilibrium (LD), minor allele frequency (MAF), etc., were examined for their effects on imputation accuracy across different software. Our results showed that AlphaPlantImpute had a higher imputation accuracy than BEAGLE 5.0 or IMPUTE 5 tested in each soybean family, especially if the study progeny were genotyped with an extremely low number of markers. LD extent, MAF and reference panel size were positively correlated with imputation accuracy, a minimum number of 50 markers per chromosome and MAF of SNPs > 0.2 in soybean line were required to avoid a significant loss of imputation accuracy. Using the software, we imputed 5176 soybean lines in the soybean nested mapping population (NAM) with high-density markers of the 40 parents. The dataset containing 423,419 markers for 5176 lines and 40 parents was deposited at the Soybase. The imputed NAM dataset was further examined for the improvement of mapping quantitative trait loci (QTL) controlling soybean seed protein content. Most of the QTL identified were at identical or at similar position based on initial and imputed datasets; however, QTL intervals were greatly narrowed. The resulting genotypic dataset of NAM population will facilitate QTL mapping of traits and downstream applications. The information will also help to improve genotyping imputation accuracy in self-pollinated crops. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00122-022-04070-7.
format Online
Article
Text
id pubmed-9110473
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-91104732022-05-18 Genotype imputation for soybean nested association mapping population to improve precision of QTL detection Chen, Linfeng Yang, Shouping Araya, Susan Quigley, Charles Taliercio, Earl Mian, Rouf Specht, James E. Diers, Brian W. Song, Qijian Theor Appl Genet Original Article KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase marker density of existing datasets without additional genotyping. We compared imputation performance of software BEAGLE 5.0, IMPUTE 5 and AlphaPlantImpute and tested software parameters that may help to improve imputation accuracy in soybean populations. Several factors including marker density, extent of linkage disequilibrium (LD), minor allele frequency (MAF), etc., were examined for their effects on imputation accuracy across different software. Our results showed that AlphaPlantImpute had a higher imputation accuracy than BEAGLE 5.0 or IMPUTE 5 tested in each soybean family, especially if the study progeny were genotyped with an extremely low number of markers. LD extent, MAF and reference panel size were positively correlated with imputation accuracy, a minimum number of 50 markers per chromosome and MAF of SNPs > 0.2 in soybean line were required to avoid a significant loss of imputation accuracy. Using the software, we imputed 5176 soybean lines in the soybean nested mapping population (NAM) with high-density markers of the 40 parents. The dataset containing 423,419 markers for 5176 lines and 40 parents was deposited at the Soybase. The imputed NAM dataset was further examined for the improvement of mapping quantitative trait loci (QTL) controlling soybean seed protein content. Most of the QTL identified were at identical or at similar position based on initial and imputed datasets; however, QTL intervals were greatly narrowed. The resulting genotypic dataset of NAM population will facilitate QTL mapping of traits and downstream applications. The information will also help to improve genotyping imputation accuracy in self-pollinated crops. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00122-022-04070-7. Springer Berlin Heidelberg 2022-03-11 2022 /pmc/articles/PMC9110473/ /pubmed/35275252 http://dx.doi.org/10.1007/s00122-022-04070-7 Text en © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Chen, Linfeng
Yang, Shouping
Araya, Susan
Quigley, Charles
Taliercio, Earl
Mian, Rouf
Specht, James E.
Diers, Brian W.
Song, Qijian
Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title_full Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title_fullStr Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title_full_unstemmed Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title_short Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
title_sort genotype imputation for soybean nested association mapping population to improve precision of qtl detection
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110473/
https://www.ncbi.nlm.nih.gov/pubmed/35275252
http://dx.doi.org/10.1007/s00122-022-04070-7
work_keys_str_mv AT chenlinfeng genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT yangshouping genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT arayasusan genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT quigleycharles genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT taliercioearl genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT mianrouf genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT spechtjamese genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT diersbrianw genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection
AT songqijian genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection