Cargando…
Genotype imputation for soybean nested association mapping population to improve precision of QTL detection
KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase m...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110473/ https://www.ncbi.nlm.nih.gov/pubmed/35275252 http://dx.doi.org/10.1007/s00122-022-04070-7 |
_version_ | 1784709111996219392 |
---|---|
author | Chen, Linfeng Yang, Shouping Araya, Susan Quigley, Charles Taliercio, Earl Mian, Rouf Specht, James E. Diers, Brian W. Song, Qijian |
author_facet | Chen, Linfeng Yang, Shouping Araya, Susan Quigley, Charles Taliercio, Earl Mian, Rouf Specht, James E. Diers, Brian W. Song, Qijian |
author_sort | Chen, Linfeng |
collection | PubMed |
description | KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase marker density of existing datasets without additional genotyping. We compared imputation performance of software BEAGLE 5.0, IMPUTE 5 and AlphaPlantImpute and tested software parameters that may help to improve imputation accuracy in soybean populations. Several factors including marker density, extent of linkage disequilibrium (LD), minor allele frequency (MAF), etc., were examined for their effects on imputation accuracy across different software. Our results showed that AlphaPlantImpute had a higher imputation accuracy than BEAGLE 5.0 or IMPUTE 5 tested in each soybean family, especially if the study progeny were genotyped with an extremely low number of markers. LD extent, MAF and reference panel size were positively correlated with imputation accuracy, a minimum number of 50 markers per chromosome and MAF of SNPs > 0.2 in soybean line were required to avoid a significant loss of imputation accuracy. Using the software, we imputed 5176 soybean lines in the soybean nested mapping population (NAM) with high-density markers of the 40 parents. The dataset containing 423,419 markers for 5176 lines and 40 parents was deposited at the Soybase. The imputed NAM dataset was further examined for the improvement of mapping quantitative trait loci (QTL) controlling soybean seed protein content. Most of the QTL identified were at identical or at similar position based on initial and imputed datasets; however, QTL intervals were greatly narrowed. The resulting genotypic dataset of NAM population will facilitate QTL mapping of traits and downstream applications. The information will also help to improve genotyping imputation accuracy in self-pollinated crops. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00122-022-04070-7. |
format | Online Article Text |
id | pubmed-9110473 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-91104732022-05-18 Genotype imputation for soybean nested association mapping population to improve precision of QTL detection Chen, Linfeng Yang, Shouping Araya, Susan Quigley, Charles Taliercio, Earl Mian, Rouf Specht, James E. Diers, Brian W. Song, Qijian Theor Appl Genet Original Article KEY MESSAGE: Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. ABSTRACT: Genotype imputation is a strategy to increase marker density of existing datasets without additional genotyping. We compared imputation performance of software BEAGLE 5.0, IMPUTE 5 and AlphaPlantImpute and tested software parameters that may help to improve imputation accuracy in soybean populations. Several factors including marker density, extent of linkage disequilibrium (LD), minor allele frequency (MAF), etc., were examined for their effects on imputation accuracy across different software. Our results showed that AlphaPlantImpute had a higher imputation accuracy than BEAGLE 5.0 or IMPUTE 5 tested in each soybean family, especially if the study progeny were genotyped with an extremely low number of markers. LD extent, MAF and reference panel size were positively correlated with imputation accuracy, a minimum number of 50 markers per chromosome and MAF of SNPs > 0.2 in soybean line were required to avoid a significant loss of imputation accuracy. Using the software, we imputed 5176 soybean lines in the soybean nested mapping population (NAM) with high-density markers of the 40 parents. The dataset containing 423,419 markers for 5176 lines and 40 parents was deposited at the Soybase. The imputed NAM dataset was further examined for the improvement of mapping quantitative trait loci (QTL) controlling soybean seed protein content. Most of the QTL identified were at identical or at similar position based on initial and imputed datasets; however, QTL intervals were greatly narrowed. The resulting genotypic dataset of NAM population will facilitate QTL mapping of traits and downstream applications. The information will also help to improve genotyping imputation accuracy in self-pollinated crops. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00122-022-04070-7. Springer Berlin Heidelberg 2022-03-11 2022 /pmc/articles/PMC9110473/ /pubmed/35275252 http://dx.doi.org/10.1007/s00122-022-04070-7 Text en © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Chen, Linfeng Yang, Shouping Araya, Susan Quigley, Charles Taliercio, Earl Mian, Rouf Specht, James E. Diers, Brian W. Song, Qijian Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title | Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title_full | Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title_fullStr | Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title_full_unstemmed | Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title_short | Genotype imputation for soybean nested association mapping population to improve precision of QTL detection |
title_sort | genotype imputation for soybean nested association mapping population to improve precision of qtl detection |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110473/ https://www.ncbi.nlm.nih.gov/pubmed/35275252 http://dx.doi.org/10.1007/s00122-022-04070-7 |
work_keys_str_mv | AT chenlinfeng genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT yangshouping genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT arayasusan genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT quigleycharles genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT taliercioearl genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT mianrouf genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT spechtjamese genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT diersbrianw genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection AT songqijian genotypeimputationforsoybeannestedassociationmappingpopulationtoimproveprecisionofqtldetection |