Cargando…

Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline

Soybean is a primary meal protein for human consumption, poultry, and livestock feed. In this study, quantitative trait locus (QTL) controlling protein content was explored via genome-wide association studies (GWAS) and linkage mapping approaches based on 284 soybean accessions and 180 recombinant i...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Jun, Wang, Fengmin, Zhao, Qingsong, Shi, Ainong, Zhao, Tiantian, Song, Qijian, Ravelombola, Waltram, An, Hongzhou, Yan, Long, Yang, Chunyan, Zhang, Mengchen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244705/
https://www.ncbi.nlm.nih.gov/pubmed/35783963
http://dx.doi.org/10.3389/fpls.2022.882732
_version_ 1784738587864989696
author Qin, Jun
Wang, Fengmin
Zhao, Qingsong
Shi, Ainong
Zhao, Tiantian
Song, Qijian
Ravelombola, Waltram
An, Hongzhou
Yan, Long
Yang, Chunyan
Zhang, Mengchen
author_facet Qin, Jun
Wang, Fengmin
Zhao, Qingsong
Shi, Ainong
Zhao, Tiantian
Song, Qijian
Ravelombola, Waltram
An, Hongzhou
Yan, Long
Yang, Chunyan
Zhang, Mengchen
author_sort Qin, Jun
collection PubMed
description Soybean is a primary meal protein for human consumption, poultry, and livestock feed. In this study, quantitative trait locus (QTL) controlling protein content was explored via genome-wide association studies (GWAS) and linkage mapping approaches based on 284 soybean accessions and 180 recombinant inbred lines (RILs), respectively, which were evaluated for protein content for 4 years. A total of 22 single nucleotide polymorphisms (SNPs) associated with protein content were detected using mixed linear model (MLM) and general linear model (GLM) methods in Tassel and 5 QTLs using Bayesian interval mapping (IM), single-trait multiple interval mapping (SMIM), single-trait composite interval mapping maximum likelihood estimation (SMLE), and single marker regression (SMR) models in Q-Gene and IciMapping. Major QTLs were detected on chromosomes 6 and 20 in both populations. The new QTL genomic region on chromosome 6 (Chr6_18844283–19315351) included 7 candidate genes and the Hap.X(AA) at the Chr6_19172961 position was associated with high protein content. Genomic selection (GS) of protein content was performed using Bayesian Lasso (BL) and ridge regression best linear unbiased prediction (rrBULP) based on all the SNPs and the SNPs significantly associated with protein content resulted from GWAS. The results showed that BL and rrBLUP performed similarly; GS accuracy was dependent on the SNP set and training population size. GS efficiency was higher for the SNPs derived from GWAS than random SNPs and reached a plateau when the number of markers was >2,000. The SNP markers identified in this study and other information were essential in establishing an efficient marker-assisted selection (MAS) and GS pipelines for improving soybean protein content.
format Online
Article
Text
id pubmed-9244705
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92447052022-07-01 Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline Qin, Jun Wang, Fengmin Zhao, Qingsong Shi, Ainong Zhao, Tiantian Song, Qijian Ravelombola, Waltram An, Hongzhou Yan, Long Yang, Chunyan Zhang, Mengchen Front Plant Sci Plant Science Soybean is a primary meal protein for human consumption, poultry, and livestock feed. In this study, quantitative trait locus (QTL) controlling protein content was explored via genome-wide association studies (GWAS) and linkage mapping approaches based on 284 soybean accessions and 180 recombinant inbred lines (RILs), respectively, which were evaluated for protein content for 4 years. A total of 22 single nucleotide polymorphisms (SNPs) associated with protein content were detected using mixed linear model (MLM) and general linear model (GLM) methods in Tassel and 5 QTLs using Bayesian interval mapping (IM), single-trait multiple interval mapping (SMIM), single-trait composite interval mapping maximum likelihood estimation (SMLE), and single marker regression (SMR) models in Q-Gene and IciMapping. Major QTLs were detected on chromosomes 6 and 20 in both populations. The new QTL genomic region on chromosome 6 (Chr6_18844283–19315351) included 7 candidate genes and the Hap.X(AA) at the Chr6_19172961 position was associated with high protein content. Genomic selection (GS) of protein content was performed using Bayesian Lasso (BL) and ridge regression best linear unbiased prediction (rrBULP) based on all the SNPs and the SNPs significantly associated with protein content resulted from GWAS. The results showed that BL and rrBLUP performed similarly; GS accuracy was dependent on the SNP set and training population size. GS efficiency was higher for the SNPs derived from GWAS than random SNPs and reached a plateau when the number of markers was >2,000. The SNP markers identified in this study and other information were essential in establishing an efficient marker-assisted selection (MAS) and GS pipelines for improving soybean protein content. Frontiers Media S.A. 2022-06-16 /pmc/articles/PMC9244705/ /pubmed/35783963 http://dx.doi.org/10.3389/fpls.2022.882732 Text en Copyright © 2022 Qin, Wang, Zhao, Shi, Zhao, Song, Ravelombola, An, Yan, Yang and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Qin, Jun
Wang, Fengmin
Zhao, Qingsong
Shi, Ainong
Zhao, Tiantian
Song, Qijian
Ravelombola, Waltram
An, Hongzhou
Yan, Long
Yang, Chunyan
Zhang, Mengchen
Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title_full Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title_fullStr Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title_full_unstemmed Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title_short Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
title_sort identification of candidate genes and genomic selection for seed protein in soybean breeding pipeline
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244705/
https://www.ncbi.nlm.nih.gov/pubmed/35783963
http://dx.doi.org/10.3389/fpls.2022.882732
work_keys_str_mv AT qinjun identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT wangfengmin identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT zhaoqingsong identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT shiainong identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT zhaotiantian identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT songqijian identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT ravelombolawaltram identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT anhongzhou identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT yanlong identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT yangchunyan identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline
AT zhangmengchen identificationofcandidategenesandgenomicselectionforseedproteininsoybeanbreedingpipeline