Cargando…

Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.

Cotton (Gossypium spp.) is a leading natural fiber crop and an important source of vegetable protein and oil for humans and livestock. To investigate the genetic architecture of seed nutrients in upland cotton, a genome-wide association study (GWAS) was conducted in a panel of 196 germplasm resource...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Yanchao, Wang, Xianlin, Wang, Liyuan, Xing, Huixian, Wang, Qingkang, Saeed, Muhammad, Tao, Jincai, Feng, Wei, Zhang, Guihua, Song, Xian-Liang, Sun, Xue-Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6204537/
https://www.ncbi.nlm.nih.gov/pubmed/30405645
http://dx.doi.org/10.3389/fpls.2018.01359
_version_ 1783366054908526592
author Yuan, Yanchao
Wang, Xianlin
Wang, Liyuan
Xing, Huixian
Wang, Qingkang
Saeed, Muhammad
Tao, Jincai
Feng, Wei
Zhang, Guihua
Song, Xian-Liang
Sun, Xue-Zhen
author_facet Yuan, Yanchao
Wang, Xianlin
Wang, Liyuan
Xing, Huixian
Wang, Qingkang
Saeed, Muhammad
Tao, Jincai
Feng, Wei
Zhang, Guihua
Song, Xian-Liang
Sun, Xue-Zhen
author_sort Yuan, Yanchao
collection PubMed
description Cotton (Gossypium spp.) is a leading natural fiber crop and an important source of vegetable protein and oil for humans and livestock. To investigate the genetic architecture of seed nutrients in upland cotton, a genome-wide association study (GWAS) was conducted in a panel of 196 germplasm resources under three environments using a CottonSNP80K chip of 77,774 loci. Relatively high genetic diversity (average gene diversity being 0.331) and phenotypic variation (coefficient of variation, CV, exceeding 3.9%) were detected in this panel. Correlation analysis revealed that the well-documented negative association between seed protein (PR) and oil may be to some extent attributable to the negative correlation between oleic acid (OA) and PR. Linkage disequilibrium (LD) was unevenly distributed among chromosomes and subgenomes. It ranged from 0.10–0.20 Mb (Chr19) to 5.65–5.75 Mb (Chr25) among the chromosomes and the range of Dt-subgenomes LD decay distances was smaller than At-subgenomes. This panel was divided into two subpopulations based on the information of 41,815 polymorphic single-nucleotide polymorphism (SNP) markers. The mixed linear model considering both Q-matrix and K-matrix [MLM(Q+K)] was employed to estimate the association between the SNP markers and the seed nutrients, considering the false positives caused by population structure and the kinship. A total of 47 SNP markers and 28 candidate quantitative trait loci (QTLs) regions were found to be significantly associated with seven cottonseed nutrients, including protein, total fatty acid, and five main fatty acid compositions. In addition, the candidate genes in these regions were analyzed, which included three genes, Gh_D12G1161, Gh_D12G1162, and Gh_D12G1165 that were most likely involved in the control of cottonseed protein concentration. These results improved our understanding of the genetic control of cottonseed nutrients and provided potential molecular tools to develop cultivars with high protein and improved fatty acid compositions in cotton breeding programs through marker-assisted selection.
format Online
Article
Text
id pubmed-6204537
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-62045372018-11-07 Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L. Yuan, Yanchao Wang, Xianlin Wang, Liyuan Xing, Huixian Wang, Qingkang Saeed, Muhammad Tao, Jincai Feng, Wei Zhang, Guihua Song, Xian-Liang Sun, Xue-Zhen Front Plant Sci Plant Science Cotton (Gossypium spp.) is a leading natural fiber crop and an important source of vegetable protein and oil for humans and livestock. To investigate the genetic architecture of seed nutrients in upland cotton, a genome-wide association study (GWAS) was conducted in a panel of 196 germplasm resources under three environments using a CottonSNP80K chip of 77,774 loci. Relatively high genetic diversity (average gene diversity being 0.331) and phenotypic variation (coefficient of variation, CV, exceeding 3.9%) were detected in this panel. Correlation analysis revealed that the well-documented negative association between seed protein (PR) and oil may be to some extent attributable to the negative correlation between oleic acid (OA) and PR. Linkage disequilibrium (LD) was unevenly distributed among chromosomes and subgenomes. It ranged from 0.10–0.20 Mb (Chr19) to 5.65–5.75 Mb (Chr25) among the chromosomes and the range of Dt-subgenomes LD decay distances was smaller than At-subgenomes. This panel was divided into two subpopulations based on the information of 41,815 polymorphic single-nucleotide polymorphism (SNP) markers. The mixed linear model considering both Q-matrix and K-matrix [MLM(Q+K)] was employed to estimate the association between the SNP markers and the seed nutrients, considering the false positives caused by population structure and the kinship. A total of 47 SNP markers and 28 candidate quantitative trait loci (QTLs) regions were found to be significantly associated with seven cottonseed nutrients, including protein, total fatty acid, and five main fatty acid compositions. In addition, the candidate genes in these regions were analyzed, which included three genes, Gh_D12G1161, Gh_D12G1162, and Gh_D12G1165 that were most likely involved in the control of cottonseed protein concentration. These results improved our understanding of the genetic control of cottonseed nutrients and provided potential molecular tools to develop cultivars with high protein and improved fatty acid compositions in cotton breeding programs through marker-assisted selection. Frontiers Media S.A. 2018-10-22 /pmc/articles/PMC6204537/ /pubmed/30405645 http://dx.doi.org/10.3389/fpls.2018.01359 Text en Copyright © 2018 Yuan, Wang, Wang, Xing, Wang, Saeed, Tao, Feng, Zhang, Song and Sun. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Yuan, Yanchao
Wang, Xianlin
Wang, Liyuan
Xing, Huixian
Wang, Qingkang
Saeed, Muhammad
Tao, Jincai
Feng, Wei
Zhang, Guihua
Song, Xian-Liang
Sun, Xue-Zhen
Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title_full Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title_fullStr Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title_full_unstemmed Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title_short Genome-Wide Association Study Identifies Candidate Genes Related to Seed Oil Composition and Protein Content in Gossypium hirsutum L.
title_sort genome-wide association study identifies candidate genes related to seed oil composition and protein content in gossypium hirsutum l.
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6204537/
https://www.ncbi.nlm.nih.gov/pubmed/30405645
http://dx.doi.org/10.3389/fpls.2018.01359
work_keys_str_mv AT yuanyanchao genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT wangxianlin genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT wangliyuan genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT xinghuixian genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT wangqingkang genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT saeedmuhammad genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT taojincai genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT fengwei genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT zhangguihua genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT songxianliang genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml
AT sunxuezhen genomewideassociationstudyidentifiescandidategenesrelatedtoseedoilcompositionandproteincontentingossypiumhirsutuml