Cargando…
Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional pheno...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7833463/ https://www.ncbi.nlm.nih.gov/pubmed/33532691 http://dx.doi.org/10.1002/pld3.304 |
_version_ | 1783642071962222592 |
---|---|
author | Momen, Mehdi Bhatta, Madhav Hussain, Waseem Yu, Haipeng Morota, Gota |
author_facet | Momen, Mehdi Bhatta, Madhav Hussain, Waseem Yu, Haipeng Morota, Gota |
author_sort | Momen, Mehdi |
collection | PubMed |
description | Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional phenotypes are limited, whereas numerous methods are available for evaluating the massive number of genetic markers. Factor analysis operates at the level of latent variables predicted to generate observed responses. The objectives of this study were to illustrate the manner in which data‐driven exploratory factor analysis can map observed phenotypes into a smaller number of latent variables and infer a genomic latent factor network using 45 agro‐morphological, disease, and grain mineral phenotypes measured in synthetic hexaploid wheat lines (Triticum aestivum L.). In total, eight latent factors including grain yield, architecture, flag leaf‐related traits, grain minerals, yellow rust, two types of stem rust, and leaf rust were identified as common sources of the observed phenotypes. The genetic component of the factor scores for each latent variable was fed into a Bayesian network to obtain a trait structure reflecting the genetic interdependency among traits. Three directed paths were consistently identified by two Bayesian network algorithms. Flag leaf‐related traits influenced leaf rust, and yellow rust and stem rust influenced grain yield. Additional paths that were identified included flag leaf‐related traits to minerals and minerals to architecture. This study shows that data‐driven exploratory factor analysis can reveal smaller dimensional common latent phenotypes that are likely to give rise to numerous observed field phenotypes without relying on prior biological knowledge. The inferred genomic latent factor structure from the Bayesian network provides insights for plant breeding to simultaneously improve multiple traits, as an intervention on one trait will affect the values of focal phenotypes in an interrelated complex trait system. |
format | Online Article Text |
id | pubmed-7833463 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78334632021-02-01 Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning Momen, Mehdi Bhatta, Madhav Hussain, Waseem Yu, Haipeng Morota, Gota Plant Direct Original Research Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional phenotypes are limited, whereas numerous methods are available for evaluating the massive number of genetic markers. Factor analysis operates at the level of latent variables predicted to generate observed responses. The objectives of this study were to illustrate the manner in which data‐driven exploratory factor analysis can map observed phenotypes into a smaller number of latent variables and infer a genomic latent factor network using 45 agro‐morphological, disease, and grain mineral phenotypes measured in synthetic hexaploid wheat lines (Triticum aestivum L.). In total, eight latent factors including grain yield, architecture, flag leaf‐related traits, grain minerals, yellow rust, two types of stem rust, and leaf rust were identified as common sources of the observed phenotypes. The genetic component of the factor scores for each latent variable was fed into a Bayesian network to obtain a trait structure reflecting the genetic interdependency among traits. Three directed paths were consistently identified by two Bayesian network algorithms. Flag leaf‐related traits influenced leaf rust, and yellow rust and stem rust influenced grain yield. Additional paths that were identified included flag leaf‐related traits to minerals and minerals to architecture. This study shows that data‐driven exploratory factor analysis can reveal smaller dimensional common latent phenotypes that are likely to give rise to numerous observed field phenotypes without relying on prior biological knowledge. The inferred genomic latent factor structure from the Bayesian network provides insights for plant breeding to simultaneously improve multiple traits, as an intervention on one trait will affect the values of focal phenotypes in an interrelated complex trait system. John Wiley and Sons Inc. 2021-01-25 /pmc/articles/PMC7833463/ /pubmed/33532691 http://dx.doi.org/10.1002/pld3.304 Text en © 2021 The Authors. Plant Direct published by American Society of Plant Biologists, Society for Experimental Biology and John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Momen, Mehdi Bhatta, Madhav Hussain, Waseem Yu, Haipeng Morota, Gota Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title | Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title_full | Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title_fullStr | Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title_full_unstemmed | Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title_short | Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning |
title_sort | modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and bayesian network learning |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7833463/ https://www.ncbi.nlm.nih.gov/pubmed/33532691 http://dx.doi.org/10.1002/pld3.304 |
work_keys_str_mv | AT momenmehdi modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning AT bhattamadhav modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning AT hussainwaseem modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning AT yuhaipeng modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning AT morotagota modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning |