Cargando…

Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning

Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional pheno...

Descripción completa

Detalles Bibliográficos
Autores principales: Momen, Mehdi, Bhatta, Madhav, Hussain, Waseem, Yu, Haipeng, Morota, Gota
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7833463/
https://www.ncbi.nlm.nih.gov/pubmed/33532691
http://dx.doi.org/10.1002/pld3.304
_version_ 1783642071962222592
author Momen, Mehdi
Bhatta, Madhav
Hussain, Waseem
Yu, Haipeng
Morota, Gota
author_facet Momen, Mehdi
Bhatta, Madhav
Hussain, Waseem
Yu, Haipeng
Morota, Gota
author_sort Momen, Mehdi
collection PubMed
description Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional phenotypes are limited, whereas numerous methods are available for evaluating the massive number of genetic markers. Factor analysis operates at the level of latent variables predicted to generate observed responses. The objectives of this study were to illustrate the manner in which data‐driven exploratory factor analysis can map observed phenotypes into a smaller number of latent variables and infer a genomic latent factor network using 45 agro‐morphological, disease, and grain mineral phenotypes measured in synthetic hexaploid wheat lines (Triticum aestivum L.). In total, eight latent factors including grain yield, architecture, flag leaf‐related traits, grain minerals, yellow rust, two types of stem rust, and leaf rust were identified as common sources of the observed phenotypes. The genetic component of the factor scores for each latent variable was fed into a Bayesian network to obtain a trait structure reflecting the genetic interdependency among traits. Three directed paths were consistently identified by two Bayesian network algorithms. Flag leaf‐related traits influenced leaf rust, and yellow rust and stem rust influenced grain yield. Additional paths that were identified included flag leaf‐related traits to minerals and minerals to architecture. This study shows that data‐driven exploratory factor analysis can reveal smaller dimensional common latent phenotypes that are likely to give rise to numerous observed field phenotypes without relying on prior biological knowledge. The inferred genomic latent factor structure from the Bayesian network provides insights for plant breeding to simultaneously improve multiple traits, as an intervention on one trait will affect the values of focal phenotypes in an interrelated complex trait system.
format Online
Article
Text
id pubmed-7833463
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-78334632021-02-01 Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning Momen, Mehdi Bhatta, Madhav Hussain, Waseem Yu, Haipeng Morota, Gota Plant Direct Original Research Inferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional phenotypes are limited, whereas numerous methods are available for evaluating the massive number of genetic markers. Factor analysis operates at the level of latent variables predicted to generate observed responses. The objectives of this study were to illustrate the manner in which data‐driven exploratory factor analysis can map observed phenotypes into a smaller number of latent variables and infer a genomic latent factor network using 45 agro‐morphological, disease, and grain mineral phenotypes measured in synthetic hexaploid wheat lines (Triticum aestivum L.). In total, eight latent factors including grain yield, architecture, flag leaf‐related traits, grain minerals, yellow rust, two types of stem rust, and leaf rust were identified as common sources of the observed phenotypes. The genetic component of the factor scores for each latent variable was fed into a Bayesian network to obtain a trait structure reflecting the genetic interdependency among traits. Three directed paths were consistently identified by two Bayesian network algorithms. Flag leaf‐related traits influenced leaf rust, and yellow rust and stem rust influenced grain yield. Additional paths that were identified included flag leaf‐related traits to minerals and minerals to architecture. This study shows that data‐driven exploratory factor analysis can reveal smaller dimensional common latent phenotypes that are likely to give rise to numerous observed field phenotypes without relying on prior biological knowledge. The inferred genomic latent factor structure from the Bayesian network provides insights for plant breeding to simultaneously improve multiple traits, as an intervention on one trait will affect the values of focal phenotypes in an interrelated complex trait system. John Wiley and Sons Inc. 2021-01-25 /pmc/articles/PMC7833463/ /pubmed/33532691 http://dx.doi.org/10.1002/pld3.304 Text en © 2021 The Authors. Plant Direct published by American Society of Plant Biologists, Society for Experimental Biology and John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Momen, Mehdi
Bhatta, Madhav
Hussain, Waseem
Yu, Haipeng
Morota, Gota
Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title_full Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title_fullStr Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title_full_unstemmed Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title_short Modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and Bayesian network learning
title_sort modeling multiple phenotypes in wheat using data‐driven genomic exploratory factor analysis and bayesian network learning
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7833463/
https://www.ncbi.nlm.nih.gov/pubmed/33532691
http://dx.doi.org/10.1002/pld3.304
work_keys_str_mv AT momenmehdi modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning
AT bhattamadhav modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning
AT hussainwaseem modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning
AT yuhaipeng modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning
AT morotagota modelingmultiplephenotypesinwheatusingdatadrivengenomicexploratoryfactoranalysisandbayesiannetworklearning