Cargando…

Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods

In conjunction with big data analysis methods, plant omics technologies have provided scientists with cost-effective and promising tools for discovering genetic architectures of complex agronomic traits using large breeding populations. In recent years, there has been significant progress in plant p...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoosefzadeh-Najafabadi, Mohsen, Torabi, Sepideh, Tulpan, Dan, Rajcan, Istvan, Eskandari, Milad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8647880/
https://www.ncbi.nlm.nih.gov/pubmed/34880894
http://dx.doi.org/10.3389/fpls.2021.777028
_version_ 1784610687437242368
author Yoosefzadeh-Najafabadi, Mohsen
Torabi, Sepideh
Tulpan, Dan
Rajcan, Istvan
Eskandari, Milad
author_facet Yoosefzadeh-Najafabadi, Mohsen
Torabi, Sepideh
Tulpan, Dan
Rajcan, Istvan
Eskandari, Milad
author_sort Yoosefzadeh-Najafabadi, Mohsen
collection PubMed
description In conjunction with big data analysis methods, plant omics technologies have provided scientists with cost-effective and promising tools for discovering genetic architectures of complex agronomic traits using large breeding populations. In recent years, there has been significant progress in plant phenomics and genomics approaches for generating reliable large datasets. However, selecting an appropriate data integration and analysis method to improve the efficiency of phenome-phenome and phenome-genome association studies is still a bottleneck. This study proposes a hyperspectral wide association study (HypWAS) approach as a phenome-phenome association analysis through a hierarchical data integration strategy to estimate the prediction power of hyperspectral reflectance bands in predicting soybean seed yield. Using HypWAS, five important hyperspectral reflectance bands in visible, red-edge, and near-infrared regions were identified significantly associated with seed yield. The phenome-genome association analysis of each tested hyperspectral reflectance band was performed using two conventional genome-wide association studies (GWAS) methods and a machine learning mediated GWAS based on the support vector regression (SVR) method. Using SVR-mediated GWAS, more relevant QTL with the physiological background of the tested hyperspectral reflectance bands were detected, supported by the functional annotation of candidate gene analyses. The results of this study have indicated the advantages of using hierarchical data integration strategy and advanced mathematical methods coupled with phenome-phenome and phenome-genome association analyses for a better understanding of the biology and genetic backgrounds of hyperspectral reflectance bands affecting soybean yield formation. The identified yield-related hyperspectral reflectance bands using HypWAS can be used as indirect selection criteria for selecting superior genotypes with improved yield genetic gains in large breeding populations.
format Online
Article
Text
id pubmed-8647880
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86478802021-12-07 Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods Yoosefzadeh-Najafabadi, Mohsen Torabi, Sepideh Tulpan, Dan Rajcan, Istvan Eskandari, Milad Front Plant Sci Plant Science In conjunction with big data analysis methods, plant omics technologies have provided scientists with cost-effective and promising tools for discovering genetic architectures of complex agronomic traits using large breeding populations. In recent years, there has been significant progress in plant phenomics and genomics approaches for generating reliable large datasets. However, selecting an appropriate data integration and analysis method to improve the efficiency of phenome-phenome and phenome-genome association studies is still a bottleneck. This study proposes a hyperspectral wide association study (HypWAS) approach as a phenome-phenome association analysis through a hierarchical data integration strategy to estimate the prediction power of hyperspectral reflectance bands in predicting soybean seed yield. Using HypWAS, five important hyperspectral reflectance bands in visible, red-edge, and near-infrared regions were identified significantly associated with seed yield. The phenome-genome association analysis of each tested hyperspectral reflectance band was performed using two conventional genome-wide association studies (GWAS) methods and a machine learning mediated GWAS based on the support vector regression (SVR) method. Using SVR-mediated GWAS, more relevant QTL with the physiological background of the tested hyperspectral reflectance bands were detected, supported by the functional annotation of candidate gene analyses. The results of this study have indicated the advantages of using hierarchical data integration strategy and advanced mathematical methods coupled with phenome-phenome and phenome-genome association analyses for a better understanding of the biology and genetic backgrounds of hyperspectral reflectance bands affecting soybean yield formation. The identified yield-related hyperspectral reflectance bands using HypWAS can be used as indirect selection criteria for selecting superior genotypes with improved yield genetic gains in large breeding populations. Frontiers Media S.A. 2021-11-22 /pmc/articles/PMC8647880/ /pubmed/34880894 http://dx.doi.org/10.3389/fpls.2021.777028 Text en Copyright © 2021 Yoosefzadeh-Najafabadi, Torabi, Tulpan, Rajcan and Eskandari. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Yoosefzadeh-Najafabadi, Mohsen
Torabi, Sepideh
Tulpan, Dan
Rajcan, Istvan
Eskandari, Milad
Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title_full Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title_fullStr Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title_full_unstemmed Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title_short Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
title_sort genome-wide association studies of soybean yield-related hyperspectral reflectance bands using machine learning-mediated data integration methods
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8647880/
https://www.ncbi.nlm.nih.gov/pubmed/34880894
http://dx.doi.org/10.3389/fpls.2021.777028
work_keys_str_mv AT yoosefzadehnajafabadimohsen genomewideassociationstudiesofsoybeanyieldrelatedhyperspectralreflectancebandsusingmachinelearningmediateddataintegrationmethods
AT torabisepideh genomewideassociationstudiesofsoybeanyieldrelatedhyperspectralreflectancebandsusingmachinelearningmediateddataintegrationmethods
AT tulpandan genomewideassociationstudiesofsoybeanyieldrelatedhyperspectralreflectancebandsusingmachinelearningmediateddataintegrationmethods
AT rajcanistvan genomewideassociationstudiesofsoybeanyieldrelatedhyperspectralreflectancebandsusingmachinelearningmediateddataintegrationmethods
AT eskandarimilad genomewideassociationstudiesofsoybeanyieldrelatedhyperspectralreflectancebandsusingmachinelearningmediateddataintegrationmethods