Cargando…

Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network

Genomic selection models were investigated to predict several complex traits in breeding populations of Zea mays L. and Eucalyptus globulus Labill. For this, the following methods of Machine Learning (ML) were implemented: (i) Deep Learning (DL) and (ii) Bayesian Regularized Neural Network (BRNN) bo...

Descripción completa

Detalles Bibliográficos
Autores principales: Maldonado, Carlos, Mora-Poblete, Freddy, Contreras-Soto, Rodrigo Iván, Ahmar, Sunny, Chen, Jen-Tsung, do Amaral Júnior, Antônio Teixeira, Scapim, Carlos Alberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728740/
https://www.ncbi.nlm.nih.gov/pubmed/33329658
http://dx.doi.org/10.3389/fpls.2020.593897
_version_ 1783621334955196416
author Maldonado, Carlos
Mora-Poblete, Freddy
Contreras-Soto, Rodrigo Iván
Ahmar, Sunny
Chen, Jen-Tsung
do Amaral Júnior, Antônio Teixeira
Scapim, Carlos Alberto
author_facet Maldonado, Carlos
Mora-Poblete, Freddy
Contreras-Soto, Rodrigo Iván
Ahmar, Sunny
Chen, Jen-Tsung
do Amaral Júnior, Antônio Teixeira
Scapim, Carlos Alberto
author_sort Maldonado, Carlos
collection PubMed
description Genomic selection models were investigated to predict several complex traits in breeding populations of Zea mays L. and Eucalyptus globulus Labill. For this, the following methods of Machine Learning (ML) were implemented: (i) Deep Learning (DL) and (ii) Bayesian Regularized Neural Network (BRNN) both in combination with different hyperparameters. These ML methods were also compared with Genomic Best Linear Unbiased Prediction (GBLUP) and different Bayesian regression models [Bayes A, Bayes B, Bayes Cπ, Bayesian Ridge Regression, Bayesian LASSO, and Reproducing Kernel Hilbert Space (RKHS)]. DL models, using Rectified Linear Units (as the activation function), had higher predictive ability values, which varied from 0.27 (pilodyn penetration of 6 years old eucalypt trees) to 0.78 (flowering-related traits of maize). Moreover, the larger mini-batch size (100%) had a significantly higher predictive ability for wood-related traits than the smaller mini-batch size (10%). On the other hand, in the BRNN method, the architectures of one and two layers that used only the pureline function showed better results of prediction, with values ranging from 0.21 (pilodyn penetration) to 0.71 (flowering traits). A significant increase in the prediction ability was observed for DL in comparison with other methods of genomic prediction (Bayesian alphabet models, GBLUP, RKHS, and BRNN). Another important finding was the usefulness of DL models (through an iterative algorithm) as an SNP detection strategy for genome-wide association studies. The results of this study confirm the importance of DL for genome-wide analyses and crop/tree improvement strategies, which holds promise for accelerating breeding progress.
format Online
Article
Text
id pubmed-7728740
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77287402020-12-15 Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network Maldonado, Carlos Mora-Poblete, Freddy Contreras-Soto, Rodrigo Iván Ahmar, Sunny Chen, Jen-Tsung do Amaral Júnior, Antônio Teixeira Scapim, Carlos Alberto Front Plant Sci Plant Science Genomic selection models were investigated to predict several complex traits in breeding populations of Zea mays L. and Eucalyptus globulus Labill. For this, the following methods of Machine Learning (ML) were implemented: (i) Deep Learning (DL) and (ii) Bayesian Regularized Neural Network (BRNN) both in combination with different hyperparameters. These ML methods were also compared with Genomic Best Linear Unbiased Prediction (GBLUP) and different Bayesian regression models [Bayes A, Bayes B, Bayes Cπ, Bayesian Ridge Regression, Bayesian LASSO, and Reproducing Kernel Hilbert Space (RKHS)]. DL models, using Rectified Linear Units (as the activation function), had higher predictive ability values, which varied from 0.27 (pilodyn penetration of 6 years old eucalypt trees) to 0.78 (flowering-related traits of maize). Moreover, the larger mini-batch size (100%) had a significantly higher predictive ability for wood-related traits than the smaller mini-batch size (10%). On the other hand, in the BRNN method, the architectures of one and two layers that used only the pureline function showed better results of prediction, with values ranging from 0.21 (pilodyn penetration) to 0.71 (flowering traits). A significant increase in the prediction ability was observed for DL in comparison with other methods of genomic prediction (Bayesian alphabet models, GBLUP, RKHS, and BRNN). Another important finding was the usefulness of DL models (through an iterative algorithm) as an SNP detection strategy for genome-wide association studies. The results of this study confirm the importance of DL for genome-wide analyses and crop/tree improvement strategies, which holds promise for accelerating breeding progress. Frontiers Media S.A. 2020-11-27 /pmc/articles/PMC7728740/ /pubmed/33329658 http://dx.doi.org/10.3389/fpls.2020.593897 Text en Copyright © 2020 Maldonado, Mora-Poblete, Contreras-Soto, Ahmar, Chen, do Amaral Júnior and Scapim. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Maldonado, Carlos
Mora-Poblete, Freddy
Contreras-Soto, Rodrigo Iván
Ahmar, Sunny
Chen, Jen-Tsung
do Amaral Júnior, Antônio Teixeira
Scapim, Carlos Alberto
Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title_full Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title_fullStr Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title_full_unstemmed Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title_short Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network
title_sort genome-wide prediction of complex traits in two outcrossing plant species through deep learning and bayesian regularized neural network
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728740/
https://www.ncbi.nlm.nih.gov/pubmed/33329658
http://dx.doi.org/10.3389/fpls.2020.593897
work_keys_str_mv AT maldonadocarlos genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT morapobletefreddy genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT contrerassotorodrigoivan genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT ahmarsunny genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT chenjentsung genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT doamaraljuniorantonioteixeira genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork
AT scapimcarlosalberto genomewidepredictionofcomplextraitsintwooutcrossingplantspeciesthroughdeeplearningandbayesianregularizedneuralnetwork