Cargando…

A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data

The paradigm called genomic selection (GS) is a revolutionary way of developing new plants and animals. This is a predictive methodology, since it uses learning methods to perform its task. Unfortunately, there is no universal model that can be used for all types of predictions; for this reason, spe...

Descripción completa

Detalles Bibliográficos
Autores principales: Montesinos-López, Osval Antonio, Montesinos-López, José Cricelio, Singh, Pawan, Lozano-Ramirez, Nerida, Barrón-López, Alberto, Montesinos-López, Abelardo, Crossa, José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7642922/
https://www.ncbi.nlm.nih.gov/pubmed/32934019
http://dx.doi.org/10.1534/g3.120.401631
_version_ 1783606174424236032
author Montesinos-López, Osval Antonio
Montesinos-López, José Cricelio
Singh, Pawan
Lozano-Ramirez, Nerida
Barrón-López, Alberto
Montesinos-López, Abelardo
Crossa, José
author_facet Montesinos-López, Osval Antonio
Montesinos-López, José Cricelio
Singh, Pawan
Lozano-Ramirez, Nerida
Barrón-López, Alberto
Montesinos-López, Abelardo
Crossa, José
author_sort Montesinos-López, Osval Antonio
collection PubMed
description The paradigm called genomic selection (GS) is a revolutionary way of developing new plants and animals. This is a predictive methodology, since it uses learning methods to perform its task. Unfortunately, there is no universal model that can be used for all types of predictions; for this reason, specific methodologies are required for each type of output (response variables). Since there is a lack of efficient methodologies for multivariate count data outcomes, in this paper, a multivariate Poisson deep neural network (MPDN) model is proposed for the genomic prediction of various count outcomes simultaneously. The MPDN model uses the minus log-likelihood of a Poisson distribution as a loss function, in hidden layers for capturing nonlinear patterns using the rectified linear unit (RELU) activation function and, in the output layer, the exponential activation function was used for producing outputs on the same scale of counts. The proposed MPDN model was compared to conventional generalized Poisson regression models and univariate Poisson deep learning models in two experimental data sets of count data. We found that the proposed MPDL outperformed univariate Poisson deep neural network models, but did not outperform, in terms of prediction, the univariate generalized Poisson regression models. All deep learning models were implemented in Tensorflow as back-end and Keras as front-end, which allows implementing these models on moderate and large data sets, which is a significant advantage over previous GS models for multivariate count data.
format Online
Article
Text
id pubmed-7642922
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-76429222020-11-13 A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data Montesinos-López, Osval Antonio Montesinos-López, José Cricelio Singh, Pawan Lozano-Ramirez, Nerida Barrón-López, Alberto Montesinos-López, Abelardo Crossa, José G3 (Bethesda) Genomic Prediction The paradigm called genomic selection (GS) is a revolutionary way of developing new plants and animals. This is a predictive methodology, since it uses learning methods to perform its task. Unfortunately, there is no universal model that can be used for all types of predictions; for this reason, specific methodologies are required for each type of output (response variables). Since there is a lack of efficient methodologies for multivariate count data outcomes, in this paper, a multivariate Poisson deep neural network (MPDN) model is proposed for the genomic prediction of various count outcomes simultaneously. The MPDN model uses the minus log-likelihood of a Poisson distribution as a loss function, in hidden layers for capturing nonlinear patterns using the rectified linear unit (RELU) activation function and, in the output layer, the exponential activation function was used for producing outputs on the same scale of counts. The proposed MPDN model was compared to conventional generalized Poisson regression models and univariate Poisson deep learning models in two experimental data sets of count data. We found that the proposed MPDL outperformed univariate Poisson deep neural network models, but did not outperform, in terms of prediction, the univariate generalized Poisson regression models. All deep learning models were implemented in Tensorflow as back-end and Keras as front-end, which allows implementing these models on moderate and large data sets, which is a significant advantage over previous GS models for multivariate count data. Genetics Society of America 2020-09-15 /pmc/articles/PMC7642922/ /pubmed/32934019 http://dx.doi.org/10.1534/g3.120.401631 Text en Copyright © 2020 Montesinos-Lopez et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Prediction
Montesinos-López, Osval Antonio
Montesinos-López, José Cricelio
Singh, Pawan
Lozano-Ramirez, Nerida
Barrón-López, Alberto
Montesinos-López, Abelardo
Crossa, José
A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title_full A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title_fullStr A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title_full_unstemmed A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title_short A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
title_sort multivariate poisson deep learning model for genomic prediction of count data
topic Genomic Prediction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7642922/
https://www.ncbi.nlm.nih.gov/pubmed/32934019
http://dx.doi.org/10.1534/g3.120.401631
work_keys_str_mv AT montesinoslopezosvalantonio amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT montesinoslopezjosecricelio amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT singhpawan amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT lozanoramireznerida amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT barronlopezalberto amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT montesinoslopezabelardo amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT crossajose amultivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT montesinoslopezosvalantonio multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT montesinoslopezjosecricelio multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT singhpawan multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT lozanoramireznerida multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT barronlopezalberto multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT montesinoslopezabelardo multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata
AT crossajose multivariatepoissondeeplearningmodelforgenomicpredictionofcountdata