Cargando…

Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat

BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their dis...

Descripción completa

Detalles Bibliográficos
Autores principales: Gianola, Daniel, Okut, Hayrettin, Weigel, Kent A, Rosa, Guilherme JM
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474182/
https://www.ncbi.nlm.nih.gov/pubmed/21981731
http://dx.doi.org/10.1186/1471-2156-12-87
_version_ 1782246776910118912
author Gianola, Daniel
Okut, Hayrettin
Weigel, Kent A
Rosa, Guilherme JM
author_facet Gianola, Daniel
Okut, Hayrettin
Weigel, Kent A
Rosa, Guilherme JM
author_sort Gianola, Daniel
collection PubMed
description BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their distributions, but that assume additive inheritance while ignoring interactions and non-linearity. When interactions have been included in the model, their effects have entered linearly. There is a growing interest in non-parametric methods for predicting quantitative traits based on reproducing kernel Hilbert spaces regressions on markers and radial basis functions. Artificial neural networks (ANN) provide an alternative, because these act as universal approximators of complex functions and can capture non-linear relationships between predictors and responses, with the interplay among variables learned adaptively. ANNs are interesting candidates for analysis of traits affected by cryptic forms of gene action. RESULTS: We investigated various Bayesian ANN architectures using for predicting phenotypes in two data sets consisting of milk production in Jersey cows and yield of inbred lines of wheat. For the Jerseys, predictor variables were derived from pedigree and molecular marker (35,798 single nucleotide polymorphisms, SNPS) information on 297 individually cows. The wheat data represented 599 lines, each genotyped with 1,279 markers. The ability of predicting fat, milk and protein yield was low when using pedigrees, but it was better when SNPs were employed, irrespective of the ANN trained. Predictive ability was even better in wheat because the trait was a mean, as opposed to an individual phenotype in cows. Non-linear neural networks outperformed a linear model in predictive ability in both data sets, but more clearly in wheat. CONCLUSION: Results suggest that neural networks may be useful for predicting complex traits using high-dimensional genomic information, a situation where the number of unknowns exceeds sample size. ANNs can capture nonlinearities, adaptively. This may be useful when prediction of phenotypes is crucial.
format Online
Article
Text
id pubmed-3474182
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34741822012-10-23 Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat Gianola, Daniel Okut, Hayrettin Weigel, Kent A Rosa, Guilherme JM BMC Genet Methodology Article BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their distributions, but that assume additive inheritance while ignoring interactions and non-linearity. When interactions have been included in the model, their effects have entered linearly. There is a growing interest in non-parametric methods for predicting quantitative traits based on reproducing kernel Hilbert spaces regressions on markers and radial basis functions. Artificial neural networks (ANN) provide an alternative, because these act as universal approximators of complex functions and can capture non-linear relationships between predictors and responses, with the interplay among variables learned adaptively. ANNs are interesting candidates for analysis of traits affected by cryptic forms of gene action. RESULTS: We investigated various Bayesian ANN architectures using for predicting phenotypes in two data sets consisting of milk production in Jersey cows and yield of inbred lines of wheat. For the Jerseys, predictor variables were derived from pedigree and molecular marker (35,798 single nucleotide polymorphisms, SNPS) information on 297 individually cows. The wheat data represented 599 lines, each genotyped with 1,279 markers. The ability of predicting fat, milk and protein yield was low when using pedigrees, but it was better when SNPs were employed, irrespective of the ANN trained. Predictive ability was even better in wheat because the trait was a mean, as opposed to an individual phenotype in cows. Non-linear neural networks outperformed a linear model in predictive ability in both data sets, but more clearly in wheat. CONCLUSION: Results suggest that neural networks may be useful for predicting complex traits using high-dimensional genomic information, a situation where the number of unknowns exceeds sample size. ANNs can capture nonlinearities, adaptively. This may be useful when prediction of phenotypes is crucial. BioMed Central 2011-10-07 /pmc/articles/PMC3474182/ /pubmed/21981731 http://dx.doi.org/10.1186/1471-2156-12-87 Text en Copyright ©2011 Gianola et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gianola, Daniel
Okut, Hayrettin
Weigel, Kent A
Rosa, Guilherme JM
Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title_full Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title_fullStr Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title_full_unstemmed Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title_short Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
title_sort predicting complex quantitative traits with bayesian neural networks: a case study with jersey cows and wheat
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474182/
https://www.ncbi.nlm.nih.gov/pubmed/21981731
http://dx.doi.org/10.1186/1471-2156-12-87
work_keys_str_mv AT gianoladaniel predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat
AT okuthayrettin predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat
AT weigelkenta predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat
AT rosaguilhermejm predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat