Cargando…
Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat
BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their dis...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474182/ https://www.ncbi.nlm.nih.gov/pubmed/21981731 http://dx.doi.org/10.1186/1471-2156-12-87 |
_version_ | 1782246776910118912 |
---|---|
author | Gianola, Daniel Okut, Hayrettin Weigel, Kent A Rosa, Guilherme JM |
author_facet | Gianola, Daniel Okut, Hayrettin Weigel, Kent A Rosa, Guilherme JM |
author_sort | Gianola, Daniel |
collection | PubMed |
description | BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their distributions, but that assume additive inheritance while ignoring interactions and non-linearity. When interactions have been included in the model, their effects have entered linearly. There is a growing interest in non-parametric methods for predicting quantitative traits based on reproducing kernel Hilbert spaces regressions on markers and radial basis functions. Artificial neural networks (ANN) provide an alternative, because these act as universal approximators of complex functions and can capture non-linear relationships between predictors and responses, with the interplay among variables learned adaptively. ANNs are interesting candidates for analysis of traits affected by cryptic forms of gene action. RESULTS: We investigated various Bayesian ANN architectures using for predicting phenotypes in two data sets consisting of milk production in Jersey cows and yield of inbred lines of wheat. For the Jerseys, predictor variables were derived from pedigree and molecular marker (35,798 single nucleotide polymorphisms, SNPS) information on 297 individually cows. The wheat data represented 599 lines, each genotyped with 1,279 markers. The ability of predicting fat, milk and protein yield was low when using pedigrees, but it was better when SNPs were employed, irrespective of the ANN trained. Predictive ability was even better in wheat because the trait was a mean, as opposed to an individual phenotype in cows. Non-linear neural networks outperformed a linear model in predictive ability in both data sets, but more clearly in wheat. CONCLUSION: Results suggest that neural networks may be useful for predicting complex traits using high-dimensional genomic information, a situation where the number of unknowns exceeds sample size. ANNs can capture nonlinearities, adaptively. This may be useful when prediction of phenotypes is crucial. |
format | Online Article Text |
id | pubmed-3474182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34741822012-10-23 Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat Gianola, Daniel Okut, Hayrettin Weigel, Kent A Rosa, Guilherme JM BMC Genet Methodology Article BACKGROUND: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their distributions, but that assume additive inheritance while ignoring interactions and non-linearity. When interactions have been included in the model, their effects have entered linearly. There is a growing interest in non-parametric methods for predicting quantitative traits based on reproducing kernel Hilbert spaces regressions on markers and radial basis functions. Artificial neural networks (ANN) provide an alternative, because these act as universal approximators of complex functions and can capture non-linear relationships between predictors and responses, with the interplay among variables learned adaptively. ANNs are interesting candidates for analysis of traits affected by cryptic forms of gene action. RESULTS: We investigated various Bayesian ANN architectures using for predicting phenotypes in two data sets consisting of milk production in Jersey cows and yield of inbred lines of wheat. For the Jerseys, predictor variables were derived from pedigree and molecular marker (35,798 single nucleotide polymorphisms, SNPS) information on 297 individually cows. The wheat data represented 599 lines, each genotyped with 1,279 markers. The ability of predicting fat, milk and protein yield was low when using pedigrees, but it was better when SNPs were employed, irrespective of the ANN trained. Predictive ability was even better in wheat because the trait was a mean, as opposed to an individual phenotype in cows. Non-linear neural networks outperformed a linear model in predictive ability in both data sets, but more clearly in wheat. CONCLUSION: Results suggest that neural networks may be useful for predicting complex traits using high-dimensional genomic information, a situation where the number of unknowns exceeds sample size. ANNs can capture nonlinearities, adaptively. This may be useful when prediction of phenotypes is crucial. BioMed Central 2011-10-07 /pmc/articles/PMC3474182/ /pubmed/21981731 http://dx.doi.org/10.1186/1471-2156-12-87 Text en Copyright ©2011 Gianola et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Gianola, Daniel Okut, Hayrettin Weigel, Kent A Rosa, Guilherme JM Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title | Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title_full | Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title_fullStr | Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title_full_unstemmed | Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title_short | Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat |
title_sort | predicting complex quantitative traits with bayesian neural networks: a case study with jersey cows and wheat |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474182/ https://www.ncbi.nlm.nih.gov/pubmed/21981731 http://dx.doi.org/10.1186/1471-2156-12-87 |
work_keys_str_mv | AT gianoladaniel predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat AT okuthayrettin predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat AT weigelkenta predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat AT rosaguilhermejm predictingcomplexquantitativetraitswithbayesianneuralnetworksacasestudywithjerseycowsandwheat |