Cargando…

Application of high-dimensional feature selection: evaluation for genomic prediction in man

In this study, we investigated the effect of five feature selection approaches on the performance of a mixed model (G-BLUP) and a Bayesian (Bayes C) prediction method. We predicted height, high density lipoprotein cholesterol (HDL) and body mass index (BMI) within 2,186 Croatian and into 810 UK indi...

Descripción completa

Detalles Bibliográficos
Autores principales: Bermingham, M. L., Pong-Wong, R., Spiliopoulou, A., Hayward, C., Rudan, I., Campbell, H., Wright, A. F., Wilson, J. F., Agakov, F., Navarro, P., Haley, C. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4437376/
https://www.ncbi.nlm.nih.gov/pubmed/25988841
http://dx.doi.org/10.1038/srep10312
_version_ 1782372202052583424
author Bermingham, M. L.
Pong-Wong, R.
Spiliopoulou, A.
Hayward, C.
Rudan, I.
Campbell, H.
Wright, A. F.
Wilson, J. F.
Agakov, F.
Navarro, P.
Haley, C. S.
author_facet Bermingham, M. L.
Pong-Wong, R.
Spiliopoulou, A.
Hayward, C.
Rudan, I.
Campbell, H.
Wright, A. F.
Wilson, J. F.
Agakov, F.
Navarro, P.
Haley, C. S.
author_sort Bermingham, M. L.
collection PubMed
description In this study, we investigated the effect of five feature selection approaches on the performance of a mixed model (G-BLUP) and a Bayesian (Bayes C) prediction method. We predicted height, high density lipoprotein cholesterol (HDL) and body mass index (BMI) within 2,186 Croatian and into 810 UK individuals using genome-wide SNP data. Using all SNP information Bayes C and G-BLUP had similar predictive performance across all traits within the Croatian data, and for the highly polygenic traits height and BMI when predicting into the UK data. Bayes C outperformed G-BLUP in the prediction of HDL, which is influenced by loci of moderate size, in the UK data. Supervised feature selection of a SNP subset in the G-BLUP framework provided a flexible, generalisable and computationally efficient alternative to Bayes C; but careful evaluation of predictive performance is required when supervised feature selection has been used.
format Online
Article
Text
id pubmed-4437376
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-44373762015-06-01 Application of high-dimensional feature selection: evaluation for genomic prediction in man Bermingham, M. L. Pong-Wong, R. Spiliopoulou, A. Hayward, C. Rudan, I. Campbell, H. Wright, A. F. Wilson, J. F. Agakov, F. Navarro, P. Haley, C. S. Sci Rep Article In this study, we investigated the effect of five feature selection approaches on the performance of a mixed model (G-BLUP) and a Bayesian (Bayes C) prediction method. We predicted height, high density lipoprotein cholesterol (HDL) and body mass index (BMI) within 2,186 Croatian and into 810 UK individuals using genome-wide SNP data. Using all SNP information Bayes C and G-BLUP had similar predictive performance across all traits within the Croatian data, and for the highly polygenic traits height and BMI when predicting into the UK data. Bayes C outperformed G-BLUP in the prediction of HDL, which is influenced by loci of moderate size, in the UK data. Supervised feature selection of a SNP subset in the G-BLUP framework provided a flexible, generalisable and computationally efficient alternative to Bayes C; but careful evaluation of predictive performance is required when supervised feature selection has been used. Nature Publishing Group 2015-05-19 /pmc/articles/PMC4437376/ /pubmed/25988841 http://dx.doi.org/10.1038/srep10312 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Bermingham, M. L.
Pong-Wong, R.
Spiliopoulou, A.
Hayward, C.
Rudan, I.
Campbell, H.
Wright, A. F.
Wilson, J. F.
Agakov, F.
Navarro, P.
Haley, C. S.
Application of high-dimensional feature selection: evaluation for genomic prediction in man
title Application of high-dimensional feature selection: evaluation for genomic prediction in man
title_full Application of high-dimensional feature selection: evaluation for genomic prediction in man
title_fullStr Application of high-dimensional feature selection: evaluation for genomic prediction in man
title_full_unstemmed Application of high-dimensional feature selection: evaluation for genomic prediction in man
title_short Application of high-dimensional feature selection: evaluation for genomic prediction in man
title_sort application of high-dimensional feature selection: evaluation for genomic prediction in man
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4437376/
https://www.ncbi.nlm.nih.gov/pubmed/25988841
http://dx.doi.org/10.1038/srep10312
work_keys_str_mv AT berminghamml applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT pongwongr applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT spiliopouloua applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT haywardc applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT rudani applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT campbellh applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT wrightaf applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT wilsonjf applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT agakovf applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT navarrop applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman
AT haleycs applicationofhighdimensionalfeatureselectionevaluationforgenomicpredictioninman