Cargando…

SLINGER: large-scale learning for predicting gene expression

Recent studies have established that single nucleotide polymorphisms are sufficient to build accurate predictive models of gene expression. Gamazon, et al., found that gene expression values predicted from cis neighborhood SNPs show statistical association with disease status. In this work, we remov...

Descripción completa

Detalles Bibliográficos
Autores principales: Vervier, Kévin, Michaelson, Jacob J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5171717/
https://www.ncbi.nlm.nih.gov/pubmed/27996030
http://dx.doi.org/10.1038/srep39360
_version_ 1782483995947171840
author Vervier, Kévin
Michaelson, Jacob J.
author_facet Vervier, Kévin
Michaelson, Jacob J.
author_sort Vervier, Kévin
collection PubMed
description Recent studies have established that single nucleotide polymorphisms are sufficient to build accurate predictive models of gene expression. Gamazon, et al., found that gene expression values predicted from cis neighborhood SNPs show statistical association with disease status. In this work, we remove the cis neighborhood constraint during the learning process, and propose a novel predictive approach called SLINGER. We demonstrate that models drawing from a genome-wide set of SNPs are able to predict expression for more genes than the ones built on cis neighborhood only. Results indicate that these new models significantly improve accuracy for a large number of genes. Thanks to a penalized linear model, we also show that the number of features used in our models remains comparable to the cis-only models. Finally, SLINGER application on seven Wellcome Trust Case-Control Consortium genome-wide association studies demonstrate that compared to a cis-only approach, our models lead to associations with greater fidelity to actual gene expression values.
format Online
Article
Text
id pubmed-5171717
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-51717172016-12-28 SLINGER: large-scale learning for predicting gene expression Vervier, Kévin Michaelson, Jacob J. Sci Rep Article Recent studies have established that single nucleotide polymorphisms are sufficient to build accurate predictive models of gene expression. Gamazon, et al., found that gene expression values predicted from cis neighborhood SNPs show statistical association with disease status. In this work, we remove the cis neighborhood constraint during the learning process, and propose a novel predictive approach called SLINGER. We demonstrate that models drawing from a genome-wide set of SNPs are able to predict expression for more genes than the ones built on cis neighborhood only. Results indicate that these new models significantly improve accuracy for a large number of genes. Thanks to a penalized linear model, we also show that the number of features used in our models remains comparable to the cis-only models. Finally, SLINGER application on seven Wellcome Trust Case-Control Consortium genome-wide association studies demonstrate that compared to a cis-only approach, our models lead to associations with greater fidelity to actual gene expression values. Nature Publishing Group 2016-12-20 /pmc/articles/PMC5171717/ /pubmed/27996030 http://dx.doi.org/10.1038/srep39360 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Vervier, Kévin
Michaelson, Jacob J.
SLINGER: large-scale learning for predicting gene expression
title SLINGER: large-scale learning for predicting gene expression
title_full SLINGER: large-scale learning for predicting gene expression
title_fullStr SLINGER: large-scale learning for predicting gene expression
title_full_unstemmed SLINGER: large-scale learning for predicting gene expression
title_short SLINGER: large-scale learning for predicting gene expression
title_sort slinger: large-scale learning for predicting gene expression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5171717/
https://www.ncbi.nlm.nih.gov/pubmed/27996030
http://dx.doi.org/10.1038/srep39360
work_keys_str_mv AT vervierkevin slingerlargescalelearningforpredictinggeneexpression
AT michaelsonjacobj slingerlargescalelearningforpredictinggeneexpression