Cargando…

Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information

Many genetic variants that are significantly correlated to gene expression changes across human individuals have been identified, but the ability of these variants to predict expression of unseen individuals has rarely been evaluated. Here, we devise an algorithm that, given training expression and...

Descripción completa

Detalles Bibliográficos
Autores principales: Manor, Ohad, Segal, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3610805/
https://www.ncbi.nlm.nih.gov/pubmed/23555302
http://dx.doi.org/10.1371/journal.pgen.1003396
_version_ 1782264504768266240
author Manor, Ohad
Segal, Eran
author_facet Manor, Ohad
Segal, Eran
author_sort Manor, Ohad
collection PubMed
description Many genetic variants that are significantly correlated to gene expression changes across human individuals have been identified, but the ability of these variants to predict expression of unseen individuals has rarely been evaluated. Here, we devise an algorithm that, given training expression and genotype data for a set of individuals, predicts the expression of genes of unseen test individuals given only their genotype in the local genomic vicinity of the predicted gene. Notably, the resulting predictions are remarkably robust in that they agree well between the training and test sets, even when the training and test sets consist of individuals from distinct populations. Thus, although the overall number of genes that can be predicted is relatively small, as expected from our choice to ignore effects such as environmental factors and trans sequence variation, the robust nature of the predictions means that the identity and quantitative degree to which genes can be predicted is known in advance. We also present an extension that incorporates heterogeneous types of genomic annotations to differentially weigh the importance of the various genetic variants, and we show that assigning higher weights to variants with particular annotations such as proximity to genes and high regional G/C content can further improve the predictions. Finally, genes that are successfully predicted have, on average, higher expression and more variability across individuals, providing insight into the characteristics of the types of genes that can be predicted from their cis genetic variation.
format Online
Article
Text
id pubmed-3610805
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36108052013-04-03 Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information Manor, Ohad Segal, Eran PLoS Genet Research Article Many genetic variants that are significantly correlated to gene expression changes across human individuals have been identified, but the ability of these variants to predict expression of unseen individuals has rarely been evaluated. Here, we devise an algorithm that, given training expression and genotype data for a set of individuals, predicts the expression of genes of unseen test individuals given only their genotype in the local genomic vicinity of the predicted gene. Notably, the resulting predictions are remarkably robust in that they agree well between the training and test sets, even when the training and test sets consist of individuals from distinct populations. Thus, although the overall number of genes that can be predicted is relatively small, as expected from our choice to ignore effects such as environmental factors and trans sequence variation, the robust nature of the predictions means that the identity and quantitative degree to which genes can be predicted is known in advance. We also present an extension that incorporates heterogeneous types of genomic annotations to differentially weigh the importance of the various genetic variants, and we show that assigning higher weights to variants with particular annotations such as proximity to genes and high regional G/C content can further improve the predictions. Finally, genes that are successfully predicted have, on average, higher expression and more variability across individuals, providing insight into the characteristics of the types of genes that can be predicted from their cis genetic variation. Public Library of Science 2013-03-28 /pmc/articles/PMC3610805/ /pubmed/23555302 http://dx.doi.org/10.1371/journal.pgen.1003396 Text en © 2013 Manor, Segal http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Manor, Ohad
Segal, Eran
Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title_full Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title_fullStr Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title_full_unstemmed Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title_short Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
title_sort robust prediction of expression differences among human individuals using only genotype information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3610805/
https://www.ncbi.nlm.nih.gov/pubmed/23555302
http://dx.doi.org/10.1371/journal.pgen.1003396
work_keys_str_mv AT manorohad robustpredictionofexpressiondifferencesamonghumanindividualsusingonlygenotypeinformation
AT segaleran robustpredictionofexpressiondifferencesamonghumanindividualsusingonlygenotypeinformation