Cargando…

Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data

Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here, we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncod...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yi-Fei, Gulko, Brad, Siepel, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395419/
https://www.ncbi.nlm.nih.gov/pubmed/28288115
http://dx.doi.org/10.1038/ng.3810
_version_ 1783229857444921344
author Huang, Yi-Fei
Gulko, Brad
Siepel, Adam
author_facet Huang, Yi-Fei
Gulko, Brad
Siepel, Adam
author_sort Huang, Yi-Fei
collection PubMed
description Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here, we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which therefore are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the “Big Data” available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters.
format Online
Article
Text
id pubmed-5395419
institution National Center for Biotechnology Information
language English
publishDate 2017
record_format MEDLINE/PubMed
spelling pubmed-53954192017-09-13 Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data Huang, Yi-Fei Gulko, Brad Siepel, Adam Nat Genet Article Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here, we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which therefore are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the “Big Data” available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters. 2017-03-13 2017-04 /pmc/articles/PMC5395419/ /pubmed/28288115 http://dx.doi.org/10.1038/ng.3810 Text en http://www.nature.com/authors/editorial_policies/license.html#terms Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Huang, Yi-Fei
Gulko, Brad
Siepel, Adam
Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title_full Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title_fullStr Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title_full_unstemmed Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title_short Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
title_sort fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395419/
https://www.ncbi.nlm.nih.gov/pubmed/28288115
http://dx.doi.org/10.1038/ng.3810
work_keys_str_mv AT huangyifei fastscalablepredictionofdeleteriousnoncodingvariantsfromfunctionalandpopulationgenomicdata
AT gulkobrad fastscalablepredictionofdeleteriousnoncodingvariantsfromfunctionalandpopulationgenomicdata
AT siepeladam fastscalablepredictionofdeleteriousnoncodingvariantsfromfunctionalandpopulationgenomicdata