Cargando…

Interpretation of Genomic Variants Using a Unified Biological Network Approach

The decreasing cost of sequencing is leading to a growing repertoire of personal genomes. However, we are lagging behind in understanding the functional consequences of the millions of variants obtained from sequencing. Global system-wide effects of variants in coding genes are particularly poorly u...

Descripción completa

Detalles Bibliográficos
Autores principales: Khurana, Ekta, Fu, Yao, Chen, Jieming, Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3591262/
https://www.ncbi.nlm.nih.gov/pubmed/23505346
http://dx.doi.org/10.1371/journal.pcbi.1002886
_version_ 1782262016692453376
author Khurana, Ekta
Fu, Yao
Chen, Jieming
Gerstein, Mark
author_facet Khurana, Ekta
Fu, Yao
Chen, Jieming
Gerstein, Mark
author_sort Khurana, Ekta
collection PubMed
description The decreasing cost of sequencing is leading to a growing repertoire of personal genomes. However, we are lagging behind in understanding the functional consequences of the millions of variants obtained from sequencing. Global system-wide effects of variants in coding genes are particularly poorly understood. It is known that while variants in some genes can lead to diseases, complete disruption of other genes, called ‘loss-of-function tolerant’, is possible with no obvious effect. Here, we build a systems-based classifier to quantitatively estimate the global perturbation caused by deleterious mutations in each gene. We first survey the degree to which gene centrality in various individual networks and a unified ‘Multinet’ correlates with the tolerance to loss-of-function mutations and evolutionary conservation. We find that functionally significant and highly conserved genes tend to be more central in physical protein-protein and regulatory networks. However, this is not the case for metabolic pathways, where the highly central genes have more duplicated copies and are more tolerant to loss-of-function mutations. Integration of three-dimensional protein structures reveals that the correlation with centrality in the protein-protein interaction network is also seen in terms of the number of interaction interfaces used. Finally, combining all the network and evolutionary properties allows us to build a classifier distinguishing functionally essential and loss-of-function tolerant genes with higher accuracy (AUC = 0.91) than any individual property. Application of the classifier to the whole genome shows its strong potential for interpretation of variants involved in Mendelian diseases and in complex disorders probed by genome-wide association studies.
format Online
Article
Text
id pubmed-3591262
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35912622013-03-15 Interpretation of Genomic Variants Using a Unified Biological Network Approach Khurana, Ekta Fu, Yao Chen, Jieming Gerstein, Mark PLoS Comput Biol Research Article The decreasing cost of sequencing is leading to a growing repertoire of personal genomes. However, we are lagging behind in understanding the functional consequences of the millions of variants obtained from sequencing. Global system-wide effects of variants in coding genes are particularly poorly understood. It is known that while variants in some genes can lead to diseases, complete disruption of other genes, called ‘loss-of-function tolerant’, is possible with no obvious effect. Here, we build a systems-based classifier to quantitatively estimate the global perturbation caused by deleterious mutations in each gene. We first survey the degree to which gene centrality in various individual networks and a unified ‘Multinet’ correlates with the tolerance to loss-of-function mutations and evolutionary conservation. We find that functionally significant and highly conserved genes tend to be more central in physical protein-protein and regulatory networks. However, this is not the case for metabolic pathways, where the highly central genes have more duplicated copies and are more tolerant to loss-of-function mutations. Integration of three-dimensional protein structures reveals that the correlation with centrality in the protein-protein interaction network is also seen in terms of the number of interaction interfaces used. Finally, combining all the network and evolutionary properties allows us to build a classifier distinguishing functionally essential and loss-of-function tolerant genes with higher accuracy (AUC = 0.91) than any individual property. Application of the classifier to the whole genome shows its strong potential for interpretation of variants involved in Mendelian diseases and in complex disorders probed by genome-wide association studies. Public Library of Science 2013-03-07 /pmc/articles/PMC3591262/ /pubmed/23505346 http://dx.doi.org/10.1371/journal.pcbi.1002886 Text en © 2013 Khurana et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Khurana, Ekta
Fu, Yao
Chen, Jieming
Gerstein, Mark
Interpretation of Genomic Variants Using a Unified Biological Network Approach
title Interpretation of Genomic Variants Using a Unified Biological Network Approach
title_full Interpretation of Genomic Variants Using a Unified Biological Network Approach
title_fullStr Interpretation of Genomic Variants Using a Unified Biological Network Approach
title_full_unstemmed Interpretation of Genomic Variants Using a Unified Biological Network Approach
title_short Interpretation of Genomic Variants Using a Unified Biological Network Approach
title_sort interpretation of genomic variants using a unified biological network approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3591262/
https://www.ncbi.nlm.nih.gov/pubmed/23505346
http://dx.doi.org/10.1371/journal.pcbi.1002886
work_keys_str_mv AT khuranaekta interpretationofgenomicvariantsusingaunifiedbiologicalnetworkapproach
AT fuyao interpretationofgenomicvariantsusingaunifiedbiologicalnetworkapproach
AT chenjieming interpretationofgenomicvariantsusingaunifiedbiologicalnetworkapproach
AT gersteinmark interpretationofgenomicvariantsusingaunifiedbiologicalnetworkapproach