Cargando…
Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy
Curation and interpretation of copy number variants identified by genome-wide testing is challenged by the large number of events harbored in each personal genome. Conventional determination of phenotypic relevance relies on patterns of higher frequency in affected individuals versus controls; howev...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784560/ https://www.ncbi.nlm.nih.gov/pubmed/24086149 http://dx.doi.org/10.1371/journal.pgen.1003797 |
_version_ | 1782477580071337984 |
---|---|
author | Campbell, Ian M. Rao, Mitchell Arredondo, Sean D. Lalani, Seema R. Xia, Zhilian Kang, Sung-Hae L. Bi, Weimin Breman, Amy M. Smith, Janice L. Bacino, Carlos A. Beaudet, Arthur L. Patel, Ankita Cheung, Sau Wai Lupski, James R. Stankiewicz, Paweł Ramocki, Melissa B. Shaw, Chad A. |
author_facet | Campbell, Ian M. Rao, Mitchell Arredondo, Sean D. Lalani, Seema R. Xia, Zhilian Kang, Sung-Hae L. Bi, Weimin Breman, Amy M. Smith, Janice L. Bacino, Carlos A. Beaudet, Arthur L. Patel, Ankita Cheung, Sau Wai Lupski, James R. Stankiewicz, Paweł Ramocki, Melissa B. Shaw, Chad A. |
author_sort | Campbell, Ian M. |
collection | PubMed |
description | Curation and interpretation of copy number variants identified by genome-wide testing is challenged by the large number of events harbored in each personal genome. Conventional determination of phenotypic relevance relies on patterns of higher frequency in affected individuals versus controls; however, an increasing amount of ascertained variation is rare or private to clans. Consequently, frequency data have less utility to resolve pathogenic from benign. One solution is disease-specific algorithms that leverage gene knowledge together with variant frequency to aid prioritization. We used large-scale resources including Gene Ontology, protein-protein interactions and other annotation systems together with a broad set of 83 genes with known associations to epilepsy to construct a pathogenicity score for the phenotype. We evaluated the score for all annotated human genes and applied Bayesian methods to combine the derived pathogenicity score with frequency information from our diagnostic laboratory. Analysis determined Bayes factors and posterior distributions for each gene. We applied our method to subjects with abnormal chromosomal microarray results and confirmed epilepsy diagnoses gathered by electronic medical record review. Genes deleted in our subjects with epilepsy had significantly higher pathogenicity scores and Bayes factors compared to subjects referred for non-neurologic indications. We also applied our scores to identify a recently validated epilepsy gene in a complex genomic region and to reveal candidate genes for epilepsy. We propose a potential use in clinical decision support for our results in the context of genome-wide screening. Our approach demonstrates the utility of integrative data in medical genomics. |
format | Online Article Text |
id | pubmed-3784560 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37845602013-10-01 Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy Campbell, Ian M. Rao, Mitchell Arredondo, Sean D. Lalani, Seema R. Xia, Zhilian Kang, Sung-Hae L. Bi, Weimin Breman, Amy M. Smith, Janice L. Bacino, Carlos A. Beaudet, Arthur L. Patel, Ankita Cheung, Sau Wai Lupski, James R. Stankiewicz, Paweł Ramocki, Melissa B. Shaw, Chad A. PLoS Genet Research Article Curation and interpretation of copy number variants identified by genome-wide testing is challenged by the large number of events harbored in each personal genome. Conventional determination of phenotypic relevance relies on patterns of higher frequency in affected individuals versus controls; however, an increasing amount of ascertained variation is rare or private to clans. Consequently, frequency data have less utility to resolve pathogenic from benign. One solution is disease-specific algorithms that leverage gene knowledge together with variant frequency to aid prioritization. We used large-scale resources including Gene Ontology, protein-protein interactions and other annotation systems together with a broad set of 83 genes with known associations to epilepsy to construct a pathogenicity score for the phenotype. We evaluated the score for all annotated human genes and applied Bayesian methods to combine the derived pathogenicity score with frequency information from our diagnostic laboratory. Analysis determined Bayes factors and posterior distributions for each gene. We applied our method to subjects with abnormal chromosomal microarray results and confirmed epilepsy diagnoses gathered by electronic medical record review. Genes deleted in our subjects with epilepsy had significantly higher pathogenicity scores and Bayes factors compared to subjects referred for non-neurologic indications. We also applied our scores to identify a recently validated epilepsy gene in a complex genomic region and to reveal candidate genes for epilepsy. We propose a potential use in clinical decision support for our results in the context of genome-wide screening. Our approach demonstrates the utility of integrative data in medical genomics. Public Library of Science 2013-09-26 /pmc/articles/PMC3784560/ /pubmed/24086149 http://dx.doi.org/10.1371/journal.pgen.1003797 Text en © 2013 Campbell et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Campbell, Ian M. Rao, Mitchell Arredondo, Sean D. Lalani, Seema R. Xia, Zhilian Kang, Sung-Hae L. Bi, Weimin Breman, Amy M. Smith, Janice L. Bacino, Carlos A. Beaudet, Arthur L. Patel, Ankita Cheung, Sau Wai Lupski, James R. Stankiewicz, Paweł Ramocki, Melissa B. Shaw, Chad A. Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title | Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title_full | Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title_fullStr | Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title_full_unstemmed | Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title_short | Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy |
title_sort | fusion of large-scale genomic knowledge and frequency data computationally prioritizes variants in epilepsy |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784560/ https://www.ncbi.nlm.nih.gov/pubmed/24086149 http://dx.doi.org/10.1371/journal.pgen.1003797 |
work_keys_str_mv | AT campbellianm fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT raomitchell fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT arredondoseand fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT lalaniseemar fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT xiazhilian fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT kangsunghael fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT biweimin fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT bremanamym fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT smithjanicel fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT bacinocarlosa fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT beaudetarthurl fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT patelankita fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT cheungsauwai fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT lupskijamesr fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT stankiewiczpaweł fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT ramockimelissab fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy AT shawchada fusionoflargescalegenomicknowledgeandfrequencydatacomputationallyprioritizesvariantsinepilepsy |