Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy

Bibliographic Details
Main authors: Campbell, Ian M., Rao, Mitchell, Arredondo, Sean D., Lalani, Seema R., Xia, Zhilian, Kang, Sung-Hae L., Bi, Weimin, Breman, Amy M., Smith, Janice L., Bacino, Carlos A., Beaudet, Arthur L., Patel, Ankita, Cheung, Sau Wai, Lupski, James R., Stankiewicz, Paweł, Ramocki, Melissa B., Shaw, Chad A.
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784560/
https://www.ncbi.nlm.nih.gov/pubmed/24086149
http://dx.doi.org/10.1371/journal.pgen.1003797
author Campbell, Ian M.
Rao, Mitchell
Arredondo, Sean D.
Lalani, Seema R.
Xia, Zhilian
Kang, Sung-Hae L.
Bi, Weimin
Breman, Amy M.
Smith, Janice L.
Bacino, Carlos A.
Beaudet, Arthur L.
Patel, Ankita
Cheung, Sau Wai
Lupski, James R.
Stankiewicz, Paweł
Ramocki, Melissa B.
Shaw, Chad A.
author_sort Campbell, Ian M.
collection PubMed
description Curation and interpretation of copy number variants identified by genome-wide testing is challenged by the large number of events harbored in each personal genome. Conventional determination of phenotypic relevance relies on patterns of higher frequency in affected individuals versus controls; however, an increasing amount of ascertained variation is rare or private to clans. Consequently, frequency data have less utility to resolve pathogenic from benign. One solution is disease-specific algorithms that leverage gene knowledge together with variant frequency to aid prioritization. We used large-scale resources including Gene Ontology, protein-protein interactions and other annotation systems together with a broad set of 83 genes with known associations to epilepsy to construct a pathogenicity score for the phenotype. We evaluated the score for all annotated human genes and applied Bayesian methods to combine the derived pathogenicity score with frequency information from our diagnostic laboratory. Analysis determined Bayes factors and posterior distributions for each gene. We applied our method to subjects with abnormal chromosomal microarray results and confirmed epilepsy diagnoses gathered by electronic medical record review. Genes deleted in our subjects with epilepsy had significantly higher pathogenicity scores and Bayes factors compared to subjects referred for non-neurologic indications. We also applied our scores to identify a recently validated epilepsy gene in a complex genomic region and to reveal candidate genes for epilepsy. We propose a potential use in clinical decision support for our results in the context of genome-wide screening. Our approach demonstrates the utility of integrative data in medical genomics.
format Online
Article
Text
id pubmed-3784560
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3784560 2013-10-01
PLoS Genet
Research Article
Public Library of Science 2013-09-26
/pmc/articles/PMC3784560/
/pubmed/24086149
http://dx.doi.org/10.1371/journal.pgen.1003797
Text en
© 2013 Campbell et al
http://creativecommons.org/licenses/by/4.0/
This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Fusion of Large-Scale Genomic Knowledge and Frequency Data Computationally Prioritizes Variants in Epilepsy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784560/
https://www.ncbi.nlm.nih.gov/pubmed/24086149
http://dx.doi.org/10.1371/journal.pgen.1003797