Cargando…

A literature review at genome scale: improving clinical variant assessment

PURPOSE: Over 150,000 variants have been reported to cause Mendelian disease in the medical literature. It is still difficult to leverage this knowledge base in clinical practice as many reports lack strong statistical evidence or may include false associations. Clinical laboratories assess whether...

Descripción completa

Detalles Bibliográficos
Autores principales: Cassa, Christopher A., Jordan, Daniel M., Adzhubei, Ivan, Sunyaev, Shamil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6070443/
https://www.ncbi.nlm.nih.gov/pubmed/29388949
http://dx.doi.org/10.1038/gim.2017.230
_version_ 1783343664831922176
author Cassa, Christopher A.
Jordan, Daniel M.
Adzhubei, Ivan
Sunyaev, Shamil
author_facet Cassa, Christopher A.
Jordan, Daniel M.
Adzhubei, Ivan
Sunyaev, Shamil
author_sort Cassa, Christopher A.
collection PubMed
description PURPOSE: Over 150,000 variants have been reported to cause Mendelian disease in the medical literature. It is still difficult to leverage this knowledge base in clinical practice as many reports lack strong statistical evidence or may include false associations. Clinical laboratories assess whether these variants (along with newly observed variants that are adjacent to these published ones) underlie clinical disorders. METHODS: We measured whether citation data—including journal impact factor and the number of cited variants (NCV) in each gene with published disease associations—can be used to improve variant assessment. RESULTS: Surprisingly, we find that impact factor is not predictive of pathogenicity, but the NCV score for each gene can provide statistical support of pathogenicity. When combining this gene-level citation metric with variant-level evolutionary conservation and structural features, classification accuracy reaches 89.5%. Further, variants identified in clinical exome sequencing cases have higher NCV scores than simulated rare variants from ExAC in matched genes and consequences (p<2.22×10(−16)). CONCLUSION: Aggregate citation data can complement existing variant-based predictive algorithms, and can boost their performance without accessing and reviewing large numbers of manuscripts. The NCV is a slow-growing metric of scientific knowledge about each gene’s association with disease. FUNDING: This research was supported by NIH NHGRI grant HG007229 (C.C.) and NIGMS grant GM078598 (I.A., D.J., and S.S.).
format Online
Article
Text
id pubmed-6070443
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-60704432018-08-02 A literature review at genome scale: improving clinical variant assessment Cassa, Christopher A. Jordan, Daniel M. Adzhubei, Ivan Sunyaev, Shamil Genet Med Article PURPOSE: Over 150,000 variants have been reported to cause Mendelian disease in the medical literature. It is still difficult to leverage this knowledge base in clinical practice as many reports lack strong statistical evidence or may include false associations. Clinical laboratories assess whether these variants (along with newly observed variants that are adjacent to these published ones) underlie clinical disorders. METHODS: We measured whether citation data—including journal impact factor and the number of cited variants (NCV) in each gene with published disease associations—can be used to improve variant assessment. RESULTS: Surprisingly, we find that impact factor is not predictive of pathogenicity, but the NCV score for each gene can provide statistical support of pathogenicity. When combining this gene-level citation metric with variant-level evolutionary conservation and structural features, classification accuracy reaches 89.5%. Further, variants identified in clinical exome sequencing cases have higher NCV scores than simulated rare variants from ExAC in matched genes and consequences (p<2.22×10(−16)). CONCLUSION: Aggregate citation data can complement existing variant-based predictive algorithms, and can boost their performance without accessing and reviewing large numbers of manuscripts. The NCV is a slow-growing metric of scientific knowledge about each gene’s association with disease. FUNDING: This research was supported by NIH NHGRI grant HG007229 (C.C.) and NIGMS grant GM078598 (I.A., D.J., and S.S.). 2018-02-01 2018-09 /pmc/articles/PMC6070443/ /pubmed/29388949 http://dx.doi.org/10.1038/gim.2017.230 Text en http://www.nature.com/authors/editorial_policies/license.html#terms Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Cassa, Christopher A.
Jordan, Daniel M.
Adzhubei, Ivan
Sunyaev, Shamil
A literature review at genome scale: improving clinical variant assessment
title A literature review at genome scale: improving clinical variant assessment
title_full A literature review at genome scale: improving clinical variant assessment
title_fullStr A literature review at genome scale: improving clinical variant assessment
title_full_unstemmed A literature review at genome scale: improving clinical variant assessment
title_short A literature review at genome scale: improving clinical variant assessment
title_sort literature review at genome scale: improving clinical variant assessment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6070443/
https://www.ncbi.nlm.nih.gov/pubmed/29388949
http://dx.doi.org/10.1038/gim.2017.230
work_keys_str_mv AT cassachristophera aliteraturereviewatgenomescaleimprovingclinicalvariantassessment
AT jordandanielm aliteraturereviewatgenomescaleimprovingclinicalvariantassessment
AT adzhubeiivan aliteraturereviewatgenomescaleimprovingclinicalvariantassessment
AT sunyaevshamil aliteraturereviewatgenomescaleimprovingclinicalvariantassessment
AT cassachristophera literaturereviewatgenomescaleimprovingclinicalvariantassessment
AT jordandanielm literaturereviewatgenomescaleimprovingclinicalvariantassessment
AT adzhubeiivan literaturereviewatgenomescaleimprovingclinicalvariantassessment
AT sunyaevshamil literaturereviewatgenomescaleimprovingclinicalvariantassessment