Cargando…

Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation

Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heter...

Descripción completa

Detalles Bibliográficos
Autores principales: Chunn, Lauren M., Nefcy, Diane C., Scouten, Rachel W., Tarpey, Ryan P., Chauhan, Gurinder, Lim, Megan S., Elenitoba-Johnson, Kojo S. J., Schwartz, Steven A., Kiel, Mark J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691534/
https://www.ncbi.nlm.nih.gov/pubmed/33281875
http://dx.doi.org/10.3389/fgene.2020.577152
_version_ 1783614313166012416
author Chunn, Lauren M.
Nefcy, Diane C.
Scouten, Rachel W.
Tarpey, Ryan P.
Chauhan, Gurinder
Lim, Megan S.
Elenitoba-Johnson, Kojo S. J.
Schwartz, Steven A.
Kiel, Mark J.
author_facet Chunn, Lauren M.
Nefcy, Diane C.
Scouten, Rachel W.
Tarpey, Ryan P.
Chauhan, Gurinder
Lim, Megan S.
Elenitoba-Johnson, Kojo S. J.
Schwartz, Steven A.
Kiel, Mark J.
author_sort Chunn, Lauren M.
collection PubMed
description Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heterogeneity of nomenclatures used to describe diseases, genes and genetic variants. Mastermind is a widely-used bioinformatic platform of genomic associations that has indexed more than 7.5 M full-text articles and 2.5 M supplemental datasets. It has automatically identified, disambiguated and annotated >6.1 M genetic variants and identified >50 K disease-gene associations. Here, we describe how Mastermind improves the sensitivity and reproducibility of clinical variant interpretation and produces comprehensive genomic landscapes of genetic variants driving pharmaceutical research. We demonstrate an alarmingly high degree of heterogeneity across commercially available panels for hereditary cancer that is resolved by evidence from Mastermind. We further examined the sensitivity of Mastermind for variant interpretation by examining 108 clinically-encountered variants and comparing the results to alternate methods. Mastermind demonstrated a sensitivity of 98.4% compared to 4.4, 45.6, and 37.4% for alternatives PubMed, Google Scholar, and ClinVar, respectively, and a specificity of 98.5% compared to 45.1, 57.6, and 68.8% as well as an increase in content yield of 22.6-, 2.2-, and 2.6-fold. When curated for clinical significance, Mastermind identified more than 4.9-fold more pathogenic variants than ClinVar for representative genes. For structural variants, we compared Mastermind’s ability to sensitively identify evidence for 10 representative disease-causing CNVs versus results identified in PubMed, as well as its ability to identify evidence for fusion events compared to COSMIC. Mastermind demonstrated a 4.0- to 43.9-fold increase in references for specific CNVs compared to PubMed, as well as 5.4-fold more fusion genes when compared with COSMIC’s curated database. Additionally, Mastermind produced an 8.0-fold increase in reference citations for fusion events common to Mastermind and outside databases. Taken together, these results demonstrate the utility and superiority of Mastermind in terms of both sensitivity and specificity of automated results for clinical diagnostic variant interpretation for multiple genetic variant types and highlight the potential benefit in informing pharmaceutical research.
format Online
Article
Text
id pubmed-7691534
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76915342020-12-04 Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation Chunn, Lauren M. Nefcy, Diane C. Scouten, Rachel W. Tarpey, Ryan P. Chauhan, Gurinder Lim, Megan S. Elenitoba-Johnson, Kojo S. J. Schwartz, Steven A. Kiel, Mark J. Front Genet Genetics Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heterogeneity of nomenclatures used to describe diseases, genes and genetic variants. Mastermind is a widely-used bioinformatic platform of genomic associations that has indexed more than 7.5 M full-text articles and 2.5 M supplemental datasets. It has automatically identified, disambiguated and annotated >6.1 M genetic variants and identified >50 K disease-gene associations. Here, we describe how Mastermind improves the sensitivity and reproducibility of clinical variant interpretation and produces comprehensive genomic landscapes of genetic variants driving pharmaceutical research. We demonstrate an alarmingly high degree of heterogeneity across commercially available panels for hereditary cancer that is resolved by evidence from Mastermind. We further examined the sensitivity of Mastermind for variant interpretation by examining 108 clinically-encountered variants and comparing the results to alternate methods. Mastermind demonstrated a sensitivity of 98.4% compared to 4.4, 45.6, and 37.4% for alternatives PubMed, Google Scholar, and ClinVar, respectively, and a specificity of 98.5% compared to 45.1, 57.6, and 68.8% as well as an increase in content yield of 22.6-, 2.2-, and 2.6-fold. When curated for clinical significance, Mastermind identified more than 4.9-fold more pathogenic variants than ClinVar for representative genes. For structural variants, we compared Mastermind’s ability to sensitively identify evidence for 10 representative disease-causing CNVs versus results identified in PubMed, as well as its ability to identify evidence for fusion events compared to COSMIC. Mastermind demonstrated a 4.0- to 43.9-fold increase in references for specific CNVs compared to PubMed, as well as 5.4-fold more fusion genes when compared with COSMIC’s curated database. Additionally, Mastermind produced an 8.0-fold increase in reference citations for fusion events common to Mastermind and outside databases. Taken together, these results demonstrate the utility and superiority of Mastermind in terms of both sensitivity and specificity of automated results for clinical diagnostic variant interpretation for multiple genetic variant types and highlight the potential benefit in informing pharmaceutical research. Frontiers Media S.A. 2020-11-13 /pmc/articles/PMC7691534/ /pubmed/33281875 http://dx.doi.org/10.3389/fgene.2020.577152 Text en Copyright © 2020 Chunn, Nefcy, Scouten, Tarpey, Chauhan, Lim, Elenitoba-Johnson, Schwartz and Kiel. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Chunn, Lauren M.
Nefcy, Diane C.
Scouten, Rachel W.
Tarpey, Ryan P.
Chauhan, Gurinder
Lim, Megan S.
Elenitoba-Johnson, Kojo S. J.
Schwartz, Steven A.
Kiel, Mark J.
Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title_full Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title_fullStr Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title_full_unstemmed Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title_short Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
title_sort mastermind: a comprehensive genomic association search engine for empirical evidence curation and genetic variant interpretation
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691534/
https://www.ncbi.nlm.nih.gov/pubmed/33281875
http://dx.doi.org/10.3389/fgene.2020.577152
work_keys_str_mv AT chunnlaurenm mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT nefcydianec mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT scoutenrachelw mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT tarpeyryanp mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT chauhangurinder mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT limmegans mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT elenitobajohnsonkojosj mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT schwartzstevena mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation
AT kielmarkj mastermindacomprehensivegenomicassociationsearchengineforempiricalevidencecurationandgeneticvariantinterpretation