Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation
Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heter...
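Main authors: | Chunn, Lauren M.; Nefcy, Diane C.; Scouten, Rachel W.; Tarpey, Ryan P.; Chauhan, Gurinder; Lim, Megan S.; Elenitoba-Johnson, Kojo S. J.; Schwartz, Steven A.; Kiel, Mark J. |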
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2020 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691534/ https://www.ncbi.nlm.nih.gov/pubmed/33281875 http://dx.doi.org/10.3389/fgene.2020.577152 |
_version_ | 1783614313166012416 |
---|---|
author | Chunn, Lauren M. Nefcy, Diane C. Scouten, Rachel W. Tarpey, Ryan P. Chauhan, Gurinder Lim, Megan S. Elenitoba-Johnson, Kojo S. J. Schwartz, Steven A. Kiel, Mark J. |
author_sort | Chunn, Lauren M. |
collection | PubMed |
description | Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heterogeneity of nomenclatures used to describe diseases, genes and genetic variants. Mastermind is a widely-used bioinformatic platform of genomic associations that has indexed more than 7.5 M full-text articles and 2.5 M supplemental datasets. It has automatically identified, disambiguated and annotated >6.1 M genetic variants and identified >50 K disease-gene associations. Here, we describe how Mastermind improves the sensitivity and reproducibility of clinical variant interpretation and produces comprehensive genomic landscapes of genetic variants driving pharmaceutical research. We demonstrate an alarmingly high degree of heterogeneity across commercially available panels for hereditary cancer that is resolved by evidence from Mastermind. We further examined the sensitivity of Mastermind for variant interpretation by examining 108 clinically-encountered variants and comparing the results to alternate methods. Mastermind demonstrated a sensitivity of 98.4% compared to 4.4, 45.6, and 37.4% for alternatives PubMed, Google Scholar, and ClinVar, respectively, and a specificity of 98.5% compared to 45.1, 57.6, and 68.8% as well as an increase in content yield of 22.6-, 2.2-, and 2.6-fold. When curated for clinical significance, Mastermind identified more than 4.9-fold more pathogenic variants than ClinVar for representative genes. For structural variants, we compared Mastermind’s ability to sensitively identify evidence for 10 representative disease-causing CNVs versus results identified in PubMed, as well as its ability to identify evidence for fusion events compared to COSMIC. Mastermind demonstrated a 4.0- to 43.9-fold increase in references for specific CNVs compared to PubMed, as well as 5.4-fold more fusion genes when compared with COSMIC’s curated database. Additionally, Mastermind produced an 8.0-fold increase in reference citations for fusion events common to Mastermind and outside databases. Taken together, these results demonstrate the utility and superiority of Mastermind in terms of both sensitivity and specificity of automated results for clinical diagnostic variant interpretation for multiple genetic variant types and highlight the potential benefit in informing pharmaceutical research. |
format | Online Article Text |
id | pubmed-7691534 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-7691534 2020-12-04 Front Genet Genetics Frontiers Media S.A. 2020-11-13 /pmc/articles/PMC7691534/ /pubmed/33281875 http://dx.doi.org/10.3389/fgene.2020.577152 Text en Copyright © 2020 Chunn, Nefcy, Scouten, Tarpey, Chauhan, Lim, Elenitoba-Johnson, Schwartz and Kiel. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691534/ https://www.ncbi.nlm.nih.gov/pubmed/33281875 http://dx.doi.org/10.3389/fgene.2020.577152 |
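
The description above reports sensitivity, specificity, and fold increases for each search tool. As a reading aid, the following minimal sketch shows how such figures are conventionally computed from counts. The function names and the example counts are hypothetical and are not taken from the article; the article's own counting procedure is not described in this record.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of relevant items that were retrieved: TP / (TP + FN)."""
    total = true_pos + false_neg
    if total == 0:
        raise ValueError("no relevant items to evaluate")
    return true_pos / total


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of irrelevant items that were correctly excluded: TN / (TN + FP)."""
    total = true_neg + false_pos
    if total == 0:
        raise ValueError("no irrelevant items to evaluate")
    return true_neg / total


def fold_increase(count_a: float, count_b: float) -> float:
    """How many times larger count_a is than count_b."""
    if count_b == 0:
        raise ValueError("baseline count must be non-zero")
    return count_a / count_b


if __name__ == "__main__":
    # Hypothetical counts for illustration only (not data from the article).
    print(f"sensitivity: {sensitivity(90, 10):.1%}")
    print(f"specificity: {specificity(80, 20):.1%}")
    print(f"fold increase: {fold_increase(50, 10):.1f}x")
```

Under these definitions, a tool can return many results yet still score low on specificity if a large share of them are irrelevant to the queried variant.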