Cargando…

A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants

The rapid integration of genomic technologies in clinical diagnostics has resulted in the detection of a multitude of missense variants whose clinical significance is often unknown. As a result, a plethora of computational tools have been developed to facilitate variant interpretation. However, choo...

Descripción completa

Detalles Bibliográficos
Autores principales: Qorri, Erda, Takács, Bertalan, Gráf, Alexandra, Enyedi, Márton Zsolt, Pintér, Lajos, Kiss, Ernő, Haracska, Lajos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9322961/
https://www.ncbi.nlm.nih.gov/pubmed/35887294
http://dx.doi.org/10.3390/ijms23147946
_version_ 1784756432751558656
author Qorri, Erda
Takács, Bertalan
Gráf, Alexandra
Enyedi, Márton Zsolt
Pintér, Lajos
Kiss, Ernő
Haracska, Lajos
author_facet Qorri, Erda
Takács, Bertalan
Gráf, Alexandra
Enyedi, Márton Zsolt
Pintér, Lajos
Kiss, Ernő
Haracska, Lajos
author_sort Qorri, Erda
collection PubMed
description The rapid integration of genomic technologies in clinical diagnostics has resulted in the detection of a multitude of missense variants whose clinical significance is often unknown. As a result, a plethora of computational tools have been developed to facilitate variant interpretation. However, choosing an appropriate software from such a broad range of tools can be challenging; therefore, systematic benchmarking with high-quality, independent datasets is critical. Using three independent benchmarking datasets compiled from the ClinVar database, we evaluated the performance of ten widely used prediction algorithms with missense variants from 21 clinically relevant genes, including BRCA1 and BRCA2. A fourth dataset consisting of 1053 missense variants was also used to investigate the impact of type 1 circularity on their performance. The performance of the prediction algorithms varied widely across datasets. Based on Matthews Correlation Coefficient and Area Under the Curve, SNPs&GO and PMut consistently displayed an overall above-average performance across the datasets. Most of the tools demonstrated greater sensitivity and negative predictive values at the expense of lower specificity and positive predictive values. We also demonstrated that type 1 circularity significantly impacts the performance of these tools and, if not accounted for, may confound the selection of the best performing algorithms.
format Online
Article
Text
id pubmed-9322961
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93229612022-07-27 A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants Qorri, Erda Takács, Bertalan Gráf, Alexandra Enyedi, Márton Zsolt Pintér, Lajos Kiss, Ernő Haracska, Lajos Int J Mol Sci Article The rapid integration of genomic technologies in clinical diagnostics has resulted in the detection of a multitude of missense variants whose clinical significance is often unknown. As a result, a plethora of computational tools have been developed to facilitate variant interpretation. However, choosing an appropriate software from such a broad range of tools can be challenging; therefore, systematic benchmarking with high-quality, independent datasets is critical. Using three independent benchmarking datasets compiled from the ClinVar database, we evaluated the performance of ten widely used prediction algorithms with missense variants from 21 clinically relevant genes, including BRCA1 and BRCA2. A fourth dataset consisting of 1053 missense variants was also used to investigate the impact of type 1 circularity on their performance. The performance of the prediction algorithms varied widely across datasets. Based on Matthews Correlation Coefficient and Area Under the Curve, SNPs&GO and PMut consistently displayed an overall above-average performance across the datasets. Most of the tools demonstrated greater sensitivity and negative predictive values at the expense of lower specificity and positive predictive values. We also demonstrated that type 1 circularity significantly impacts the performance of these tools and, if not accounted for, may confound the selection of the best performing algorithms. MDPI 2022-07-19 /pmc/articles/PMC9322961/ /pubmed/35887294 http://dx.doi.org/10.3390/ijms23147946 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qorri, Erda
Takács, Bertalan
Gráf, Alexandra
Enyedi, Márton Zsolt
Pintér, Lajos
Kiss, Ernő
Haracska, Lajos
A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title_full A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title_fullStr A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title_full_unstemmed A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title_short A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants
title_sort comprehensive evaluation of the performance of prediction algorithms on clinically relevant missense variants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9322961/
https://www.ncbi.nlm.nih.gov/pubmed/35887294
http://dx.doi.org/10.3390/ijms23147946
work_keys_str_mv AT qorrierda acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT takacsbertalan acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT grafalexandra acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT enyedimartonzsolt acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT pinterlajos acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT kisserno acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT haracskalajos acomprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT qorrierda comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT takacsbertalan comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT grafalexandra comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT enyedimartonzsolt comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT pinterlajos comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT kisserno comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants
AT haracskalajos comprehensiveevaluationoftheperformanceofpredictionalgorithmsonclinicallyrelevantmissensevariants