Cargando…
Comprehensive assessment of computational algorithms in predicting cancer driver mutations
BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer me...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033911/ https://www.ncbi.nlm.nih.gov/pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z |
_version_ | 1783499768378425344 |
---|---|
author | Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han |
author_facet | Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han |
author_sort | Chen, Hu |
collection | PubMed |
description | BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed. RESULTS: We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose. CONCLUSIONS: Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms. |
format | Online Article Text |
id | pubmed-7033911 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70339112020-02-27 Comprehensive assessment of computational algorithms in predicting cancer driver mutations Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han Genome Biol Research BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed. RESULTS: We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose. CONCLUSIONS: Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms. BioMed Central 2020-02-20 /pmc/articles/PMC7033911/ /pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title | Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title_full | Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title_fullStr | Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title_full_unstemmed | Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title_short | Comprehensive assessment of computational algorithms in predicting cancer driver mutations |
title_sort | comprehensive assessment of computational algorithms in predicting cancer driver mutations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033911/ https://www.ncbi.nlm.nih.gov/pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z |
work_keys_str_mv | AT chenhu comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT lijun comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT wangyumeng comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT ngpatrickkwokshing comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT tsangyiuhuen comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT shawkennar comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT millsgordonb comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT lianghan comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations |