Cargando…

Comprehensive assessment of computational algorithms in predicting cancer driver mutations

BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer me...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Hu, Li, Jun, Wang, Yumeng, Ng, Patrick Kwok-Shing, Tsang, Yiu Huen, Shaw, Kenna R., Mills, Gordon B., Liang, Han
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2020
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033911/ https://www.ncbi.nlm.nih.gov/pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z

_version_	1783499768378425344
author	Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han
author_facet	Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han
author_sort	Chen, Hu
collection	PubMed
description	BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed. RESULTS: We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose. CONCLUSIONS: Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms.
format	Online Article Text
id	pubmed-7033911
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-70339112020-02-27 Comprehensive assessment of computational algorithms in predicting cancer driver mutations Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han Genome Biol Research BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed. RESULTS: We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose. CONCLUSIONS: Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms. BioMed Central 2020-02-20 /pmc/articles/PMC7033911/ /pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Chen, Hu Li, Jun Wang, Yumeng Ng, Patrick Kwok-Shing Tsang, Yiu Huen Shaw, Kenna R. Mills, Gordon B. Liang, Han Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title	Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title_full	Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title_fullStr	Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title_full_unstemmed	Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title_short	Comprehensive assessment of computational algorithms in predicting cancer driver mutations
title_sort	comprehensive assessment of computational algorithms in predicting cancer driver mutations
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033911/ https://www.ncbi.nlm.nih.gov/pubmed/32079540 http://dx.doi.org/10.1186/s13059-020-01954-z
work_keys_str_mv	AT chenhu comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT lijun comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT wangyumeng comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT ngpatrickkwokshing comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT tsangyiuhuen comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT shawkennar comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT millsgordonb comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations AT lianghan comprehensiveassessmentofcomputationalalgorithmsinpredictingcancerdrivermutations

Comprehensive assessment of computational algorithms in predicting cancer driver mutations

Ejemplares similares