Cargando…

Assessment of computational methods for predicting the effects of missense mutations in human cancers

BACKGROUND: Recent advances in sequencing technologies have greatly increased the identification of mutations in cancer genomes. However, it remains a significant challenge to identify cancer-driving mutations, since most observed missense changes are neutral passenger mutations. Various computation...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gnad, Florian, Baucom, Albion, Mukhyala, Kiran, Manning, Gerard, Zhang, Zemin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3665581/ https://www.ncbi.nlm.nih.gov/pubmed/23819521 http://dx.doi.org/10.1186/1471-2164-14-S3-S7

_version_	1782271276487802880
author	Gnad, Florian Baucom, Albion Mukhyala, Kiran Manning, Gerard Zhang, Zemin
author_facet	Gnad, Florian Baucom, Albion Mukhyala, Kiran Manning, Gerard Zhang, Zemin
author_sort	Gnad, Florian
collection	PubMed
description	BACKGROUND: Recent advances in sequencing technologies have greatly increased the identification of mutations in cancer genomes. However, it remains a significant challenge to identify cancer-driving mutations, since most observed missense changes are neutral passenger mutations. Various computational methods have been developed to predict the effects of amino acid substitutions on protein function and classify mutations as deleterious or benign. These include approaches that rely on evolutionary conservation, structural constraints, or physicochemical attributes of amino acid substitutions. Here we review existing methods and further examine eight tools: SIFT, PolyPhen2, Condel, CHASM, mCluster, logRE, SNAP, and MutationAssessor, with respect to their coverage, accuracy, availability and dependence on other tools. RESULTS: Single nucleotide polymorphisms with high minor allele frequencies were used as a negative (neutral) set for testing, and recurrent mutations from the COSMIC database as well as novel recurrent somatic mutations identified in very recent cancer studies were used as positive (non-neutral) sets. Conservation-based methods generally had moderately high accuracy in distinguishing neutral from deleterious mutations, whereas the performance of machine learning based predictors with comprehensive feature spaces varied between assessments using different positive sets. MutationAssessor consistently provided the highest accuracies. For certain combinations metapredictors slightly improved the performance of included individual methods, but did not outperform MutationAssessor as stand-alone tool. CONCLUSIONS: Our independent assessment of existing tools reveals various performance disparities. Cancer-trained methods did not improve upon more general predictors. No method or combination of methods exceeds 81% accuracy, indicating there is still significant room for improvement for driver mutation prediction, and perhaps more sophisticated feature integration is needed to develop a more robust tool.
format	Online Article Text
id	pubmed-3665581
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-36655812013-06-05 Assessment of computational methods for predicting the effects of missense mutations in human cancers Gnad, Florian Baucom, Albion Mukhyala, Kiran Manning, Gerard Zhang, Zemin BMC Genomics Research BACKGROUND: Recent advances in sequencing technologies have greatly increased the identification of mutations in cancer genomes. However, it remains a significant challenge to identify cancer-driving mutations, since most observed missense changes are neutral passenger mutations. Various computational methods have been developed to predict the effects of amino acid substitutions on protein function and classify mutations as deleterious or benign. These include approaches that rely on evolutionary conservation, structural constraints, or physicochemical attributes of amino acid substitutions. Here we review existing methods and further examine eight tools: SIFT, PolyPhen2, Condel, CHASM, mCluster, logRE, SNAP, and MutationAssessor, with respect to their coverage, accuracy, availability and dependence on other tools. RESULTS: Single nucleotide polymorphisms with high minor allele frequencies were used as a negative (neutral) set for testing, and recurrent mutations from the COSMIC database as well as novel recurrent somatic mutations identified in very recent cancer studies were used as positive (non-neutral) sets. Conservation-based methods generally had moderately high accuracy in distinguishing neutral from deleterious mutations, whereas the performance of machine learning based predictors with comprehensive feature spaces varied between assessments using different positive sets. MutationAssessor consistently provided the highest accuracies. For certain combinations metapredictors slightly improved the performance of included individual methods, but did not outperform MutationAssessor as stand-alone tool. CONCLUSIONS: Our independent assessment of existing tools reveals various performance disparities. Cancer-trained methods did not improve upon more general predictors. No method or combination of methods exceeds 81% accuracy, indicating there is still significant room for improvement for driver mutation prediction, and perhaps more sophisticated feature integration is needed to develop a more robust tool. BioMed Central 2013-05-28 /pmc/articles/PMC3665581/ /pubmed/23819521 http://dx.doi.org/10.1186/1471-2164-14-S3-S7 Text en Copyright © 2013 Gnad et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Gnad, Florian Baucom, Albion Mukhyala, Kiran Manning, Gerard Zhang, Zemin Assessment of computational methods for predicting the effects of missense mutations in human cancers
title	Assessment of computational methods for predicting the effects of missense mutations in human cancers
title_full	Assessment of computational methods for predicting the effects of missense mutations in human cancers
title_fullStr	Assessment of computational methods for predicting the effects of missense mutations in human cancers
title_full_unstemmed	Assessment of computational methods for predicting the effects of missense mutations in human cancers
title_short	Assessment of computational methods for predicting the effects of missense mutations in human cancers
title_sort	assessment of computational methods for predicting the effects of missense mutations in human cancers
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3665581/ https://www.ncbi.nlm.nih.gov/pubmed/23819521 http://dx.doi.org/10.1186/1471-2164-14-S3-S7
work_keys_str_mv	AT gnadflorian assessmentofcomputationalmethodsforpredictingtheeffectsofmissensemutationsinhumancancers AT baucomalbion assessmentofcomputationalmethodsforpredictingtheeffectsofmissensemutationsinhumancancers AT mukhyalakiran assessmentofcomputationalmethodsforpredictingtheeffectsofmissensemutationsinhumancancers AT manninggerard assessmentofcomputationalmethodsforpredictingtheeffectsofmissensemutationsinhumancancers AT zhangzemin assessmentofcomputationalmethodsforpredictingtheeffectsofmissensemutationsinhumancancers

Assessment of computational methods for predicting the effects of missense mutations in human cancers

Ejemplares similares