Cargando…

Statistical method on nonrandom clustering with application to somatic mutations in cancer

BACKGROUND: Human cancer is caused by the accumulation of tumor-specific mutations in oncogenes and tumor suppressors that confer a selective growth advantage to cells. As a consequence of genomic instability and high levels of proliferation, many passenger mutations that do not contribute to the ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Jingjing, Pavlicek, Adam, Lunney, Elizabeth A, Rejto, Paul A, Teng, Chi-Hse
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822753/
https://www.ncbi.nlm.nih.gov/pubmed/20053295
http://dx.doi.org/10.1186/1471-2105-11-11
_version_ 1782177551379070976
author Ye, Jingjing
Pavlicek, Adam
Lunney, Elizabeth A
Rejto, Paul A
Teng, Chi-Hse
author_facet Ye, Jingjing
Pavlicek, Adam
Lunney, Elizabeth A
Rejto, Paul A
Teng, Chi-Hse
author_sort Ye, Jingjing
collection PubMed
description BACKGROUND: Human cancer is caused by the accumulation of tumor-specific mutations in oncogenes and tumor suppressors that confer a selective growth advantage to cells. As a consequence of genomic instability and high levels of proliferation, many passenger mutations that do not contribute to the cancer phenotype arise alongside mutations that drive oncogenesis. While several approaches have been developed to separate driver mutations from passengers, few approaches can specifically identify activating driver mutations in oncogenes, which are more amenable for pharmacological intervention. RESULTS: We propose a new statistical method for detecting activating mutations in cancer by identifying nonrandom clusters of amino acid mutations in protein sequences. A probability model is derived using order statistics assuming that the location of amino acid mutations on a protein follows a uniform distribution. Our statistical measure is the differences between pair-wise order statistics, which is equivalent to the size of an amino acid mutation cluster, and the probabilities are derived from exact and approximate distributions of the statistical measure. Using data in the Catalog of Somatic Mutations in Cancer (COSMIC) database, we have demonstrated that our method detects well-known clusters of activating mutations in KRAS, BRAF, PI3K, and β-catenin. The method can also identify new cancer targets as well as gain-of-function mutations in tumor suppressors. CONCLUSIONS: Our proposed method is useful to discover activating driver mutations in cancer by identifying nonrandom clusters of somatic amino acid mutations in protein sequences.
format Text
id pubmed-2822753
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28227532010-02-17 Statistical method on nonrandom clustering with application to somatic mutations in cancer Ye, Jingjing Pavlicek, Adam Lunney, Elizabeth A Rejto, Paul A Teng, Chi-Hse BMC Bioinformatics Methodology article BACKGROUND: Human cancer is caused by the accumulation of tumor-specific mutations in oncogenes and tumor suppressors that confer a selective growth advantage to cells. As a consequence of genomic instability and high levels of proliferation, many passenger mutations that do not contribute to the cancer phenotype arise alongside mutations that drive oncogenesis. While several approaches have been developed to separate driver mutations from passengers, few approaches can specifically identify activating driver mutations in oncogenes, which are more amenable for pharmacological intervention. RESULTS: We propose a new statistical method for detecting activating mutations in cancer by identifying nonrandom clusters of amino acid mutations in protein sequences. A probability model is derived using order statistics assuming that the location of amino acid mutations on a protein follows a uniform distribution. Our statistical measure is the differences between pair-wise order statistics, which is equivalent to the size of an amino acid mutation cluster, and the probabilities are derived from exact and approximate distributions of the statistical measure. Using data in the Catalog of Somatic Mutations in Cancer (COSMIC) database, we have demonstrated that our method detects well-known clusters of activating mutations in KRAS, BRAF, PI3K, and β-catenin. The method can also identify new cancer targets as well as gain-of-function mutations in tumor suppressors. CONCLUSIONS: Our proposed method is useful to discover activating driver mutations in cancer by identifying nonrandom clusters of somatic amino acid mutations in protein sequences. BioMed Central 2010-01-07 /pmc/articles/PMC2822753/ /pubmed/20053295 http://dx.doi.org/10.1186/1471-2105-11-11 Text en Copyright ©2010 Ye et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Ye, Jingjing
Pavlicek, Adam
Lunney, Elizabeth A
Rejto, Paul A
Teng, Chi-Hse
Statistical method on nonrandom clustering with application to somatic mutations in cancer
title Statistical method on nonrandom clustering with application to somatic mutations in cancer
title_full Statistical method on nonrandom clustering with application to somatic mutations in cancer
title_fullStr Statistical method on nonrandom clustering with application to somatic mutations in cancer
title_full_unstemmed Statistical method on nonrandom clustering with application to somatic mutations in cancer
title_short Statistical method on nonrandom clustering with application to somatic mutations in cancer
title_sort statistical method on nonrandom clustering with application to somatic mutations in cancer
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822753/
https://www.ncbi.nlm.nih.gov/pubmed/20053295
http://dx.doi.org/10.1186/1471-2105-11-11
work_keys_str_mv AT yejingjing statisticalmethodonnonrandomclusteringwithapplicationtosomaticmutationsincancer
AT pavlicekadam statisticalmethodonnonrandomclusteringwithapplicationtosomaticmutationsincancer
AT lunneyelizabetha statisticalmethodonnonrandomclusteringwithapplicationtosomaticmutationsincancer
AT rejtopaula statisticalmethodonnonrandomclusteringwithapplicationtosomaticmutationsincancer
AT tengchihse statisticalmethodonnonrandomclusteringwithapplicationtosomaticmutationsincancer