Cargando…

Mutation Clusters from Cancer Exome

We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kakushadze, Zura, Yu, Willie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575665/
https://www.ncbi.nlm.nih.gov/pubmed/28809811
http://dx.doi.org/10.3390/genes8080201
_version_ 1783260098040168448
author Kakushadze, Zura
Yu, Willie
author_facet Kakushadze, Zura
Yu, Willie
author_sort Kakushadze, Zura
collection PubMed
description We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally-costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics, such as novel blood-test methods currently in development.
format Online
Article
Text
id pubmed-5575665
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-55756652017-09-01 Mutation Clusters from Cancer Exome Kakushadze, Zura Yu, Willie Genes (Basel) Article We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally-costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics, such as novel blood-test methods currently in development. MDPI 2017-08-15 /pmc/articles/PMC5575665/ /pubmed/28809811 http://dx.doi.org/10.3390/genes8080201 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kakushadze, Zura
Yu, Willie
Mutation Clusters from Cancer Exome
title Mutation Clusters from Cancer Exome
title_full Mutation Clusters from Cancer Exome
title_fullStr Mutation Clusters from Cancer Exome
title_full_unstemmed Mutation Clusters from Cancer Exome
title_short Mutation Clusters from Cancer Exome
title_sort mutation clusters from cancer exome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575665/
https://www.ncbi.nlm.nih.gov/pubmed/28809811
http://dx.doi.org/10.3390/genes8080201
work_keys_str_mv AT kakushadzezura mutationclustersfromcancerexome
AT yuwillie mutationclustersfromcancerexome