Cargando…
Mutation Clusters from Cancer Exome
We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stabl...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575665/ https://www.ncbi.nlm.nih.gov/pubmed/28809811 http://dx.doi.org/10.3390/genes8080201 |
_version_ | 1783260098040168448 |
---|---|
author | Kakushadze, Zura Yu, Willie |
author_facet | Kakushadze, Zura Yu, Willie |
author_sort | Kakushadze, Zura |
collection | PubMed |
description | We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally-costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics, such as novel blood-test methods currently in development. |
format | Online Article Text |
id | pubmed-5575665 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-55756652017-09-01 Mutation Clusters from Cancer Exome Kakushadze, Zura Yu, Willie Genes (Basel) Article We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit a mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally-costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics, such as novel blood-test methods currently in development. MDPI 2017-08-15 /pmc/articles/PMC5575665/ /pubmed/28809811 http://dx.doi.org/10.3390/genes8080201 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kakushadze, Zura Yu, Willie Mutation Clusters from Cancer Exome |
title | Mutation Clusters from Cancer Exome |
title_full | Mutation Clusters from Cancer Exome |
title_fullStr | Mutation Clusters from Cancer Exome |
title_full_unstemmed | Mutation Clusters from Cancer Exome |
title_short | Mutation Clusters from Cancer Exome |
title_sort | mutation clusters from cancer exome |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575665/ https://www.ncbi.nlm.nih.gov/pubmed/28809811 http://dx.doi.org/10.3390/genes8080201 |
work_keys_str_mv | AT kakushadzezura mutationclustersfromcancerexome AT yuwillie mutationclustersfromcancerexome |