Cargando…

Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data

The quality of samples preserved long term at ultralow temperatures has not been adequately studied. To improve our understanding, we need a strategy to analyze protein degradation and metabolism at subfreezing temperatures. To do this, we obtained liquid chromatography-mass spectrometry (LC/MS) dat...

Descripción completa

Detalles Bibliográficos
Autor principal: Adachi, Shun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5470695/
https://www.ncbi.nlm.nih.gov/pubmed/28614363
http://dx.doi.org/10.1371/journal.pone.0179180
_version_ 1783243810473508864
author Adachi, Shun
author_facet Adachi, Shun
author_sort Adachi, Shun
collection PubMed
description The quality of samples preserved long term at ultralow temperatures has not been adequately studied. To improve our understanding, we need a strategy to analyze protein degradation and metabolism at subfreezing temperatures. To do this, we obtained liquid chromatography-mass spectrometry (LC/MS) data of calculated protein signal intensities in HEK-293 cells. Our first attempt at directly clustering the values failed, most likely due to the so-called “curse of dimensionality”. The clusters were not reproducible, and the outputs differed with different methods. By utilizing rigid geometry with a prime ideal I-adic (p-adic) metric, however, we rearranged the sample clusters into a meaningful and reproducible order, and the results were the same with each of the different clustering methods tested. Furthermore, we have also succeeded in application of this method to expression array data in similar situations. Thus, we eliminated the “curse of dimensionality” from the data set, at least in clustering methods. It is possible that our approach determines a characteristic value of systems that follow a Boltzmann distribution.
format Online
Article
Text
id pubmed-5470695
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54706952017-07-03 Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data Adachi, Shun PLoS One Research Article The quality of samples preserved long term at ultralow temperatures has not been adequately studied. To improve our understanding, we need a strategy to analyze protein degradation and metabolism at subfreezing temperatures. To do this, we obtained liquid chromatography-mass spectrometry (LC/MS) data of calculated protein signal intensities in HEK-293 cells. Our first attempt at directly clustering the values failed, most likely due to the so-called “curse of dimensionality”. The clusters were not reproducible, and the outputs differed with different methods. By utilizing rigid geometry with a prime ideal I-adic (p-adic) metric, however, we rearranged the sample clusters into a meaningful and reproducible order, and the results were the same with each of the different clustering methods tested. Furthermore, we have also succeeded in application of this method to expression array data in similar situations. Thus, we eliminated the “curse of dimensionality” from the data set, at least in clustering methods. It is possible that our approach determines a characteristic value of systems that follow a Boltzmann distribution. Public Library of Science 2017-06-14 /pmc/articles/PMC5470695/ /pubmed/28614363 http://dx.doi.org/10.1371/journal.pone.0179180 Text en © 2017 Shun Adachi http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Adachi, Shun
Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title_full Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title_fullStr Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title_full_unstemmed Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title_short Rigid geometry solves “curse of dimensionality” effects in clustering methods: An application to omics data
title_sort rigid geometry solves “curse of dimensionality” effects in clustering methods: an application to omics data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5470695/
https://www.ncbi.nlm.nih.gov/pubmed/28614363
http://dx.doi.org/10.1371/journal.pone.0179180
work_keys_str_mv AT adachishun rigidgeometrysolvescurseofdimensionalityeffectsinclusteringmethodsanapplicationtoomicsdata