Cargando…

Extracting insights from the shape of complex data using topology

This paper applies topological methods to study complex high dimensional data sets by extracting shapes (patterns) and obtaining insights about them. Our method combines the best features of existing standard methodologies such as principal component and cluster analyses to provide a geometric repre...

Descripción completa

Detalles Bibliográficos
Autores principales: Lum, P. Y., Singh, G., Lehman, A., Ishkanov, T., Vejdemo-Johansson, M., Alagappan, M., Carlsson, J., Carlsson, G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3566620/
https://www.ncbi.nlm.nih.gov/pubmed/23393618
http://dx.doi.org/10.1038/srep01236
_version_ 1782258586520387584
author Lum, P. Y.
Singh, G.
Lehman, A.
Ishkanov, T.
Vejdemo-Johansson, M.
Alagappan, M.
Carlsson, J.
Carlsson, G.
author_facet Lum, P. Y.
Singh, G.
Lehman, A.
Ishkanov, T.
Vejdemo-Johansson, M.
Alagappan, M.
Carlsson, J.
Carlsson, G.
author_sort Lum, P. Y.
collection PubMed
description This paper applies topological methods to study complex high dimensional data sets by extracting shapes (patterns) and obtaining insights about them. Our method combines the best features of existing standard methodologies such as principal component and cluster analyses to provide a geometric representation of complex data sets. Through this hybrid method, we often find subgroups in data sets that traditional methodologies fail to find. Our method also permits the analysis of individual data sets as well as the analysis of relationships between related data sets. We illustrate the use of our method by applying it to three very different kinds of data, namely gene expression from breast tumors, voting data from the United States House of Representatives and player performance data from the NBA, in each case finding stratifications of the data which are more refined than those produced by standard methods.
format Online
Article
Text
id pubmed-3566620
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-35666202013-02-07 Extracting insights from the shape of complex data using topology Lum, P. Y. Singh, G. Lehman, A. Ishkanov, T. Vejdemo-Johansson, M. Alagappan, M. Carlsson, J. Carlsson, G. Sci Rep Article This paper applies topological methods to study complex high dimensional data sets by extracting shapes (patterns) and obtaining insights about them. Our method combines the best features of existing standard methodologies such as principal component and cluster analyses to provide a geometric representation of complex data sets. Through this hybrid method, we often find subgroups in data sets that traditional methodologies fail to find. Our method also permits the analysis of individual data sets as well as the analysis of relationships between related data sets. We illustrate the use of our method by applying it to three very different kinds of data, namely gene expression from breast tumors, voting data from the United States House of Representatives and player performance data from the NBA, in each case finding stratifications of the data which are more refined than those produced by standard methods. Nature Publishing Group 2013-02-07 /pmc/articles/PMC3566620/ /pubmed/23393618 http://dx.doi.org/10.1038/srep01236 Text en Copyright © 2013, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-nd/3.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Article
Lum, P. Y.
Singh, G.
Lehman, A.
Ishkanov, T.
Vejdemo-Johansson, M.
Alagappan, M.
Carlsson, J.
Carlsson, G.
Extracting insights from the shape of complex data using topology
title Extracting insights from the shape of complex data using topology
title_full Extracting insights from the shape of complex data using topology
title_fullStr Extracting insights from the shape of complex data using topology
title_full_unstemmed Extracting insights from the shape of complex data using topology
title_short Extracting insights from the shape of complex data using topology
title_sort extracting insights from the shape of complex data using topology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3566620/
https://www.ncbi.nlm.nih.gov/pubmed/23393618
http://dx.doi.org/10.1038/srep01236
work_keys_str_mv AT lumpy extractinginsightsfromtheshapeofcomplexdatausingtopology
AT singhg extractinginsightsfromtheshapeofcomplexdatausingtopology
AT lehmana extractinginsightsfromtheshapeofcomplexdatausingtopology
AT ishkanovt extractinginsightsfromtheshapeofcomplexdatausingtopology
AT vejdemojohanssonm extractinginsightsfromtheshapeofcomplexdatausingtopology
AT alagappanm extractinginsightsfromtheshapeofcomplexdatausingtopology
AT carlssonj extractinginsightsfromtheshapeofcomplexdatausingtopology
AT carlssong extractinginsightsfromtheshapeofcomplexdatausingtopology