Cargando…
Mercator: a pipeline for multi-method, unsupervised visualization and distance generation
SUMMARY: Unsupervised machine learning provides tools for researchers to uncover latent patterns in large-scale data, based on calculated distances between observations. Methods to visualize high-dimensional data based on these distances can elucidate subtypes and interactions within multi-dimension...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8428582/ https://www.ncbi.nlm.nih.gov/pubmed/33515233 http://dx.doi.org/10.1093/bioinformatics/btab037 |
_version_ | 1783750406735659008 |
---|---|
author | Abrams, Zachary B. Coombes, Caitlin E. Li, Suli Coombes, Kevin R. |
author_facet | Abrams, Zachary B. Coombes, Caitlin E. Li, Suli Coombes, Kevin R. |
author_sort | Abrams, Zachary B. |
collection | PubMed |
description | SUMMARY: Unsupervised machine learning provides tools for researchers to uncover latent patterns in large-scale data, based on calculated distances between observations. Methods to visualize high-dimensional data based on these distances can elucidate subtypes and interactions within multi-dimensional and high-throughput data. However, researchers can select from a vast number of distance metrics and visualizations, each with their own strengths and weaknesses. The Mercator R package facilitates selection of a biologically meaningful distance from 10 metrics, together appropriate for binary, categorical and continuous data, and visualization with 5 standard and high-dimensional graphics tools. Mercator provides a user-friendly pipeline for informaticians or biologists to perform unsupervised analyses, from exploratory pattern recognition to production of publication-quality graphics. AVAILABILITYAND IMPLEMENTATION: Mercator is freely available at the Comprehensive R Archive Network (https://cran.r-project.org/web/packages/Mercator/index.html). |
format | Online Article Text |
id | pubmed-8428582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-84285822021-09-10 Mercator: a pipeline for multi-method, unsupervised visualization and distance generation Abrams, Zachary B. Coombes, Caitlin E. Li, Suli Coombes, Kevin R. Bioinformatics Applications Notes SUMMARY: Unsupervised machine learning provides tools for researchers to uncover latent patterns in large-scale data, based on calculated distances between observations. Methods to visualize high-dimensional data based on these distances can elucidate subtypes and interactions within multi-dimensional and high-throughput data. However, researchers can select from a vast number of distance metrics and visualizations, each with their own strengths and weaknesses. The Mercator R package facilitates selection of a biologically meaningful distance from 10 metrics, together appropriate for binary, categorical and continuous data, and visualization with 5 standard and high-dimensional graphics tools. Mercator provides a user-friendly pipeline for informaticians or biologists to perform unsupervised analyses, from exploratory pattern recognition to production of publication-quality graphics. AVAILABILITYAND IMPLEMENTATION: Mercator is freely available at the Comprehensive R Archive Network (https://cran.r-project.org/web/packages/Mercator/index.html). Oxford University Press 2021-01-30 /pmc/articles/PMC8428582/ /pubmed/33515233 http://dx.doi.org/10.1093/bioinformatics/btab037 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Abrams, Zachary B. Coombes, Caitlin E. Li, Suli Coombes, Kevin R. Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title | Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title_full | Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title_fullStr | Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title_full_unstemmed | Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title_short | Mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
title_sort | mercator: a pipeline for multi-method, unsupervised visualization and distance generation |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8428582/ https://www.ncbi.nlm.nih.gov/pubmed/33515233 http://dx.doi.org/10.1093/bioinformatics/btab037 |
work_keys_str_mv | AT abramszacharyb mercatorapipelineformultimethodunsupervisedvisualizationanddistancegeneration AT coombescaitline mercatorapipelineformultimethodunsupervisedvisualizationanddistancegeneration AT lisuli mercatorapipelineformultimethodunsupervisedvisualizationanddistancegeneration AT coombeskevinr mercatorapipelineformultimethodunsupervisedvisualizationanddistancegeneration |