Cargando…
Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multipl...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302369/ https://www.ncbi.nlm.nih.gov/pubmed/30577738 http://dx.doi.org/10.1186/s12859-018-2537-z |
_version_ | 1783381962829856768 |
---|---|
author | Zhu, Xianchao Shen, Xianjun Jiang, Xingpeng Wei, Kaiping He, Tingting Ma, Yuanyuan Liu, Jiaqi Hu, Xiaohua |
author_facet | Zhu, Xianchao Shen, Xianjun Jiang, Xingpeng Wei, Kaiping He, Tingting Ma, Yuanyuan Liu, Jiaqi Hu, Xiaohua |
author_sort | Zhu, Xianchao |
collection | PubMed |
description | BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. CONCLUSIONS: The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity. |
format | Online Article Text |
id | pubmed-6302369 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63023692018-12-31 Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data Zhu, Xianchao Shen, Xianjun Jiang, Xingpeng Wei, Kaiping He, Tingting Ma, Yuanyuan Liu, Jiaqi Hu, Xiaohua BMC Bioinformatics Research BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. CONCLUSIONS: The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity. BioMed Central 2018-12-21 /pmc/articles/PMC6302369/ /pubmed/30577738 http://dx.doi.org/10.1186/s12859-018-2537-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhu, Xianchao Shen, Xianjun Jiang, Xingpeng Wei, Kaiping He, Tingting Ma, Yuanyuan Liu, Jiaqi Hu, Xiaohua Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_full | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_fullStr | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_full_unstemmed | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_short | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_sort | nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302369/ https://www.ncbi.nlm.nih.gov/pubmed/30577738 http://dx.doi.org/10.1186/s12859-018-2537-z |
work_keys_str_mv | AT zhuxianchao nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT shenxianjun nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT jiangxingpeng nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT weikaiping nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT hetingting nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT mayuanyuan nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT liujiaqi nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT huxiaohua nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata |