Cargando…

Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data

BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multipl...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Xianchao, Shen, Xianjun, Jiang, Xingpeng, Wei, Kaiping, He, Tingting, Ma, Yuanyuan, Liu, Jiaqi, Hu, Xiaohua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302369/
https://www.ncbi.nlm.nih.gov/pubmed/30577738
http://dx.doi.org/10.1186/s12859-018-2537-z
_version_ 1783381962829856768
author Zhu, Xianchao
Shen, Xianjun
Jiang, Xingpeng
Wei, Kaiping
He, Tingting
Ma, Yuanyuan
Liu, Jiaqi
Hu, Xiaohua
author_facet Zhu, Xianchao
Shen, Xianjun
Jiang, Xingpeng
Wei, Kaiping
He, Tingting
Ma, Yuanyuan
Liu, Jiaqi
Hu, Xiaohua
author_sort Zhu, Xianchao
collection PubMed
description BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. CONCLUSIONS: The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity.
format Online
Article
Text
id pubmed-6302369
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63023692018-12-31 Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data Zhu, Xianchao Shen, Xianjun Jiang, Xingpeng Wei, Kaiping He, Tingting Ma, Yuanyuan Liu, Jiaqi Hu, Xiaohua BMC Bioinformatics Research BACKGROUND: The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. RESULTS: Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. CONCLUSIONS: The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity. BioMed Central 2018-12-21 /pmc/articles/PMC6302369/ /pubmed/30577738 http://dx.doi.org/10.1186/s12859-018-2537-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhu, Xianchao
Shen, Xianjun
Jiang, Xingpeng
Wei, Kaiping
He, Tingting
Ma, Yuanyuan
Liu, Jiaqi
Hu, Xiaohua
Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_full Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_fullStr Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_full_unstemmed Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_short Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_sort nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302369/
https://www.ncbi.nlm.nih.gov/pubmed/30577738
http://dx.doi.org/10.1186/s12859-018-2537-z
work_keys_str_mv AT zhuxianchao nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT shenxianjun nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT jiangxingpeng nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT weikaiping nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT hetingting nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT mayuanyuan nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT liujiaqi nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT huxiaohua nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata