Cargando…

Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization

BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few atte...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Weiwei, Jiang, Xingpeng, Hu, Xiaohua, Li, Guangrong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243097/
https://www.ncbi.nlm.nih.gov/pubmed/25350393
http://dx.doi.org/10.1186/1755-8794-7-S2-S1
_version_ 1782346058485989376
author Xu, Weiwei
Jiang, Xingpeng
Hu, Xiaohua
Li, Guangrong
author_facet Xu, Weiwei
Jiang, Xingpeng
Hu, Xiaohua
Li, Guangrong
author_sort Xu, Weiwei
collection PubMed
description BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. METHODS: Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. RESULTS: In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. CONCLUSIONS: Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets.
format Online
Article
Text
id pubmed-4243097
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42430972014-11-26 Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization Xu, Weiwei Jiang, Xingpeng Hu, Xiaohua Li, Guangrong BMC Med Genomics Research BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. METHODS: Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. RESULTS: In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. CONCLUSIONS: Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets. BioMed Central 2014-10-22 /pmc/articles/PMC4243097/ /pubmed/25350393 http://dx.doi.org/10.1186/1755-8794-7-S2-S1 Text en Copyright © 2014 Xu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Xu, Weiwei
Jiang, Xingpeng
Hu, Xiaohua
Li, Guangrong
Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title_full Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title_fullStr Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title_full_unstemmed Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title_short Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
title_sort visualization of genetic disease-phenotype similarities by multiple maps t-sne with laplacian regularization
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243097/
https://www.ncbi.nlm.nih.gov/pubmed/25350393
http://dx.doi.org/10.1186/1755-8794-7-S2-S1
work_keys_str_mv AT xuweiwei visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization
AT jiangxingpeng visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization
AT huxiaohua visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization
AT liguangrong visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization