Cargando…
Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few atte...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243097/ https://www.ncbi.nlm.nih.gov/pubmed/25350393 http://dx.doi.org/10.1186/1755-8794-7-S2-S1 |
_version_ | 1782346058485989376 |
---|---|
author | Xu, Weiwei Jiang, Xingpeng Hu, Xiaohua Li, Guangrong |
author_facet | Xu, Weiwei Jiang, Xingpeng Hu, Xiaohua Li, Guangrong |
author_sort | Xu, Weiwei |
collection | PubMed |
description | BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. METHODS: Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. RESULTS: In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. CONCLUSIONS: Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets. |
format | Online Article Text |
id | pubmed-4243097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42430972014-11-26 Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization Xu, Weiwei Jiang, Xingpeng Hu, Xiaohua Li, Guangrong BMC Med Genomics Research BACKGROUND: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. METHODS: Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. RESULTS: In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. CONCLUSIONS: Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets. BioMed Central 2014-10-22 /pmc/articles/PMC4243097/ /pubmed/25350393 http://dx.doi.org/10.1186/1755-8794-7-S2-S1 Text en Copyright © 2014 Xu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Xu, Weiwei Jiang, Xingpeng Hu, Xiaohua Li, Guangrong Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title | Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title_full | Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title_fullStr | Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title_full_unstemmed | Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title_short | Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization |
title_sort | visualization of genetic disease-phenotype similarities by multiple maps t-sne with laplacian regularization |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243097/ https://www.ncbi.nlm.nih.gov/pubmed/25350393 http://dx.doi.org/10.1186/1755-8794-7-S2-S1 |
work_keys_str_mv | AT xuweiwei visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization AT jiangxingpeng visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization AT huxiaohua visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization AT liguangrong visualizationofgeneticdiseasephenotypesimilaritiesbymultiplemapstsnewithlaplacianregularization |