Cargando…
Integrating heterogeneous genomic data to accurately identify disease subtypes
BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of dat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4653838/ https://www.ncbi.nlm.nih.gov/pubmed/26589589 http://dx.doi.org/10.1186/s12920-015-0154-5 |
_version_ | 1782401982938480640 |
---|---|
author | Ren, Xianwen Fu, Hua Jin, Qi |
author_facet | Ren, Xianwen Fu, Hua Jin, Qi |
author_sort | Ren, Xianwen |
collection | PubMed |
description | BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of data is hard to create a comprehensive view of disease subtypes. Integrative methods are of pressing need. METHODS: In this study, we evaluated the possible issues that hamper integrative analysis of the heterogeneous disease data types, and proposed iBFE, an effective and efficient computational method to subvert those issues from a feature extraction perspective. RESULTS: Strict experiments on both simulated and real datasets demonstrated that iBFE can easily overcome issues caused by scale conflicts, noise conflicts, incompleteness of patient relationships, and conflicts between patient relationships, and that iBFE can effectively combine the merits of DNA methylation, mRNA expression and microRNA (miRNA) expression datasets to accurately identify disease subtypes of significantly different prognosis. CONCLUSIONS: iBFE is an effective and efficient method for integrative analysis of heterogeneous genomic data to accurately identify disease subtypes. The Matlab code of iBFE is freely available from http://zhangroup.aporc.org/iBFE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0154-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4653838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46538382015-11-21 Integrating heterogeneous genomic data to accurately identify disease subtypes Ren, Xianwen Fu, Hua Jin, Qi BMC Med Genomics Technical Advance BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of data is hard to create a comprehensive view of disease subtypes. Integrative methods are of pressing need. METHODS: In this study, we evaluated the possible issues that hamper integrative analysis of the heterogeneous disease data types, and proposed iBFE, an effective and efficient computational method to subvert those issues from a feature extraction perspective. RESULTS: Strict experiments on both simulated and real datasets demonstrated that iBFE can easily overcome issues caused by scale conflicts, noise conflicts, incompleteness of patient relationships, and conflicts between patient relationships, and that iBFE can effectively combine the merits of DNA methylation, mRNA expression and microRNA (miRNA) expression datasets to accurately identify disease subtypes of significantly different prognosis. CONCLUSIONS: iBFE is an effective and efficient method for integrative analysis of heterogeneous genomic data to accurately identify disease subtypes. The Matlab code of iBFE is freely available from http://zhangroup.aporc.org/iBFE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0154-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-20 /pmc/articles/PMC4653838/ /pubmed/26589589 http://dx.doi.org/10.1186/s12920-015-0154-5 Text en © Ren et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Advance Ren, Xianwen Fu, Hua Jin, Qi Integrating heterogeneous genomic data to accurately identify disease subtypes |
title | Integrating heterogeneous genomic data to accurately identify disease subtypes |
title_full | Integrating heterogeneous genomic data to accurately identify disease subtypes |
title_fullStr | Integrating heterogeneous genomic data to accurately identify disease subtypes |
title_full_unstemmed | Integrating heterogeneous genomic data to accurately identify disease subtypes |
title_short | Integrating heterogeneous genomic data to accurately identify disease subtypes |
title_sort | integrating heterogeneous genomic data to accurately identify disease subtypes |
topic | Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4653838/ https://www.ncbi.nlm.nih.gov/pubmed/26589589 http://dx.doi.org/10.1186/s12920-015-0154-5 |
work_keys_str_mv | AT renxianwen integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes AT fuhua integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes AT jinqi integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes |