Cargando…

Integrating heterogeneous genomic data to accurately identify disease subtypes

BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Xianwen, Fu, Hua, Jin, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4653838/
https://www.ncbi.nlm.nih.gov/pubmed/26589589
http://dx.doi.org/10.1186/s12920-015-0154-5
_version_ 1782401982938480640
author Ren, Xianwen
Fu, Hua
Jin, Qi
author_facet Ren, Xianwen
Fu, Hua
Jin, Qi
author_sort Ren, Xianwen
collection PubMed
description BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of data is hard to create a comprehensive view of disease subtypes. Integrative methods are of pressing need. METHODS: In this study, we evaluated the possible issues that hamper integrative analysis of the heterogeneous disease data types, and proposed iBFE, an effective and efficient computational method to subvert those issues from a feature extraction perspective. RESULTS: Strict experiments on both simulated and real datasets demonstrated that iBFE can easily overcome issues caused by scale conflicts, noise conflicts, incompleteness of patient relationships, and conflicts between patient relationships, and that iBFE can effectively combine the merits of DNA methylation, mRNA expression and microRNA (miRNA) expression datasets to accurately identify disease subtypes of significantly different prognosis. CONCLUSIONS: iBFE is an effective and efficient method for integrative analysis of heterogeneous genomic data to accurately identify disease subtypes. The Matlab code of iBFE is freely available from http://zhangroup.aporc.org/iBFE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0154-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4653838
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46538382015-11-21 Integrating heterogeneous genomic data to accurately identify disease subtypes Ren, Xianwen Fu, Hua Jin, Qi BMC Med Genomics Technical Advance BACKGROUND: High-throughput biotechnologies have been widely used to characterize clinical samples from various perspectives e.g., epigenomics, genomics and transcriptomics. However, because of the heterogeneity of these technologies and their outputs, individual analysis of the various types of data is hard to create a comprehensive view of disease subtypes. Integrative methods are of pressing need. METHODS: In this study, we evaluated the possible issues that hamper integrative analysis of the heterogeneous disease data types, and proposed iBFE, an effective and efficient computational method to subvert those issues from a feature extraction perspective. RESULTS: Strict experiments on both simulated and real datasets demonstrated that iBFE can easily overcome issues caused by scale conflicts, noise conflicts, incompleteness of patient relationships, and conflicts between patient relationships, and that iBFE can effectively combine the merits of DNA methylation, mRNA expression and microRNA (miRNA) expression datasets to accurately identify disease subtypes of significantly different prognosis. CONCLUSIONS: iBFE is an effective and efficient method for integrative analysis of heterogeneous genomic data to accurately identify disease subtypes. The Matlab code of iBFE is freely available from http://zhangroup.aporc.org/iBFE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0154-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-20 /pmc/articles/PMC4653838/ /pubmed/26589589 http://dx.doi.org/10.1186/s12920-015-0154-5 Text en © Ren et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Advance
Ren, Xianwen
Fu, Hua
Jin, Qi
Integrating heterogeneous genomic data to accurately identify disease subtypes
title Integrating heterogeneous genomic data to accurately identify disease subtypes
title_full Integrating heterogeneous genomic data to accurately identify disease subtypes
title_fullStr Integrating heterogeneous genomic data to accurately identify disease subtypes
title_full_unstemmed Integrating heterogeneous genomic data to accurately identify disease subtypes
title_short Integrating heterogeneous genomic data to accurately identify disease subtypes
title_sort integrating heterogeneous genomic data to accurately identify disease subtypes
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4653838/
https://www.ncbi.nlm.nih.gov/pubmed/26589589
http://dx.doi.org/10.1186/s12920-015-0154-5
work_keys_str_mv AT renxianwen integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes
AT fuhua integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes
AT jinqi integratingheterogeneousgenomicdatatoaccuratelyidentifydiseasesubtypes