Cargando…

Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data

The spectrum of modern molecular high-throughput assaying includes diverse technologies such as microarray gene expression, miRNA expression, proteomics, DNA methylation, among many others. Now that these technologies have matured and become increasingly accessible, the next frontier is to collect “...

Descripción completa

Detalles Bibliográficos
Autores principales: Ray, Bisakha, Henaff, Mikael, Ma, Sisi, Efstathiadis, Efstratios, Peskin, Eric R., Picone, Marco, Poli, Tito, Aliferis, Constantin F., Statnikov, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3961740/
https://www.ncbi.nlm.nih.gov/pubmed/24651673
http://dx.doi.org/10.1038/srep04411
_version_ 1782308338570100736
author Ray, Bisakha
Henaff, Mikael
Ma, Sisi
Efstathiadis, Efstratios
Peskin, Eric R.
Picone, Marco
Poli, Tito
Aliferis, Constantin F.
Statnikov, Alexander
author_facet Ray, Bisakha
Henaff, Mikael
Ma, Sisi
Efstathiadis, Efstratios
Peskin, Eric R.
Picone, Marco
Poli, Tito
Aliferis, Constantin F.
Statnikov, Alexander
author_sort Ray, Bisakha
collection PubMed
description The spectrum of modern molecular high-throughput assaying includes diverse technologies such as microarray gene expression, miRNA expression, proteomics, DNA methylation, among many others. Now that these technologies have matured and become increasingly accessible, the next frontier is to collect “multi-modal” data for the same set of subjects and conduct integrative, multi-level analyses. While multi-modal data does contain distinct biological information that can be useful for answering complex biology questions, its value for predicting clinical phenotypes and contributions of each type of input remain unknown. We obtained 47 datasets/predictive tasks that in total span over 9 data modalities and executed analytic experiments for predicting various clinical phenotypes and outcomes. First, we analyzed each modality separately using uni-modal approaches based on several state-of-the-art supervised classification and feature selection methods. Then, we applied integrative multi-modal classification techniques. We have found that gene expression is the most predictively informative modality. Other modalities such as protein expression, miRNA expression, and DNA methylation also provide highly predictive results, which are often statistically comparable but not superior to gene expression data. Integrative multi-modal analyses generally do not increase predictive signal compared to gene expression data.
format Online
Article
Text
id pubmed-3961740
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-39617402014-03-21 Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data Ray, Bisakha Henaff, Mikael Ma, Sisi Efstathiadis, Efstratios Peskin, Eric R. Picone, Marco Poli, Tito Aliferis, Constantin F. Statnikov, Alexander Sci Rep Article The spectrum of modern molecular high-throughput assaying includes diverse technologies such as microarray gene expression, miRNA expression, proteomics, DNA methylation, among many others. Now that these technologies have matured and become increasingly accessible, the next frontier is to collect “multi-modal” data for the same set of subjects and conduct integrative, multi-level analyses. While multi-modal data does contain distinct biological information that can be useful for answering complex biology questions, its value for predicting clinical phenotypes and contributions of each type of input remain unknown. We obtained 47 datasets/predictive tasks that in total span over 9 data modalities and executed analytic experiments for predicting various clinical phenotypes and outcomes. First, we analyzed each modality separately using uni-modal approaches based on several state-of-the-art supervised classification and feature selection methods. Then, we applied integrative multi-modal classification techniques. We have found that gene expression is the most predictively informative modality. Other modalities such as protein expression, miRNA expression, and DNA methylation also provide highly predictive results, which are often statistically comparable but not superior to gene expression data. Integrative multi-modal analyses generally do not increase predictive signal compared to gene expression data. Nature Publishing Group 2014-03-21 /pmc/articles/PMC3961740/ /pubmed/24651673 http://dx.doi.org/10.1038/srep04411 Text en Copyright © 2014, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-nd/3.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Article
Ray, Bisakha
Henaff, Mikael
Ma, Sisi
Efstathiadis, Efstratios
Peskin, Eric R.
Picone, Marco
Poli, Tito
Aliferis, Constantin F.
Statnikov, Alexander
Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title_full Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title_fullStr Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title_full_unstemmed Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title_short Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
title_sort information content and analysis methods for multi-modal high-throughput biomedical data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3961740/
https://www.ncbi.nlm.nih.gov/pubmed/24651673
http://dx.doi.org/10.1038/srep04411
work_keys_str_mv AT raybisakha informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT henaffmikael informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT masisi informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT efstathiadisefstratios informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT peskinericr informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT piconemarco informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT politito informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT aliferisconstantinf informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata
AT statnikovalexander informationcontentandanalysismethodsformultimodalhighthroughputbiomedicaldata