Cargando…

Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data

BACKGROUND: Large-scale collaborative precision medicine initiatives (e.g., The Cancer Genome Atlas (TCGA)) are yielding rich multi-omics data. Integrative analyses of the resulting multi-omics data, such as somatic mutation, copy number alteration (CNA), DNA methylation, miRNA, gene expression, and...

Descripción completa

Detalles Bibliográficos
Autores principales: EL-Manzalawy, Yasser, Hsieh, Tsung-Yu, Shivakumar, Manu, Kim, Dokyoon, Honavar, Vasant
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157248/
https://www.ncbi.nlm.nih.gov/pubmed/30255801
http://dx.doi.org/10.1186/s12920-018-0388-0
_version_ 1783358243772301312
author EL-Manzalawy, Yasser
Hsieh, Tsung-Yu
Shivakumar, Manu
Kim, Dokyoon
Honavar, Vasant
author_facet EL-Manzalawy, Yasser
Hsieh, Tsung-Yu
Shivakumar, Manu
Kim, Dokyoon
Honavar, Vasant
author_sort EL-Manzalawy, Yasser
collection PubMed
description BACKGROUND: Large-scale collaborative precision medicine initiatives (e.g., The Cancer Genome Atlas (TCGA)) are yielding rich multi-omics data. Integrative analyses of the resulting multi-omics data, such as somatic mutation, copy number alteration (CNA), DNA methylation, miRNA, gene expression, and protein expression, offer tantalizing possibilities for realizing the promise and potential of precision medicine in cancer prevention, diagnosis, and treatment by substantially improving our understanding of underlying mechanisms as well as the discovery of novel biomarkers for different types of cancers. However, such analyses present a number of challenges, including heterogeneity, and high-dimensionality of omics data. METHODS: We propose a novel framework for multi-omics data integration using multi-view feature selection. We introduce a novel multi-view feature selection algorithm, MRMR-mv, an adaptation of the well-known Min-Redundancy and Maximum-Relevance (MRMR) single-view feature selection algorithm to the multi-view setting. RESULTS: We report results of experiments using an ovarian cancer multi-omics dataset derived from the TCGA database on the task of predicting ovarian cancer survival. Our results suggest that multi-view models outperform both view-specific models (i.e., models trained and tested using a single type of omics data) and models based on two baseline data fusion methods. CONCLUSIONS: Our results demonstrate the potential of multi-view feature selection in integrative analyses and predictive modeling from multi-omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0388-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6157248
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61572482018-10-01 Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data EL-Manzalawy, Yasser Hsieh, Tsung-Yu Shivakumar, Manu Kim, Dokyoon Honavar, Vasant BMC Med Genomics Research BACKGROUND: Large-scale collaborative precision medicine initiatives (e.g., The Cancer Genome Atlas (TCGA)) are yielding rich multi-omics data. Integrative analyses of the resulting multi-omics data, such as somatic mutation, copy number alteration (CNA), DNA methylation, miRNA, gene expression, and protein expression, offer tantalizing possibilities for realizing the promise and potential of precision medicine in cancer prevention, diagnosis, and treatment by substantially improving our understanding of underlying mechanisms as well as the discovery of novel biomarkers for different types of cancers. However, such analyses present a number of challenges, including heterogeneity, and high-dimensionality of omics data. METHODS: We propose a novel framework for multi-omics data integration using multi-view feature selection. We introduce a novel multi-view feature selection algorithm, MRMR-mv, an adaptation of the well-known Min-Redundancy and Maximum-Relevance (MRMR) single-view feature selection algorithm to the multi-view setting. RESULTS: We report results of experiments using an ovarian cancer multi-omics dataset derived from the TCGA database on the task of predicting ovarian cancer survival. Our results suggest that multi-view models outperform both view-specific models (i.e., models trained and tested using a single type of omics data) and models based on two baseline data fusion methods. CONCLUSIONS: Our results demonstrate the potential of multi-view feature selection in integrative analyses and predictive modeling from multi-omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0388-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-14 /pmc/articles/PMC6157248/ /pubmed/30255801 http://dx.doi.org/10.1186/s12920-018-0388-0 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
EL-Manzalawy, Yasser
Hsieh, Tsung-Yu
Shivakumar, Manu
Kim, Dokyoon
Honavar, Vasant
Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title_full Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title_fullStr Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title_full_unstemmed Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title_short Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
title_sort min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157248/
https://www.ncbi.nlm.nih.gov/pubmed/30255801
http://dx.doi.org/10.1186/s12920-018-0388-0
work_keys_str_mv AT elmanzalawyyasser minredundancyandmaxrelevancemultiviewfeatureselectionforpredictingovariancancersurvivalusingmultiomicsdata
AT hsiehtsungyu minredundancyandmaxrelevancemultiviewfeatureselectionforpredictingovariancancersurvivalusingmultiomicsdata
AT shivakumarmanu minredundancyandmaxrelevancemultiviewfeatureselectionforpredictingovariancancersurvivalusingmultiomicsdata
AT kimdokyoon minredundancyandmaxrelevancemultiviewfeatureselectionforpredictingovariancancersurvivalusingmultiomicsdata
AT honavarvasant minredundancyandmaxrelevancemultiviewfeatureselectionforpredictingovariancancersurvivalusingmultiomicsdata