Cargando…

EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma

Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Borong, Bjaanæs, Maria Moksnes, Helland, Åslaug, Schütte, Christof, Conrad, Tim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354965/
https://www.ncbi.nlm.nih.gov/pubmed/30703089
http://dx.doi.org/10.1371/journal.pone.0204186
_version_ 1783391275505942528
author Shao, Borong
Bjaanæs, Maria Moksnes
Helland, Åslaug
Schütte, Christof
Conrad, Tim
author_facet Shao, Borong
Bjaanæs, Maria Moksnes
Helland, Åslaug
Schütte, Christof
Conrad, Tim
author_sort Shao, Borong
collection PubMed
description Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select the features that generalize well on independent data. Even the integration of biological networks does not mitigate this issue because the networks are large and many of their components are not relevant for the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer prognosis relevant networks based on epithelial mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures from different omics data levels, we combined these single-omics signatures into multi-omics signatures, which improved sample stratifications significantly. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets and significant sample stratifications were obtained.
format Online
Article
Text
id pubmed-6354965
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63549652019-02-15 EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma Shao, Borong Bjaanæs, Maria Moksnes Helland, Åslaug Schütte, Christof Conrad, Tim PLoS One Research Article Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select the features that generalize well on independent data. Even the integration of biological networks does not mitigate this issue because the networks are large and many of their components are not relevant for the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer prognosis relevant networks based on epithelial mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures from different omics data levels, we combined these single-omics signatures into multi-omics signatures, which improved sample stratifications significantly. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets and significant sample stratifications were obtained. Public Library of Science 2019-01-31 /pmc/articles/PMC6354965/ /pubmed/30703089 http://dx.doi.org/10.1371/journal.pone.0204186 Text en © 2019 Shao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Shao, Borong
Bjaanæs, Maria Moksnes
Helland, Åslaug
Schütte, Christof
Conrad, Tim
EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title_full EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title_fullStr EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title_full_unstemmed EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title_short EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
title_sort emt network-based feature selection improves prognosis prediction in lung adenocarcinoma
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354965/
https://www.ncbi.nlm.nih.gov/pubmed/30703089
http://dx.doi.org/10.1371/journal.pone.0204186
work_keys_str_mv AT shaoborong emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma
AT bjaanæsmariamoksnes emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma
AT hellandaslaug emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma
AT schuttechristof emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma
AT conradtim emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma