Cargando…
EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma
Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354965/ https://www.ncbi.nlm.nih.gov/pubmed/30703089 http://dx.doi.org/10.1371/journal.pone.0204186 |
_version_ | 1783391275505942528 |
---|---|
author | Shao, Borong Bjaanæs, Maria Moksnes Helland, Åslaug Schütte, Christof Conrad, Tim |
author_facet | Shao, Borong Bjaanæs, Maria Moksnes Helland, Åslaug Schütte, Christof Conrad, Tim |
author_sort | Shao, Borong |
collection | PubMed |
description | Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select the features that generalize well on independent data. Even the integration of biological networks does not mitigate this issue because the networks are large and many of their components are not relevant for the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer prognosis relevant networks based on epithelial mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures from different omics data levels, we combined these single-omics signatures into multi-omics signatures, which improved sample stratifications significantly. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets and significant sample stratifications were obtained. |
format | Online Article Text |
id | pubmed-6354965 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63549652019-02-15 EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma Shao, Borong Bjaanæs, Maria Moksnes Helland, Åslaug Schütte, Christof Conrad, Tim PLoS One Research Article Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select the features that generalize well on independent data. Even the integration of biological networks does not mitigate this issue because the networks are large and many of their components are not relevant for the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer prognosis relevant networks based on epithelial mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures from different omics data levels, we combined these single-omics signatures into multi-omics signatures, which improved sample stratifications significantly. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets and significant sample stratifications were obtained. Public Library of Science 2019-01-31 /pmc/articles/PMC6354965/ /pubmed/30703089 http://dx.doi.org/10.1371/journal.pone.0204186 Text en © 2019 Shao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Shao, Borong Bjaanæs, Maria Moksnes Helland, Åslaug Schütte, Christof Conrad, Tim EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title | EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title_full | EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title_fullStr | EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title_full_unstemmed | EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title_short | EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma |
title_sort | emt network-based feature selection improves prognosis prediction in lung adenocarcinoma |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6354965/ https://www.ncbi.nlm.nih.gov/pubmed/30703089 http://dx.doi.org/10.1371/journal.pone.0204186 |
work_keys_str_mv | AT shaoborong emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma AT bjaanæsmariamoksnes emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma AT hellandaslaug emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma AT schuttechristof emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma AT conradtim emtnetworkbasedfeatureselectionimprovesprognosispredictioninlungadenocarcinoma |