Cargando…
Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220948/ https://www.ncbi.nlm.nih.gov/pubmed/32404965 http://dx.doi.org/10.1038/s41598-020-64870-z |
_version_ | 1783533268291813376 |
---|---|
author | Hikichi, Shiori Sugimoto, Masahiro Tomita, Masaru |
author_facet | Hikichi, Shiori Sugimoto, Masahiro Tomita, Masaru |
author_sort | Hikichi, Shiori |
collection | PubMed |
description | Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but robust feature selection method using a correlation-centred approach to select minimal gene sets that have both high predictive and generalisation abilities. A multiple logistic regression model was used to predict 5-year metastases of patients with breast cancer. Gene expression data obtained from tumour samples of lymph node-negative breast cancer patients were randomly split into training and validation data. Our method selected 12 genes using training data and this showed a higher area under the receiver operating characteristic curve of 0.730 compared with 0.579 yielded by previously reported 76 genes. The signature with the predictive model was validated in an independent dataset, and its higher generalization ability was observed. Gene ontology analyses revealed that our method consistently selected genes with identical functions which frequently selected by the 76 genes. Taken together, our method identifies fewer gene sets bearing high predictive abilities, which would be versatile and applicable to predict other factors such as the outcomes of medical treatments and prognoses of other cancer types. |
format | Online Article Text |
id | pubmed-7220948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-72209482020-05-20 Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis Hikichi, Shiori Sugimoto, Masahiro Tomita, Masaru Sci Rep Article Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but robust feature selection method using a correlation-centred approach to select minimal gene sets that have both high predictive and generalisation abilities. A multiple logistic regression model was used to predict 5-year metastases of patients with breast cancer. Gene expression data obtained from tumour samples of lymph node-negative breast cancer patients were randomly split into training and validation data. Our method selected 12 genes using training data and this showed a higher area under the receiver operating characteristic curve of 0.730 compared with 0.579 yielded by previously reported 76 genes. The signature with the predictive model was validated in an independent dataset, and its higher generalization ability was observed. Gene ontology analyses revealed that our method consistently selected genes with identical functions which frequently selected by the 76 genes. Taken together, our method identifies fewer gene sets bearing high predictive abilities, which would be versatile and applicable to predict other factors such as the outcomes of medical treatments and prognoses of other cancer types. Nature Publishing Group UK 2020-05-13 /pmc/articles/PMC7220948/ /pubmed/32404965 http://dx.doi.org/10.1038/s41598-020-64870-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Hikichi, Shiori Sugimoto, Masahiro Tomita, Masaru Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title | Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title_full | Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title_fullStr | Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title_full_unstemmed | Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title_short | Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
title_sort | correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220948/ https://www.ncbi.nlm.nih.gov/pubmed/32404965 http://dx.doi.org/10.1038/s41598-020-64870-z |
work_keys_str_mv | AT hikichishiori correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis AT sugimotomasahiro correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis AT tomitamasaru correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis |