Cargando…

Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis

Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but...

Descripción completa

Detalles Bibliográficos
Autores principales: Hikichi, Shiori, Sugimoto, Masahiro, Tomita, Masaru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220948/
https://www.ncbi.nlm.nih.gov/pubmed/32404965
http://dx.doi.org/10.1038/s41598-020-64870-z
_version_ 1783533268291813376
author Hikichi, Shiori
Sugimoto, Masahiro
Tomita, Masaru
author_facet Hikichi, Shiori
Sugimoto, Masahiro
Tomita, Masaru
author_sort Hikichi, Shiori
collection PubMed
description Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but robust feature selection method using a correlation-centred approach to select minimal gene sets that have both high predictive and generalisation abilities. A multiple logistic regression model was used to predict 5-year metastases of patients with breast cancer. Gene expression data obtained from tumour samples of lymph node-negative breast cancer patients were randomly split into training and validation data. Our method selected 12 genes using training data and this showed a higher area under the receiver operating characteristic curve of 0.730 compared with 0.579 yielded by previously reported 76 genes. The signature with the predictive model was validated in an independent dataset, and its higher generalization ability was observed. Gene ontology analyses revealed that our method consistently selected genes with identical functions which frequently selected by the 76 genes. Taken together, our method identifies fewer gene sets bearing high predictive abilities, which would be versatile and applicable to predict other factors such as the outcomes of medical treatments and prognoses of other cancer types.
format Online
Article
Text
id pubmed-7220948
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72209482020-05-20 Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis Hikichi, Shiori Sugimoto, Masahiro Tomita, Masaru Sci Rep Article Predictions of distant cancer metastasis based on gene signatures are studied intensively to realise precise diagnosis and treatments. Gene selection i.e. feature selection is a cornerstone to both establish accurate predictions and understand underlying pathologies. Here, we developed a simple but robust feature selection method using a correlation-centred approach to select minimal gene sets that have both high predictive and generalisation abilities. A multiple logistic regression model was used to predict 5-year metastases of patients with breast cancer. Gene expression data obtained from tumour samples of lymph node-negative breast cancer patients were randomly split into training and validation data. Our method selected 12 genes using training data and this showed a higher area under the receiver operating characteristic curve of 0.730 compared with 0.579 yielded by previously reported 76 genes. The signature with the predictive model was validated in an independent dataset, and its higher generalization ability was observed. Gene ontology analyses revealed that our method consistently selected genes with identical functions which frequently selected by the 76 genes. Taken together, our method identifies fewer gene sets bearing high predictive abilities, which would be versatile and applicable to predict other factors such as the outcomes of medical treatments and prognoses of other cancer types. Nature Publishing Group UK 2020-05-13 /pmc/articles/PMC7220948/ /pubmed/32404965 http://dx.doi.org/10.1038/s41598-020-64870-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Hikichi, Shiori
Sugimoto, Masahiro
Tomita, Masaru
Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title_full Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title_fullStr Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title_full_unstemmed Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title_short Correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
title_sort correlation-centred variable selection of a gene expression signature to predict breast cancer metastasis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220948/
https://www.ncbi.nlm.nih.gov/pubmed/32404965
http://dx.doi.org/10.1038/s41598-020-64870-z
work_keys_str_mv AT hikichishiori correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis
AT sugimotomasahiro correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis
AT tomitamasaru correlationcentredvariableselectionofageneexpressionsignaturetopredictbreastcancermetastasis