Cargando…

Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning

Early detection of breast cancer and its correct stage determination are important for prognosis and rendering appropriate personalized clinical treatment to breast cancer patients. However, despite considerable efforts and progress, there is a need to identify the specific genomic factors responsib...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy, Shikha, Kumar, Rakesh, Mittal, Vaibhav, Gupta, Dinesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7057992/
https://www.ncbi.nlm.nih.gov/pubmed/32139710
http://dx.doi.org/10.1038/s41598-020-60740-w
_version_ 1783503778707668992
author Roy, Shikha
Kumar, Rakesh
Mittal, Vaibhav
Gupta, Dinesh
author_facet Roy, Shikha
Kumar, Rakesh
Mittal, Vaibhav
Gupta, Dinesh
author_sort Roy, Shikha
collection PubMed
description Early detection of breast cancer and its correct stage determination are important for prognosis and rendering appropriate personalized clinical treatment to breast cancer patients. However, despite considerable efforts and progress, there is a need to identify the specific genomic factors responsible for, or accompanying Invasive Ductal Carcinoma (IDC) progression stages, which can aid the determination of the correct cancer stages. We have developed two-class machine-learning classification models to differentiate the early and late stages of IDC. The prediction models are trained with RNA-seq gene expression profiles representing different IDC stages of 610 patients, obtained from The Cancer Genome Atlas (TCGA). Different supervised learning algorithms were trained and evaluated with an enriched model learning, facilitated by different feature selection methods. We also developed a machine-learning classifier trained on the same datasets with training sets reduced data corresponding to IDC driver genes. Based on these two classifiers, we have developed a web-server Duct-BRCA-CSP to predict early stage from late stages of IDC based on input RNA-seq gene expression profiles. The analysis conducted by us also enables deeper insights into the stage-dependent molecular events accompanying IDC progression. The server is publicly available at http://bioinfo.icgeb.res.in/duct-BRCA-CSP.
format Online
Article
Text
id pubmed-7057992
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70579922020-03-12 Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning Roy, Shikha Kumar, Rakesh Mittal, Vaibhav Gupta, Dinesh Sci Rep Article Early detection of breast cancer and its correct stage determination are important for prognosis and rendering appropriate personalized clinical treatment to breast cancer patients. However, despite considerable efforts and progress, there is a need to identify the specific genomic factors responsible for, or accompanying Invasive Ductal Carcinoma (IDC) progression stages, which can aid the determination of the correct cancer stages. We have developed two-class machine-learning classification models to differentiate the early and late stages of IDC. The prediction models are trained with RNA-seq gene expression profiles representing different IDC stages of 610 patients, obtained from The Cancer Genome Atlas (TCGA). Different supervised learning algorithms were trained and evaluated with an enriched model learning, facilitated by different feature selection methods. We also developed a machine-learning classifier trained on the same datasets with training sets reduced data corresponding to IDC driver genes. Based on these two classifiers, we have developed a web-server Duct-BRCA-CSP to predict early stage from late stages of IDC based on input RNA-seq gene expression profiles. The analysis conducted by us also enables deeper insights into the stage-dependent molecular events accompanying IDC progression. The server is publicly available at http://bioinfo.icgeb.res.in/duct-BRCA-CSP. Nature Publishing Group UK 2020-03-05 /pmc/articles/PMC7057992/ /pubmed/32139710 http://dx.doi.org/10.1038/s41598-020-60740-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Roy, Shikha
Kumar, Rakesh
Mittal, Vaibhav
Gupta, Dinesh
Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title_full Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title_fullStr Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title_full_unstemmed Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title_short Classification models for Invasive Ductal Carcinoma Progression, based on gene expression data-trained supervised machine learning
title_sort classification models for invasive ductal carcinoma progression, based on gene expression data-trained supervised machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7057992/
https://www.ncbi.nlm.nih.gov/pubmed/32139710
http://dx.doi.org/10.1038/s41598-020-60740-w
work_keys_str_mv AT royshikha classificationmodelsforinvasiveductalcarcinomaprogressionbasedongeneexpressiondatatrainedsupervisedmachinelearning
AT kumarrakesh classificationmodelsforinvasiveductalcarcinomaprogressionbasedongeneexpressiondatatrainedsupervisedmachinelearning
AT mittalvaibhav classificationmodelsforinvasiveductalcarcinomaprogressionbasedongeneexpressiondatatrainedsupervisedmachinelearning
AT guptadinesh classificationmodelsforinvasiveductalcarcinomaprogressionbasedongeneexpressiondatatrainedsupervisedmachinelearning