Cargando…
Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection
In the pursuit of precision medicine for cancer, a promising step is to predict drug response based on data mining, which can provide clinical decision support for cancer patients. Although some machine learning methods for predicting drug response from genomic data already exist, most of them focus...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9932661/ https://www.ncbi.nlm.nih.gov/pubmed/36816042 http://dx.doi.org/10.3389/fgene.2023.1095976 |
_version_ | 1784889502890721280 |
---|---|
author | Cui, Tongtong Wang, Zeyuan Gu, Hong Qin, Pan Wang, Jia |
author_facet | Cui, Tongtong Wang, Zeyuan Gu, Hong Qin, Pan Wang, Jia |
author_sort | Cui, Tongtong |
collection | PubMed |
description | In the pursuit of precision medicine for cancer, a promising step is to predict drug response based on data mining, which can provide clinical decision support for cancer patients. Although some machine learning methods for predicting drug response from genomic data already exist, most of them focus on point prediction, which cannot reveal the distribution of predicted results. In this paper, we propose a three-layer feature selection combined with a gamma distribution based GLM and a two-layer feature selection combined with an ANN. The two regression methods are applied to the Encyclopedia of Cancer Cell Lines (CCLE) and the Cancer Drug Sensitivity Genomics (GDSC) datasets. Using ten-fold cross-validation, our methods achieve higher accuracy on anticancer drug response prediction compared to existing methods, with an R (2) and RMSE of 0.87 and 0.53, respectively. Through data validation, the significance of assessing the reliability of predictions by predicting confidence intervals and its role in personalized medicine are illustrated. The correlation analysis of the genes selected from the three layers of features also shows the effectiveness of our proposed methods. |
format | Online Article Text |
id | pubmed-9932661 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99326612023-02-17 Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection Cui, Tongtong Wang, Zeyuan Gu, Hong Qin, Pan Wang, Jia Front Genet Genetics In the pursuit of precision medicine for cancer, a promising step is to predict drug response based on data mining, which can provide clinical decision support for cancer patients. Although some machine learning methods for predicting drug response from genomic data already exist, most of them focus on point prediction, which cannot reveal the distribution of predicted results. In this paper, we propose a three-layer feature selection combined with a gamma distribution based GLM and a two-layer feature selection combined with an ANN. The two regression methods are applied to the Encyclopedia of Cancer Cell Lines (CCLE) and the Cancer Drug Sensitivity Genomics (GDSC) datasets. Using ten-fold cross-validation, our methods achieve higher accuracy on anticancer drug response prediction compared to existing methods, with an R (2) and RMSE of 0.87 and 0.53, respectively. Through data validation, the significance of assessing the reliability of predictions by predicting confidence intervals and its role in personalized medicine are illustrated. The correlation analysis of the genes selected from the three layers of features also shows the effectiveness of our proposed methods. Frontiers Media S.A. 2023-02-02 /pmc/articles/PMC9932661/ /pubmed/36816042 http://dx.doi.org/10.3389/fgene.2023.1095976 Text en Copyright © 2023 Cui, Wang, Gu, Qin and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Cui, Tongtong Wang, Zeyuan Gu, Hong Qin, Pan Wang, Jia Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title | Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title_full | Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title_fullStr | Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title_full_unstemmed | Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title_short | Gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
title_sort | gamma distribution based predicting model for breast cancer drug response based on multi-layer feature selection |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9932661/ https://www.ncbi.nlm.nih.gov/pubmed/36816042 http://dx.doi.org/10.3389/fgene.2023.1095976 |
work_keys_str_mv | AT cuitongtong gammadistributionbasedpredictingmodelforbreastcancerdrugresponsebasedonmultilayerfeatureselection AT wangzeyuan gammadistributionbasedpredictingmodelforbreastcancerdrugresponsebasedonmultilayerfeatureselection AT guhong gammadistributionbasedpredictingmodelforbreastcancerdrugresponsebasedonmultilayerfeatureselection AT qinpan gammadistributionbasedpredictingmodelforbreastcancerdrugresponsebasedonmultilayerfeatureselection AT wangjia gammadistributionbasedpredictingmodelforbreastcancerdrugresponsebasedonmultilayerfeatureselection |