Cargando…
Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression
Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug resp...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korea Genome Organization
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042299/ https://www.ncbi.nlm.nih.gov/pubmed/33840174 http://dx.doi.org/10.5808/gi.20076 |
_version_ | 1783678097270243328 |
---|---|
author | Qiu, Kexin Lee, JoongHo Kim, HanByeol Yoon, Seokhyun Kang, Keunsoo |
author_facet | Qiu, Kexin Lee, JoongHo Kim, HanByeol Yoon, Seokhyun Kang, Keunsoo |
author_sort | Qiu, Kexin |
collection | PubMed |
description | Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC(50) and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC(50) and 0.81 for AUC. We identify common predictor genes for IC(50) and AUC, with which the performance was similar to those with genes separately found for IC(50) and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC(50), 0.7945 for AUC) with 321 predictor genes. |
format | Online Article Text |
id | pubmed-8042299 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Korea Genome Organization |
record_format | MEDLINE/PubMed |
spelling | pubmed-80422992021-04-19 Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression Qiu, Kexin Lee, JoongHo Kim, HanByeol Yoon, Seokhyun Kang, Keunsoo Genomics Inform Original Article Although many models have been proposed to accurately predict the response of drugs in cell lines recent years, understanding the genome related to drug response is also the key for completing oncology precision medicine. In this paper, based on the cancer cell line gene expression and the drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first performed pre-selection of genes based on the Pearson correlation coefficient and then used ElasticNet regression model for drug response prediction and fine gene selection. To find more reliable set of predictor genes, we performed regression twice for each drug, one with IC(50) and the other with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of Pearson correlation coefficient exceeded 0.6 and the highest one was 17-AAG for which Pearson correlation coefficient was 0.811 for IC(50) and 0.81 for AUC. We identify common predictor genes for IC(50) and AUC, with which the performance was similar to those with genes separately found for IC(50) and AUC, but with much smaller number of predictor genes. By using only common predictor genes, the highest performance was AZD6244 (0.8016 for IC(50), 0.7945 for AUC) with 321 predictor genes. Korea Genome Organization 2021-03-26 /pmc/articles/PMC8042299/ /pubmed/33840174 http://dx.doi.org/10.5808/gi.20076 Text en (c) 2021, Korea Genome Organization https://creativecommons.org/licenses/by/4.0/(CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Qiu, Kexin Lee, JoongHo Kim, HanByeol Yoon, Seokhyun Kang, Keunsoo Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title | Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title_full | Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title_fullStr | Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title_full_unstemmed | Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title_short | Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
title_sort | machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042299/ https://www.ncbi.nlm.nih.gov/pubmed/33840174 http://dx.doi.org/10.5808/gi.20076 |
work_keys_str_mv | AT qiukexin machinelearningbasedanticancerdrugresponsepredictionandsearchforpredictorgenesusingcancercelllinegeneexpression AT leejoongho machinelearningbasedanticancerdrugresponsepredictionandsearchforpredictorgenesusingcancercelllinegeneexpression AT kimhanbyeol machinelearningbasedanticancerdrugresponsepredictionandsearchforpredictorgenesusingcancercelllinegeneexpression AT yoonseokhyun machinelearningbasedanticancerdrugresponsepredictionandsearchforpredictorgenesusingcancercelllinegeneexpression AT kangkeunsoo machinelearningbasedanticancerdrugresponsepredictionandsearchforpredictorgenesusingcancercelllinegeneexpression |