Cargando…
A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. T...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069794/ https://www.ncbi.nlm.nih.gov/pubmed/35513784 http://dx.doi.org/10.1186/s12859-022-04698-8 |
_version_ | 1784700506708377600 |
---|---|
author | Shim, Yongsun Lee, Munhwan Kim, Pil-Jong Kim, Hong-Gee |
author_facet | Shim, Yongsun Lee, Munhwan Kim, Pil-Jong Kim, Hong-Gee |
author_sort | Shim, Yongsun |
collection | PubMed |
description | BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. To reduce the number of combinations, researchers classify whether drug combinations are synergistic through in-silico methods. Since unstructured data, such as biomedical documents, include experimental types, methods, and results, it can be beneficial extracting features from documents to predict anti-cancer drug combination synergy. However, few studies predict anti-cancer drug combination synergy using document-extracted features. RESULTS: We present a novel approach for anti-cancer drug combination synergy prediction using document-based feature extraction. Our approach is divided into two steps. First, we extracted documents containing validated anti-cancer drug combinations and cell lines. Drug and cell line synonyms in the extracted documents were converted into representative words, and the documents were preprocessed by tokenization, lemmatization, and stopword removal. Second, the drug and cell line features were extracted from the preprocessed documents, and training data were constructed by feature concatenation. A prediction model based on deep and machine learning was created using the training data. The use of our features yielded higher results compared to the majority of published studies. CONCLUSIONS: Using our prediction model, researchers can save time and cost on new anti-cancer drug combination discoveries. Additionally, since our feature extraction method does not require structuring of unstructured data, new data can be immediately applied without any data scalability issues. |
format | Online Article Text |
id | pubmed-9069794 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-90697942022-05-05 A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction Shim, Yongsun Lee, Munhwan Kim, Pil-Jong Kim, Hong-Gee BMC Bioinformatics Research BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. To reduce the number of combinations, researchers classify whether drug combinations are synergistic through in-silico methods. Since unstructured data, such as biomedical documents, include experimental types, methods, and results, it can be beneficial extracting features from documents to predict anti-cancer drug combination synergy. However, few studies predict anti-cancer drug combination synergy using document-extracted features. RESULTS: We present a novel approach for anti-cancer drug combination synergy prediction using document-based feature extraction. Our approach is divided into two steps. First, we extracted documents containing validated anti-cancer drug combinations and cell lines. Drug and cell line synonyms in the extracted documents were converted into representative words, and the documents were preprocessed by tokenization, lemmatization, and stopword removal. Second, the drug and cell line features were extracted from the preprocessed documents, and training data were constructed by feature concatenation. A prediction model based on deep and machine learning was created using the training data. The use of our features yielded higher results compared to the majority of published studies. CONCLUSIONS: Using our prediction model, researchers can save time and cost on new anti-cancer drug combination discoveries. Additionally, since our feature extraction method does not require structuring of unstructured data, new data can be immediately applied without any data scalability issues. BioMed Central 2022-05-05 /pmc/articles/PMC9069794/ /pubmed/35513784 http://dx.doi.org/10.1186/s12859-022-04698-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Shim, Yongsun Lee, Munhwan Kim, Pil-Jong Kim, Hong-Gee A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title | A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title_full | A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title_fullStr | A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title_full_unstemmed | A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title_short | A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
title_sort | novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069794/ https://www.ncbi.nlm.nih.gov/pubmed/35513784 http://dx.doi.org/10.1186/s12859-022-04698-8 |
work_keys_str_mv | AT shimyongsun anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT leemunhwan anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT kimpiljong anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT kimhonggee anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT shimyongsun novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT leemunhwan novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT kimpiljong novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction AT kimhonggee novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction |