Cargando…

A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction

BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Shim, Yongsun, Lee, Munhwan, Kim, Pil-Jong, Kim, Hong-Gee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069794/
https://www.ncbi.nlm.nih.gov/pubmed/35513784
http://dx.doi.org/10.1186/s12859-022-04698-8
_version_ 1784700506708377600
author Shim, Yongsun
Lee, Munhwan
Kim, Pil-Jong
Kim, Hong-Gee
author_facet Shim, Yongsun
Lee, Munhwan
Kim, Pil-Jong
Kim, Hong-Gee
author_sort Shim, Yongsun
collection PubMed
description BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. To reduce the number of combinations, researchers classify whether drug combinations are synergistic through in-silico methods. Since unstructured data, such as biomedical documents, include experimental types, methods, and results, it can be beneficial extracting features from documents to predict anti-cancer drug combination synergy. However, few studies predict anti-cancer drug combination synergy using document-extracted features. RESULTS: We present a novel approach for anti-cancer drug combination synergy prediction using document-based feature extraction. Our approach is divided into two steps. First, we extracted documents containing validated anti-cancer drug combinations and cell lines. Drug and cell line synonyms in the extracted documents were converted into representative words, and the documents were preprocessed by tokenization, lemmatization, and stopword removal. Second, the drug and cell line features were extracted from the preprocessed documents, and training data were constructed by feature concatenation. A prediction model based on deep and machine learning was created using the training data. The use of our features yielded higher results compared to the majority of published studies. CONCLUSIONS: Using our prediction model, researchers can save time and cost on new anti-cancer drug combination discoveries. Additionally, since our feature extraction method does not require structuring of unstructured data, new data can be immediately applied without any data scalability issues.
format Online
Article
Text
id pubmed-9069794
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90697942022-05-05 A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction Shim, Yongsun Lee, Munhwan Kim, Pil-Jong Kim, Hong-Gee BMC Bioinformatics Research BACKGROUND: To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. To reduce the number of combinations, researchers classify whether drug combinations are synergistic through in-silico methods. Since unstructured data, such as biomedical documents, include experimental types, methods, and results, it can be beneficial extracting features from documents to predict anti-cancer drug combination synergy. However, few studies predict anti-cancer drug combination synergy using document-extracted features. RESULTS: We present a novel approach for anti-cancer drug combination synergy prediction using document-based feature extraction. Our approach is divided into two steps. First, we extracted documents containing validated anti-cancer drug combinations and cell lines. Drug and cell line synonyms in the extracted documents were converted into representative words, and the documents were preprocessed by tokenization, lemmatization, and stopword removal. Second, the drug and cell line features were extracted from the preprocessed documents, and training data were constructed by feature concatenation. A prediction model based on deep and machine learning was created using the training data. The use of our features yielded higher results compared to the majority of published studies. CONCLUSIONS: Using our prediction model, researchers can save time and cost on new anti-cancer drug combination discoveries. Additionally, since our feature extraction method does not require structuring of unstructured data, new data can be immediately applied without any data scalability issues. BioMed Central 2022-05-05 /pmc/articles/PMC9069794/ /pubmed/35513784 http://dx.doi.org/10.1186/s12859-022-04698-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shim, Yongsun
Lee, Munhwan
Kim, Pil-Jong
Kim, Hong-Gee
A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title_full A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title_fullStr A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title_full_unstemmed A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title_short A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
title_sort novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069794/
https://www.ncbi.nlm.nih.gov/pubmed/35513784
http://dx.doi.org/10.1186/s12859-022-04698-8
work_keys_str_mv AT shimyongsun anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT leemunhwan anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT kimpiljong anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT kimhonggee anovelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT shimyongsun novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT leemunhwan novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT kimpiljong novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction
AT kimhonggee novelapproachtopredictingthesynergyofanticancerdrugcombinationsusingdocumentbasedfeatureextraction