Cargando…

Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments

BACKGROUND: Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn’t allow sufficient training of ML classifiers that could be used for improving molecular diagnostics. METHODS: We re...

Descripción completa

Detalles Bibliográficos
Autores principales: Borisov, Nicolas, Sorokin, Maxim, Tkachev, Victor, Garazha, Andrew, Buzdin, Anton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7499993/
https://www.ncbi.nlm.nih.gov/pubmed/32948183
http://dx.doi.org/10.1186/s12920-020-00759-0
_version_ 1783583777343143936
author Borisov, Nicolas
Sorokin, Maxim
Tkachev, Victor
Garazha, Andrew
Buzdin, Anton
author_facet Borisov, Nicolas
Sorokin, Maxim
Tkachev, Victor
Garazha, Andrew
Buzdin, Anton
author_sort Borisov, Nicolas
collection PubMed
description BACKGROUND: Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn’t allow sufficient training of ML classifiers that could be used for improving molecular diagnostics. METHODS: We reviewed published datasets of high throughput gene expression profiles corresponding to cancer patients with known responses on chemotherapy treatments. We browsed Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories. RESULTS: We identified data collections suitable to build ML models for predicting responses on certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 till 508 cases per dataset. All the datasets identified were checked for ML applicability and robustness with leave-one-out cross validation. Twenty-three datasets were found suitable for using ML that had balanced numbers of treatment responder and non-responder cases. CONCLUSIONS: We collected a database of gene expression profiles associated with clinical responses on chemotherapy for 2786 individual cancer cases. Among them seven datasets included RNA sequencing data (for 645 cases) and the others – microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide.
format Online
Article
Text
id pubmed-7499993
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74999932020-09-21 Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments Borisov, Nicolas Sorokin, Maxim Tkachev, Victor Garazha, Andrew Buzdin, Anton BMC Med Genomics Research BACKGROUND: Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn’t allow sufficient training of ML classifiers that could be used for improving molecular diagnostics. METHODS: We reviewed published datasets of high throughput gene expression profiles corresponding to cancer patients with known responses on chemotherapy treatments. We browsed Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories. RESULTS: We identified data collections suitable to build ML models for predicting responses on certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 till 508 cases per dataset. All the datasets identified were checked for ML applicability and robustness with leave-one-out cross validation. Twenty-three datasets were found suitable for using ML that had balanced numbers of treatment responder and non-responder cases. CONCLUSIONS: We collected a database of gene expression profiles associated with clinical responses on chemotherapy for 2786 individual cancer cases. Among them seven datasets included RNA sequencing data (for 645 cases) and the others – microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide. BioMed Central 2020-09-18 /pmc/articles/PMC7499993/ /pubmed/32948183 http://dx.doi.org/10.1186/s12920-020-00759-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Borisov, Nicolas
Sorokin, Maxim
Tkachev, Victor
Garazha, Andrew
Buzdin, Anton
Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title_full Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title_fullStr Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title_full_unstemmed Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title_short Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
title_sort cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7499993/
https://www.ncbi.nlm.nih.gov/pubmed/32948183
http://dx.doi.org/10.1186/s12920-020-00759-0
work_keys_str_mv AT borisovnicolas cancergeneexpressionprofilesassociatedwithclinicaloutcomestochemotherapytreatments
AT sorokinmaxim cancergeneexpressionprofilesassociatedwithclinicaloutcomestochemotherapytreatments
AT tkachevvictor cancergeneexpressionprofilesassociatedwithclinicaloutcomestochemotherapytreatments
AT garazhaandrew cancergeneexpressionprofilesassociatedwithclinicaloutcomestochemotherapytreatments
AT buzdinanton cancergeneexpressionprofilesassociatedwithclinicaloutcomestochemotherapytreatments