Cargando…
A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 maj...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042689/ https://www.ncbi.nlm.nih.gov/pubmed/33845831 http://dx.doi.org/10.1186/s12915-021-01002-7 |
_version_ | 1783678168683511808 |
---|---|
author | Villemin, Jean-Philippe Lorenzi, Claudio Cabrillac, Marie-Sarah Oldfield, Andrew Ritchie, William Luco, Reini F. |
author_facet | Villemin, Jean-Philippe Lorenzi, Claudio Cabrillac, Marie-Sarah Oldfield, Andrew Ritchie, William Luco, Reini F. |
author_sort | Villemin, Jean-Philippe |
collection | PubMed |
description | BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 major types with different prognosis and sensitivity to specific treatments. Unfortunately, these profiles have failed to subclassify breast tumours into more subtypes to improve diagnostics and survival rate. Alternative splicing is emerging as a new source of highly specific biomarkers to classify tumours in different grades. Taking advantage of extensive public transcriptomics datasets in breast cancer cell lines (CCLE) and breast cancer tumours (TCGA), we have addressed the capacity of alternative splice variants to subclassify highly aggressive breast cancers. RESULTS: Transcriptomics analysis of alternative splicing events between luminal, basal A and basal B breast cancer cell lines identified a unique splicing signature for a subtype of tumours, the basal B, whose classification is not in use in the clinic yet. Basal B cell lines, in contrast with luminal and basal A, are highly metastatic and express epithelial-to-mesenchymal (EMT) markers, which are hallmarks of cell invasion and resistance to drugs. By developing a semi-supervised machine learning approach, we transferred the molecular knowledge gained from these cell lines into patients to subclassify basal-like triple negative tumours into basal A- and basal B-like categories. Changes in splicing of 25 alternative exons, intimately related to EMT and cell invasion such as ENAH, CD44 and CTNND1, were sufficient to identify the basal-like patients with the worst prognosis. Moreover, patients expressing this basal B-specific splicing signature also expressed newly identified biomarkers of metastasis-initiating cells, like CD36, supporting a more invasive phenotype for this basal B-like breast cancer subtype. CONCLUSIONS: Using a novel machine learning approach, we have identified an EMT-related splicing signature capable of subclassifying the most aggressive type of breast cancer, which are basal-like triple negative tumours. This proof-of-concept demonstrates that the biological knowledge acquired from cell lines can be transferred to patients data for further clinical investigation. More studies, particularly in 3D culture and organoids, will increase the accuracy of this transfer of knowledge, which will open new perspectives into the development of novel therapeutic strategies and the further identification of specific biomarkers for drug resistance and cancer relapse. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01002-7. |
format | Online Article Text |
id | pubmed-8042689 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80426892021-04-14 A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants Villemin, Jean-Philippe Lorenzi, Claudio Cabrillac, Marie-Sarah Oldfield, Andrew Ritchie, William Luco, Reini F. BMC Biol Research Article BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 major types with different prognosis and sensitivity to specific treatments. Unfortunately, these profiles have failed to subclassify breast tumours into more subtypes to improve diagnostics and survival rate. Alternative splicing is emerging as a new source of highly specific biomarkers to classify tumours in different grades. Taking advantage of extensive public transcriptomics datasets in breast cancer cell lines (CCLE) and breast cancer tumours (TCGA), we have addressed the capacity of alternative splice variants to subclassify highly aggressive breast cancers. RESULTS: Transcriptomics analysis of alternative splicing events between luminal, basal A and basal B breast cancer cell lines identified a unique splicing signature for a subtype of tumours, the basal B, whose classification is not in use in the clinic yet. Basal B cell lines, in contrast with luminal and basal A, are highly metastatic and express epithelial-to-mesenchymal (EMT) markers, which are hallmarks of cell invasion and resistance to drugs. By developing a semi-supervised machine learning approach, we transferred the molecular knowledge gained from these cell lines into patients to subclassify basal-like triple negative tumours into basal A- and basal B-like categories. Changes in splicing of 25 alternative exons, intimately related to EMT and cell invasion such as ENAH, CD44 and CTNND1, were sufficient to identify the basal-like patients with the worst prognosis. Moreover, patients expressing this basal B-specific splicing signature also expressed newly identified biomarkers of metastasis-initiating cells, like CD36, supporting a more invasive phenotype for this basal B-like breast cancer subtype. CONCLUSIONS: Using a novel machine learning approach, we have identified an EMT-related splicing signature capable of subclassifying the most aggressive type of breast cancer, which are basal-like triple negative tumours. This proof-of-concept demonstrates that the biological knowledge acquired from cell lines can be transferred to patients data for further clinical investigation. More studies, particularly in 3D culture and organoids, will increase the accuracy of this transfer of knowledge, which will open new perspectives into the development of novel therapeutic strategies and the further identification of specific biomarkers for drug resistance and cancer relapse. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01002-7. BioMed Central 2021-04-12 /pmc/articles/PMC8042689/ /pubmed/33845831 http://dx.doi.org/10.1186/s12915-021-01002-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Villemin, Jean-Philippe Lorenzi, Claudio Cabrillac, Marie-Sarah Oldfield, Andrew Ritchie, William Luco, Reini F. A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title | A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title_full | A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title_fullStr | A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title_full_unstemmed | A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title_short | A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
title_sort | cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042689/ https://www.ncbi.nlm.nih.gov/pubmed/33845831 http://dx.doi.org/10.1186/s12915-021-01002-7 |
work_keys_str_mv | AT villeminjeanphilippe acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT lorenziclaudio acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT cabrillacmariesarah acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT oldfieldandrew acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT ritchiewilliam acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT lucoreinif acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT villeminjeanphilippe celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT lorenziclaudio celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT cabrillacmariesarah celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT oldfieldandrew celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT ritchiewilliam celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants AT lucoreinif celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants |