Cargando…

A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants

BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 maj...

Descripción completa

Detalles Bibliográficos
Autores principales: Villemin, Jean-Philippe, Lorenzi, Claudio, Cabrillac, Marie-Sarah, Oldfield, Andrew, Ritchie, William, Luco, Reini F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042689/
https://www.ncbi.nlm.nih.gov/pubmed/33845831
http://dx.doi.org/10.1186/s12915-021-01002-7
_version_ 1783678168683511808
author Villemin, Jean-Philippe
Lorenzi, Claudio
Cabrillac, Marie-Sarah
Oldfield, Andrew
Ritchie, William
Luco, Reini F.
author_facet Villemin, Jean-Philippe
Lorenzi, Claudio
Cabrillac, Marie-Sarah
Oldfield, Andrew
Ritchie, William
Luco, Reini F.
author_sort Villemin, Jean-Philippe
collection PubMed
description BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 major types with different prognosis and sensitivity to specific treatments. Unfortunately, these profiles have failed to subclassify breast tumours into more subtypes to improve diagnostics and survival rate. Alternative splicing is emerging as a new source of highly specific biomarkers to classify tumours in different grades. Taking advantage of extensive public transcriptomics datasets in breast cancer cell lines (CCLE) and breast cancer tumours (TCGA), we have addressed the capacity of alternative splice variants to subclassify highly aggressive breast cancers. RESULTS: Transcriptomics analysis of alternative splicing events between luminal, basal A and basal B breast cancer cell lines identified a unique splicing signature for a subtype of tumours, the basal B, whose classification is not in use in the clinic yet. Basal B cell lines, in contrast with luminal and basal A, are highly metastatic and express epithelial-to-mesenchymal (EMT) markers, which are hallmarks of cell invasion and resistance to drugs. By developing a semi-supervised machine learning approach, we transferred the molecular knowledge gained from these cell lines into patients to subclassify basal-like triple negative tumours into basal A- and basal B-like categories. Changes in splicing of 25 alternative exons, intimately related to EMT and cell invasion such as ENAH, CD44 and CTNND1, were sufficient to identify the basal-like patients with the worst prognosis. Moreover, patients expressing this basal B-specific splicing signature also expressed newly identified biomarkers of metastasis-initiating cells, like CD36, supporting a more invasive phenotype for this basal B-like breast cancer subtype. CONCLUSIONS: Using a novel machine learning approach, we have identified an EMT-related splicing signature capable of subclassifying the most aggressive type of breast cancer, which are basal-like triple negative tumours. This proof-of-concept demonstrates that the biological knowledge acquired from cell lines can be transferred to patients data for further clinical investigation. More studies, particularly in 3D culture and organoids, will increase the accuracy of this transfer of knowledge, which will open new perspectives into the development of novel therapeutic strategies and the further identification of specific biomarkers for drug resistance and cancer relapse. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01002-7.
format Online
Article
Text
id pubmed-8042689
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80426892021-04-14 A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants Villemin, Jean-Philippe Lorenzi, Claudio Cabrillac, Marie-Sarah Oldfield, Andrew Ritchie, William Luco, Reini F. BMC Biol Research Article BACKGROUND: Breast cancer is amongst the 10 first causes of death in women worldwide. Around 20% of patients are misdiagnosed leading to early metastasis, resistance to treatment and relapse. Many clinical and gene expression profiles have been successfully used to classify breast tumours into 5 major types with different prognosis and sensitivity to specific treatments. Unfortunately, these profiles have failed to subclassify breast tumours into more subtypes to improve diagnostics and survival rate. Alternative splicing is emerging as a new source of highly specific biomarkers to classify tumours in different grades. Taking advantage of extensive public transcriptomics datasets in breast cancer cell lines (CCLE) and breast cancer tumours (TCGA), we have addressed the capacity of alternative splice variants to subclassify highly aggressive breast cancers. RESULTS: Transcriptomics analysis of alternative splicing events between luminal, basal A and basal B breast cancer cell lines identified a unique splicing signature for a subtype of tumours, the basal B, whose classification is not in use in the clinic yet. Basal B cell lines, in contrast with luminal and basal A, are highly metastatic and express epithelial-to-mesenchymal (EMT) markers, which are hallmarks of cell invasion and resistance to drugs. By developing a semi-supervised machine learning approach, we transferred the molecular knowledge gained from these cell lines into patients to subclassify basal-like triple negative tumours into basal A- and basal B-like categories. Changes in splicing of 25 alternative exons, intimately related to EMT and cell invasion such as ENAH, CD44 and CTNND1, were sufficient to identify the basal-like patients with the worst prognosis. Moreover, patients expressing this basal B-specific splicing signature also expressed newly identified biomarkers of metastasis-initiating cells, like CD36, supporting a more invasive phenotype for this basal B-like breast cancer subtype. CONCLUSIONS: Using a novel machine learning approach, we have identified an EMT-related splicing signature capable of subclassifying the most aggressive type of breast cancer, which are basal-like triple negative tumours. This proof-of-concept demonstrates that the biological knowledge acquired from cell lines can be transferred to patients data for further clinical investigation. More studies, particularly in 3D culture and organoids, will increase the accuracy of this transfer of knowledge, which will open new perspectives into the development of novel therapeutic strategies and the further identification of specific biomarkers for drug resistance and cancer relapse. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01002-7. BioMed Central 2021-04-12 /pmc/articles/PMC8042689/ /pubmed/33845831 http://dx.doi.org/10.1186/s12915-021-01002-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Villemin, Jean-Philippe
Lorenzi, Claudio
Cabrillac, Marie-Sarah
Oldfield, Andrew
Ritchie, William
Luco, Reini F.
A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title_full A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title_fullStr A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title_full_unstemmed A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title_short A cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
title_sort cell-to-patient machine learning transfer approach uncovers novel basal-like breast cancer prognostic markers amongst alternative splice variants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042689/
https://www.ncbi.nlm.nih.gov/pubmed/33845831
http://dx.doi.org/10.1186/s12915-021-01002-7
work_keys_str_mv AT villeminjeanphilippe acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT lorenziclaudio acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT cabrillacmariesarah acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT oldfieldandrew acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT ritchiewilliam acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT lucoreinif acelltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT villeminjeanphilippe celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT lorenziclaudio celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT cabrillacmariesarah celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT oldfieldandrew celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT ritchiewilliam celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants
AT lucoreinif celltopatientmachinelearningtransferapproachuncoversnovelbasallikebreastcancerprognosticmarkersamongstalternativesplicevariants