Cargando…

DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types

BACKGROUND: Breast cancer is a collection of multiple tissue pathologies, each with a distinct molecular signature that correlates with patient prognosis and response to therapy. Accurately differentiating between breast cancer sub-types is an important part of clinical decision-making. Although thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Beykikhoshk, Adham, Quinn, Thomas P., Lee, Samuel C., Tran, Truyen, Venkatesh, Svetha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038647/
https://www.ncbi.nlm.nih.gov/pubmed/32093737
http://dx.doi.org/10.1186/s12920-020-0658-5
_version_ 1783500685005815808
author Beykikhoshk, Adham
Quinn, Thomas P.
Lee, Samuel C.
Tran, Truyen
Venkatesh, Svetha
author_facet Beykikhoshk, Adham
Quinn, Thomas P.
Lee, Samuel C.
Tran, Truyen
Venkatesh, Svetha
author_sort Beykikhoshk, Adham
collection PubMed
description BACKGROUND: Breast cancer is a collection of multiple tissue pathologies, each with a distinct molecular signature that correlates with patient prognosis and response to therapy. Accurately differentiating between breast cancer sub-types is an important part of clinical decision-making. Although this problem has been addressed using machine learning methods in the past, there remains unexplained heterogeneity within the established sub-types that cannot be resolved by the commonly used classification algorithms. METHODS: In this paper, we propose a novel deep learning architecture, called DeepTRIAGE (Deep learning for the TRactable Individualised Analysis of Gene Expression), which uses an attention mechanism to obtain personalised biomarker scores that describe how important each gene is in predicting the cancer sub-type for each sample. We then perform a principal component analysis of these biomarker scores to visualise the sample heterogeneity, and use a linear model to test whether the major principal axes associate with known clinical phenotypes. RESULTS: Our model not only classifies cancer sub-types with good accuracy, but simultaneously assigns each patient their own set of interpretable and individualised biomarker scores. These personalised scores describe how important each feature is in the classification of any patient, and can be analysed post-hoc to generate new hypotheses about latent heterogeneity. CONCLUSIONS: We apply the DeepTRIAGE framework to classify the gene expression signatures of luminal A and luminal B breast cancer sub-types, and illustrate its use for genes as well as the GO and KEGG gene sets. Using DeepTRIAGE, we calculate personalised biomarker scores that describe the most important features for classifying an individual patient as luminal A or luminal B. In doing so, DeepTRIAGE simultaneously reveals heterogeneity within the luminal A biomarker scores that significantly associate with tumour stage, placing all luminal samples along a continuum of severity.
format Online
Article
Text
id pubmed-7038647
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70386472020-03-02 DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types Beykikhoshk, Adham Quinn, Thomas P. Lee, Samuel C. Tran, Truyen Venkatesh, Svetha BMC Med Genomics Research BACKGROUND: Breast cancer is a collection of multiple tissue pathologies, each with a distinct molecular signature that correlates with patient prognosis and response to therapy. Accurately differentiating between breast cancer sub-types is an important part of clinical decision-making. Although this problem has been addressed using machine learning methods in the past, there remains unexplained heterogeneity within the established sub-types that cannot be resolved by the commonly used classification algorithms. METHODS: In this paper, we propose a novel deep learning architecture, called DeepTRIAGE (Deep learning for the TRactable Individualised Analysis of Gene Expression), which uses an attention mechanism to obtain personalised biomarker scores that describe how important each gene is in predicting the cancer sub-type for each sample. We then perform a principal component analysis of these biomarker scores to visualise the sample heterogeneity, and use a linear model to test whether the major principal axes associate with known clinical phenotypes. RESULTS: Our model not only classifies cancer sub-types with good accuracy, but simultaneously assigns each patient their own set of interpretable and individualised biomarker scores. These personalised scores describe how important each feature is in the classification of any patient, and can be analysed post-hoc to generate new hypotheses about latent heterogeneity. CONCLUSIONS: We apply the DeepTRIAGE framework to classify the gene expression signatures of luminal A and luminal B breast cancer sub-types, and illustrate its use for genes as well as the GO and KEGG gene sets. Using DeepTRIAGE, we calculate personalised biomarker scores that describe the most important features for classifying an individual patient as luminal A or luminal B. In doing so, DeepTRIAGE simultaneously reveals heterogeneity within the luminal A biomarker scores that significantly associate with tumour stage, placing all luminal samples along a continuum of severity. BioMed Central 2020-02-24 /pmc/articles/PMC7038647/ /pubmed/32093737 http://dx.doi.org/10.1186/s12920-020-0658-5 Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Beykikhoshk, Adham
Quinn, Thomas P.
Lee, Samuel C.
Tran, Truyen
Venkatesh, Svetha
DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title_full DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title_fullStr DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title_full_unstemmed DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title_short DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
title_sort deeptriage: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038647/
https://www.ncbi.nlm.nih.gov/pubmed/32093737
http://dx.doi.org/10.1186/s12920-020-0658-5
work_keys_str_mv AT beykikhoshkadham deeptriageinterpretableandindividualisedbiomarkerscoresusingattentionmechanismfortheclassificationofbreastcancersubtypes
AT quinnthomasp deeptriageinterpretableandindividualisedbiomarkerscoresusingattentionmechanismfortheclassificationofbreastcancersubtypes
AT leesamuelc deeptriageinterpretableandindividualisedbiomarkerscoresusingattentionmechanismfortheclassificationofbreastcancersubtypes
AT trantruyen deeptriageinterpretableandindividualisedbiomarkerscoresusingattentionmechanismfortheclassificationofbreastcancersubtypes
AT venkateshsvetha deeptriageinterpretableandindividualisedbiomarkerscoresusingattentionmechanismfortheclassificationofbreastcancersubtypes