Cargando…

BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome

INTRODUCTION: Breast cancer is a complex heterogeneous disease for which a substantial resource of transcriptomic data is available. Gene expression data have facilitated the division of breast cancer into, at least, five molecular subtypes, namely luminal A, luminal B, HER2, normal-like and basal....

Descripción completa

Detalles Bibliográficos
Autores principales: Madden, Stephen F, Clarke, Colin, Gaule, Patricia, Aherne, Sinead T, O'Donovan, Norma, Clynes, Martin, Crown, John, Gallagher, William M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978487/
https://www.ncbi.nlm.nih.gov/pubmed/23820017
http://dx.doi.org/10.1186/bcr3444
_version_ 1782310575419686912
author Madden, Stephen F
Clarke, Colin
Gaule, Patricia
Aherne, Sinead T
O'Donovan, Norma
Clynes, Martin
Crown, John
Gallagher, William M
author_facet Madden, Stephen F
Clarke, Colin
Gaule, Patricia
Aherne, Sinead T
O'Donovan, Norma
Clynes, Martin
Crown, John
Gallagher, William M
author_sort Madden, Stephen F
collection PubMed
description INTRODUCTION: Breast cancer is a complex heterogeneous disease for which a substantial resource of transcriptomic data is available. Gene expression data have facilitated the division of breast cancer into, at least, five molecular subtypes, namely luminal A, luminal B, HER2, normal-like and basal. Once identified, breast cancer subtypes can inform clinical decisions surrounding patient treatment and prognosis. Indeed, it is important to identify patients at risk of developing aggressive disease so as to tailor the level of clinical intervention. METHODS: We have developed a user-friendly, web-based system to allow the evaluation of genes/microRNAs (miRNAs) that are significantly associated with survival in breast cancer and its molecular subtypes. The algorithm combines gene expression data from multiple microarray experiments which frequently also contain miRNA expression information, and detailed clinical data to correlate outcome with gene/miRNA expression levels. This algorithm integrates gene expression and survival data from 26 datasets on 12 different microarray platforms corresponding to approximately 17,000 genes in up to 4,738 samples. In addition, the prognostic potential of 341 miRNAs can be analysed. RESULTS: We demonstrated the robustness of our approach in comparison to two commercially available prognostic tests, oncotype DX and MammaPrint. Our algorithm complements these prognostic tests and is consistent with their findings. In addition, BreastMark can act as a powerful reductionist approach to these more complex gene signatures, eliminating superfluous genes, potentially reducing the cost and complexity of these multi-index assays. Known miRNA prognostic markers, mir-205 and mir-93, were used to confirm the prognostic value of this tool in a miRNA setting. We also applied the algorithm to examine expression of 58 receptor tyrosine kinases in the basal-like subtype, identifying six receptor tyrosine kinases associated with poor disease-free survival and/or overall survival (EPHA5, FGFR1, FGFR3, VEGFR1, PDGFRβ, and TIE1). A web application for using this algorithm is currently available. CONCLUSIONS: BreastMark is a powerful tool for examining putative gene/miRNA prognostic markers in breast cancer. The value of this tool will be in the preliminary assessment of putative biomarkers in breast cancer. It will be of particular use to research groups with limited bioinformatics facilities.
format Online
Article
Text
id pubmed-3978487
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39784872014-04-08 BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome Madden, Stephen F Clarke, Colin Gaule, Patricia Aherne, Sinead T O'Donovan, Norma Clynes, Martin Crown, John Gallagher, William M Breast Cancer Res Research Article INTRODUCTION: Breast cancer is a complex heterogeneous disease for which a substantial resource of transcriptomic data is available. Gene expression data have facilitated the division of breast cancer into, at least, five molecular subtypes, namely luminal A, luminal B, HER2, normal-like and basal. Once identified, breast cancer subtypes can inform clinical decisions surrounding patient treatment and prognosis. Indeed, it is important to identify patients at risk of developing aggressive disease so as to tailor the level of clinical intervention. METHODS: We have developed a user-friendly, web-based system to allow the evaluation of genes/microRNAs (miRNAs) that are significantly associated with survival in breast cancer and its molecular subtypes. The algorithm combines gene expression data from multiple microarray experiments which frequently also contain miRNA expression information, and detailed clinical data to correlate outcome with gene/miRNA expression levels. This algorithm integrates gene expression and survival data from 26 datasets on 12 different microarray platforms corresponding to approximately 17,000 genes in up to 4,738 samples. In addition, the prognostic potential of 341 miRNAs can be analysed. RESULTS: We demonstrated the robustness of our approach in comparison to two commercially available prognostic tests, oncotype DX and MammaPrint. Our algorithm complements these prognostic tests and is consistent with their findings. In addition, BreastMark can act as a powerful reductionist approach to these more complex gene signatures, eliminating superfluous genes, potentially reducing the cost and complexity of these multi-index assays. Known miRNA prognostic markers, mir-205 and mir-93, were used to confirm the prognostic value of this tool in a miRNA setting. We also applied the algorithm to examine expression of 58 receptor tyrosine kinases in the basal-like subtype, identifying six receptor tyrosine kinases associated with poor disease-free survival and/or overall survival (EPHA5, FGFR1, FGFR3, VEGFR1, PDGFRβ, and TIE1). A web application for using this algorithm is currently available. CONCLUSIONS: BreastMark is a powerful tool for examining putative gene/miRNA prognostic markers in breast cancer. The value of this tool will be in the preliminary assessment of putative biomarkers in breast cancer. It will be of particular use to research groups with limited bioinformatics facilities. BioMed Central 2013 2013-07-02 /pmc/articles/PMC3978487/ /pubmed/23820017 http://dx.doi.org/10.1186/bcr3444 Text en Copyright © 2013 Madden et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Madden, Stephen F
Clarke, Colin
Gaule, Patricia
Aherne, Sinead T
O'Donovan, Norma
Clynes, Martin
Crown, John
Gallagher, William M
BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title_full BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title_fullStr BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title_full_unstemmed BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title_short BreastMark: An Integrated Approach to Mining Publicly Available Transcriptomic Datasets Relating to Breast Cancer Outcome
title_sort breastmark: an integrated approach to mining publicly available transcriptomic datasets relating to breast cancer outcome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978487/
https://www.ncbi.nlm.nih.gov/pubmed/23820017
http://dx.doi.org/10.1186/bcr3444
work_keys_str_mv AT maddenstephenf breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT clarkecolin breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT gaulepatricia breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT ahernesineadt breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT odonovannorma breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT clynesmartin breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT crownjohn breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome
AT gallagherwilliamm breastmarkanintegratedapproachtominingpubliclyavailabletranscriptomicdatasetsrelatingtobreastcanceroutcome