Cargando…

Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset

BACKGROUND: Multi-gene lists and single sample predictor models have been currently used to reduce the multidimensional complexity of breast cancers, and to identify intrinsic subtypes. The perceived inability of some models to deal with the challenges of processing high-dimensional data, however, l...

Descripción completa

Detalles Bibliográficos
Autores principales: Milioli, Heloisa H., Vimieiro, Renato, Tishchenko, Inna, Riveros, Carlos, Berretta, Regina, Moscato, Pablo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4712506/
https://www.ncbi.nlm.nih.gov/pubmed/26770261
http://dx.doi.org/10.1186/s13040-015-0078-9
_version_ 1782410076963733504
author Milioli, Heloisa H.
Vimieiro, Renato
Tishchenko, Inna
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
author_facet Milioli, Heloisa H.
Vimieiro, Renato
Tishchenko, Inna
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
author_sort Milioli, Heloisa H.
collection PubMed
description BACKGROUND: Multi-gene lists and single sample predictor models have been currently used to reduce the multidimensional complexity of breast cancers, and to identify intrinsic subtypes. The perceived inability of some models to deal with the challenges of processing high-dimensional data, however, limits the accurate characterisation of these subtypes. Towards the development of robust strategies, we designed an iterative approach to consistently discriminate intrinsic subtypes and improve class prediction in the METABRIC dataset. FINDINGS: In this study, we employed the CM1 score to identify the most discriminative probes for each group, and an ensemble learning technique to assess the ability of these probes on assigning subtype labels using 24 different classifiers. Our analysis is comprised of an iterative computation of these methods and statistical measures performed on a set of over 2000 samples. The refined labels assigned using this iterative approach revealed to be more consistent and in better agreement with clinicopathological markers and patients’ overall survival than those originally provided by the PAM50 method. CONCLUSIONS: The assignment of intrinsic subtypes has a significant impact in translational research for both understanding and managing breast cancer. The refined labelling, therefore, provides more accurate and reliable information by improving the source of fundamental science prior to clinical applications in medicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0078-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4712506
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47125062016-01-15 Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset Milioli, Heloisa H. Vimieiro, Renato Tishchenko, Inna Riveros, Carlos Berretta, Regina Moscato, Pablo BioData Min Short Report BACKGROUND: Multi-gene lists and single sample predictor models have been currently used to reduce the multidimensional complexity of breast cancers, and to identify intrinsic subtypes. The perceived inability of some models to deal with the challenges of processing high-dimensional data, however, limits the accurate characterisation of these subtypes. Towards the development of robust strategies, we designed an iterative approach to consistently discriminate intrinsic subtypes and improve class prediction in the METABRIC dataset. FINDINGS: In this study, we employed the CM1 score to identify the most discriminative probes for each group, and an ensemble learning technique to assess the ability of these probes on assigning subtype labels using 24 different classifiers. Our analysis is comprised of an iterative computation of these methods and statistical measures performed on a set of over 2000 samples. The refined labels assigned using this iterative approach revealed to be more consistent and in better agreement with clinicopathological markers and patients’ overall survival than those originally provided by the PAM50 method. CONCLUSIONS: The assignment of intrinsic subtypes has a significant impact in translational research for both understanding and managing breast cancer. The refined labelling, therefore, provides more accurate and reliable information by improving the source of fundamental science prior to clinical applications in medicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0078-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-13 /pmc/articles/PMC4712506/ /pubmed/26770261 http://dx.doi.org/10.1186/s13040-015-0078-9 Text en © Milioli et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Short Report
Milioli, Heloisa H.
Vimieiro, Renato
Tishchenko, Inna
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title_full Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title_fullStr Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title_full_unstemmed Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title_short Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset
title_sort iteratively refining breast cancer intrinsic subtypes in the metabric dataset
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4712506/
https://www.ncbi.nlm.nih.gov/pubmed/26770261
http://dx.doi.org/10.1186/s13040-015-0078-9
work_keys_str_mv AT milioliheloisah iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset
AT vimieirorenato iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset
AT tishchenkoinna iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset
AT riveroscarlos iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset
AT berrettaregina iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset
AT moscatopablo iterativelyrefiningbreastcancerintrinsicsubtypesinthemetabricdataset