Cargando…

Breast cancer subtype predictors revisited: from consensus to concordance?

BACKGROUND: At the molecular level breast cancer comprises a heterogeneous set of subtypes associated with clear differences in gene expression and clinical outcomes. Single sample predictors (SSPs) are built via a two-stage approach consisting of clustering and subtype predictor construction based...

Descripción completa

Detalles Bibliográficos
Autores principales:	MJ. Sontrop, Herman, JT. Reinders, Marcel, D. Moerland, Perry
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4893290/ https://www.ncbi.nlm.nih.gov/pubmed/27259591 http://dx.doi.org/10.1186/s12920-016-0185-6

_version_	1782435529021718528
author	MJ. Sontrop, Herman JT. Reinders, Marcel D. Moerland, Perry
author_facet	MJ. Sontrop, Herman JT. Reinders, Marcel D. Moerland, Perry
author_sort	MJ. Sontrop, Herman
collection	PubMed
description	BACKGROUND: At the molecular level breast cancer comprises a heterogeneous set of subtypes associated with clear differences in gene expression and clinical outcomes. Single sample predictors (SSPs) are built via a two-stage approach consisting of clustering and subtype predictor construction based on the cluster labels of individual cases. SSPs have been criticized because their subtype assignments for the same samples were only moderately concordant (Cohen’s κ<0.6). METHODS: We propose a semi-supervised approach where for five datasets, consensus sets were constructed consisting of those samples that were concordantly subtyped by a number of different predictors. Next, nine subtype predictors - three SSPs, three subtype classification models (SCMs) and three novel rule-based predictors based on the St. Gallen surrogate intrinsic subtype definitions (STGs) - were constructed on the five consensus sets and their associated consensus subtype labels. The predictors were validated on a compendium of over 4,000 uniformly preprocessed Affymetrix microarrays. Concordance between subtype predictors was assessed using Cohen’s kappa statistic. RESULTS: In this standardized setup, subtype predictors of the same type (either SCM, SSP, or STG) but with a different gene list and/or consensus training set were associated with almost perfect levels of agreement (median κ>0.8). Interestingly, for a given predictor type a change in consensus set led to higher concordance than a change to another gene list. The more challenging scenario where the predictor type, gene list and training set were all different resulted in predictors with only substantial levels of concordance (median κ=0.74) on independent validation data. CONCLUSIONS: Our results demonstrate that for a given subtype predictor type stringent standardization of the preprocessing stage, combined with carefully devised consensus training sets, leads to predictors that show almost perfect levels of concordance. However, predictors of a different type are only substantially concordant, despite reaching almost perfect levels of concordance on training data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0185-6) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4893290
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-48932902016-06-05 Breast cancer subtype predictors revisited: from consensus to concordance? MJ. Sontrop, Herman JT. Reinders, Marcel D. Moerland, Perry BMC Med Genomics Research Article BACKGROUND: At the molecular level breast cancer comprises a heterogeneous set of subtypes associated with clear differences in gene expression and clinical outcomes. Single sample predictors (SSPs) are built via a two-stage approach consisting of clustering and subtype predictor construction based on the cluster labels of individual cases. SSPs have been criticized because their subtype assignments for the same samples were only moderately concordant (Cohen’s κ<0.6). METHODS: We propose a semi-supervised approach where for five datasets, consensus sets were constructed consisting of those samples that were concordantly subtyped by a number of different predictors. Next, nine subtype predictors - three SSPs, three subtype classification models (SCMs) and three novel rule-based predictors based on the St. Gallen surrogate intrinsic subtype definitions (STGs) - were constructed on the five consensus sets and their associated consensus subtype labels. The predictors were validated on a compendium of over 4,000 uniformly preprocessed Affymetrix microarrays. Concordance between subtype predictors was assessed using Cohen’s kappa statistic. RESULTS: In this standardized setup, subtype predictors of the same type (either SCM, SSP, or STG) but with a different gene list and/or consensus training set were associated with almost perfect levels of agreement (median κ>0.8). Interestingly, for a given predictor type a change in consensus set led to higher concordance than a change to another gene list. The more challenging scenario where the predictor type, gene list and training set were all different resulted in predictors with only substantial levels of concordance (median κ=0.74) on independent validation data. CONCLUSIONS: Our results demonstrate that for a given subtype predictor type stringent standardization of the preprocessing stage, combined with carefully devised consensus training sets, leads to predictors that show almost perfect levels of concordance. However, predictors of a different type are only substantially concordant, despite reaching almost perfect levels of concordance on training data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0185-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-03 /pmc/articles/PMC4893290/ /pubmed/27259591 http://dx.doi.org/10.1186/s12920-016-0185-6 Text en © Sontrop et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article MJ. Sontrop, Herman JT. Reinders, Marcel D. Moerland, Perry Breast cancer subtype predictors revisited: from consensus to concordance?
title	Breast cancer subtype predictors revisited: from consensus to concordance?
title_full	Breast cancer subtype predictors revisited: from consensus to concordance?
title_fullStr	Breast cancer subtype predictors revisited: from consensus to concordance?
title_full_unstemmed	Breast cancer subtype predictors revisited: from consensus to concordance?
title_short	Breast cancer subtype predictors revisited: from consensus to concordance?
title_sort	breast cancer subtype predictors revisited: from consensus to concordance?
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4893290/ https://www.ncbi.nlm.nih.gov/pubmed/27259591 http://dx.doi.org/10.1186/s12920-016-0185-6
work_keys_str_mv	AT mjsontropherman breastcancersubtypepredictorsrevisitedfromconsensustoconcordance AT jtreindersmarcel breastcancersubtypepredictorsrevisitedfromconsensustoconcordance AT dmoerlandperry breastcancersubtypepredictorsrevisitedfromconsensustoconcordance

Breast cancer subtype predictors revisited: from consensus to concordance?

Ejemplares similares