Cargando…

Molecular subtyping for clinically defined breast cancer subgroups

INTRODUCTION: Breast cancer is commonly classified into intrinsic molecular subtypes. Standard gene centering is routinely done prior to molecular subtyping, but it can produce inaccurate classifications when the distribution of clinicopathological characteristics in the study cohort differs from th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Xi, Rødland, Einar Andreas, Tibshirani, Robert, Plevritis, Sylvia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365540/
https://www.ncbi.nlm.nih.gov/pubmed/25849221
http://dx.doi.org/10.1186/s13058-015-0520-4
_version_ 1782362234877378560
author Zhao, Xi
Rødland, Einar Andreas
Tibshirani, Robert
Plevritis, Sylvia
author_facet Zhao, Xi
Rødland, Einar Andreas
Tibshirani, Robert
Plevritis, Sylvia
author_sort Zhao, Xi
collection PubMed
description INTRODUCTION: Breast cancer is commonly classified into intrinsic molecular subtypes. Standard gene centering is routinely done prior to molecular subtyping, but it can produce inaccurate classifications when the distribution of clinicopathological characteristics in the study cohort differs from that of the training cohort used to derive the classifier. METHODS: We propose a subgroup-specific gene-centering method to perform molecular subtyping on a study cohort that has a skewed distribution of clinicopathological characteristics relative to the training cohort. On such a study cohort, we center each gene on a specified percentile, where the percentile is determined from a subgroup of the training cohort with clinicopathological characteristics similar to the study cohort. We demonstrate our method using the PAM50 classifier and its associated University of North Carolina (UNC) training cohort. We considered study cohorts with skewed clinicopathological characteristics, including subgroups composed of a single prototypic subtype of the UNC-PAM50 training cohort (n = 139), an external estrogen receptor (ER)-positive cohort (n = 48) and an external triple-negative cohort (n = 77). RESULTS: Subgroup-specific gene centering improved prediction performance with the accuracies between 77% and 100%, compared to accuracies between 17% and 33% from standard gene centering, when applied to the prototypic tumor subsets of the PAM50 training cohort. It reduced classification error rates on the ER-positive (11% versus 28%; P = 0.0389), the ER-negative (5% versus 41%; P < 0.0001) and the triple-negative (11% versus 56%; P = 0.1336) subgroups of the PAM50 training cohort. In addition, it produced higher accuracy for subtyping study cohorts composed of varying proportions of ER-positive versus ER-negative cases. Finally, it increased the percentage of assigned luminal subtypes on the external ER-positive cohort and basal-like subtype on the external triple-negative cohort. CONCLUSIONS: Gene centering is often necessary to accurately apply a molecular subtype classifier. Compared with standard gene centering, our proposed subgroup-specific gene centering produced more accurate molecular subtype assignments in a study cohort with skewed clinicopathological characteristics relative to the training cohort. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13058-015-0520-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4365540
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43655402015-03-20 Molecular subtyping for clinically defined breast cancer subgroups Zhao, Xi Rødland, Einar Andreas Tibshirani, Robert Plevritis, Sylvia Breast Cancer Res Research Article INTRODUCTION: Breast cancer is commonly classified into intrinsic molecular subtypes. Standard gene centering is routinely done prior to molecular subtyping, but it can produce inaccurate classifications when the distribution of clinicopathological characteristics in the study cohort differs from that of the training cohort used to derive the classifier. METHODS: We propose a subgroup-specific gene-centering method to perform molecular subtyping on a study cohort that has a skewed distribution of clinicopathological characteristics relative to the training cohort. On such a study cohort, we center each gene on a specified percentile, where the percentile is determined from a subgroup of the training cohort with clinicopathological characteristics similar to the study cohort. We demonstrate our method using the PAM50 classifier and its associated University of North Carolina (UNC) training cohort. We considered study cohorts with skewed clinicopathological characteristics, including subgroups composed of a single prototypic subtype of the UNC-PAM50 training cohort (n = 139), an external estrogen receptor (ER)-positive cohort (n = 48) and an external triple-negative cohort (n = 77). RESULTS: Subgroup-specific gene centering improved prediction performance with the accuracies between 77% and 100%, compared to accuracies between 17% and 33% from standard gene centering, when applied to the prototypic tumor subsets of the PAM50 training cohort. It reduced classification error rates on the ER-positive (11% versus 28%; P = 0.0389), the ER-negative (5% versus 41%; P < 0.0001) and the triple-negative (11% versus 56%; P = 0.1336) subgroups of the PAM50 training cohort. In addition, it produced higher accuracy for subtyping study cohorts composed of varying proportions of ER-positive versus ER-negative cases. Finally, it increased the percentage of assigned luminal subtypes on the external ER-positive cohort and basal-like subtype on the external triple-negative cohort. CONCLUSIONS: Gene centering is often necessary to accurately apply a molecular subtype classifier. Compared with standard gene centering, our proposed subgroup-specific gene centering produced more accurate molecular subtype assignments in a study cohort with skewed clinicopathological characteristics relative to the training cohort. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13058-015-0520-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-26 2015 /pmc/articles/PMC4365540/ /pubmed/25849221 http://dx.doi.org/10.1186/s13058-015-0520-4 Text en © Zhao et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zhao, Xi
Rødland, Einar Andreas
Tibshirani, Robert
Plevritis, Sylvia
Molecular subtyping for clinically defined breast cancer subgroups
title Molecular subtyping for clinically defined breast cancer subgroups
title_full Molecular subtyping for clinically defined breast cancer subgroups
title_fullStr Molecular subtyping for clinically defined breast cancer subgroups
title_full_unstemmed Molecular subtyping for clinically defined breast cancer subgroups
title_short Molecular subtyping for clinically defined breast cancer subgroups
title_sort molecular subtyping for clinically defined breast cancer subgroups
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365540/
https://www.ncbi.nlm.nih.gov/pubmed/25849221
http://dx.doi.org/10.1186/s13058-015-0520-4
work_keys_str_mv AT zhaoxi molecularsubtypingforclinicallydefinedbreastcancersubgroups
AT rødlandeinarandreas molecularsubtypingforclinicallydefinedbreastcancersubgroups
AT tibshiranirobert molecularsubtypingforclinicallydefinedbreastcancersubgroups
AT plevritissylvia molecularsubtypingforclinicallydefinedbreastcancersubgroups