Cargando…

Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns

Molecular stratification of disease based on expression levels of sets of genes can help guide therapeutic decisions if such classifications can be shown to be stable against variations in sample source and data perturbation. Classifications inferred from one set of samples in one lab should be able...

Descripción completa

Detalles Bibliográficos
Autores principales: Alexe, G., Dalgin, G.S., Ramaswamy, R., DeLisi, C., Bhanot, G.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675483/
https://www.ncbi.nlm.nih.gov/pubmed/19458770
_version_ 1782166698185457664
author Alexe, G.
Dalgin, G.S.
Ramaswamy, R.
DeLisi, C.
Bhanot, G.
author_facet Alexe, G.
Dalgin, G.S.
Ramaswamy, R.
DeLisi, C.
Bhanot, G.
author_sort Alexe, G.
collection PubMed
description Molecular stratification of disease based on expression levels of sets of genes can help guide therapeutic decisions if such classifications can be shown to be stable against variations in sample source and data perturbation. Classifications inferred from one set of samples in one lab should be able to consistently stratify a different set of samples in another lab. We present a method for assessing such stability and apply it to the breast cancer (BCA) datasets of Sorlie et al. 2003 and Ma et al. 2003. We find that within the now commonly accepted BCA categories identified by Sorlie et al. Luminal A and Basal are robust, but Luminal B and ERBB2+ are not. In particular, 36% of the samples identified as Luminal B and 55% identified as ERBB2+ cannot be assigned an accurate category because the classification is sensitive to data perturbation. We identify a “core cluster” of samples for each category, and from these we determine “patterns” of gene expression that distinguish the core clusters from each other. We find that the best markers for Luminal A and Basal are (ESR1, LIV1, GATA-3) and (CCNE1, LAD1, KRT5), respectively. Pathways enriched in the patterns regulate apoptosis, tissue remodeling and the immune response. We use a different dataset (Ma et al. 2003) to test the accuracy with which samples can be allocated to the four disease subtypes. We find, as expected, that the classification of samples identified as Luminal A and Basal is robust but classification into the other two subtypes is not.
format Text
id pubmed-2675483
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26754832009-05-20 Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns Alexe, G. Dalgin, G.S. Ramaswamy, R. DeLisi, C. Bhanot, G. Cancer Inform Original Research Molecular stratification of disease based on expression levels of sets of genes can help guide therapeutic decisions if such classifications can be shown to be stable against variations in sample source and data perturbation. Classifications inferred from one set of samples in one lab should be able to consistently stratify a different set of samples in another lab. We present a method for assessing such stability and apply it to the breast cancer (BCA) datasets of Sorlie et al. 2003 and Ma et al. 2003. We find that within the now commonly accepted BCA categories identified by Sorlie et al. Luminal A and Basal are robust, but Luminal B and ERBB2+ are not. In particular, 36% of the samples identified as Luminal B and 55% identified as ERBB2+ cannot be assigned an accurate category because the classification is sensitive to data perturbation. We identify a “core cluster” of samples for each category, and from these we determine “patterns” of gene expression that distinguish the core clusters from each other. We find that the best markers for Luminal A and Basal are (ESR1, LIV1, GATA-3) and (CCNE1, LAD1, KRT5), respectively. Pathways enriched in the patterns regulate apoptosis, tissue remodeling and the immune response. We use a different dataset (Ma et al. 2003) to test the accuracy with which samples can be allocated to the four disease subtypes. We find, as expected, that the classification of samples identified as Luminal A and Basal is robust but classification into the other two subtypes is not. Libertas Academica 2007-02-19 /pmc/articles/PMC2675483/ /pubmed/19458770 Text en © 2006 The authors.
spellingShingle Original Research
Alexe, G.
Dalgin, G.S.
Ramaswamy, R.
DeLisi, C.
Bhanot, G.
Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title_full Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title_fullStr Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title_full_unstemmed Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title_short Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns
title_sort data perturbation independent diagnosis and validation of breast cancer subtypes using clustering and patterns
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675483/
https://www.ncbi.nlm.nih.gov/pubmed/19458770
work_keys_str_mv AT alexeg dataperturbationindependentdiagnosisandvalidationofbreastcancersubtypesusingclusteringandpatterns
AT dalgings dataperturbationindependentdiagnosisandvalidationofbreastcancersubtypesusingclusteringandpatterns
AT ramaswamyr dataperturbationindependentdiagnosisandvalidationofbreastcancersubtypesusingclusteringandpatterns
AT delisic dataperturbationindependentdiagnosisandvalidationofbreastcancersubtypesusingclusteringandpatterns
AT bhanotg dataperturbationindependentdiagnosisandvalidationofbreastcancersubtypesusingclusteringandpatterns