Cargando…

Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer

BACKGROUND: Despite initial response in adjuvant chemotherapy, ovarian cancer patients treated with the combination of paclitaxel and carboplatin frequently suffer from recurrence after few cycles of treatment, and the underlying mechanisms causing the chemoresistance remain unclear. Recently, The C...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsu, Fang-Han, Serpedin, Erchin, Hsiao, Tzu-Hung, Bishop, Alexander JR, Dougherty, Edward R, Chen, Yidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481440/
https://www.ncbi.nlm.nih.gov/pubmed/23134756
http://dx.doi.org/10.1186/1471-2164-13-S6-S13
_version_ 1782247738933510144
author Hsu, Fang-Han
Serpedin, Erchin
Hsiao, Tzu-Hung
Bishop, Alexander JR
Dougherty, Edward R
Chen, Yidong
author_facet Hsu, Fang-Han
Serpedin, Erchin
Hsiao, Tzu-Hung
Bishop, Alexander JR
Dougherty, Edward R
Chen, Yidong
author_sort Hsu, Fang-Han
collection PubMed
description BACKGROUND: Despite initial response in adjuvant chemotherapy, ovarian cancer patients treated with the combination of paclitaxel and carboplatin frequently suffer from recurrence after few cycles of treatment, and the underlying mechanisms causing the chemoresistance remain unclear. Recently, The Cancer Genome Atlas (TCGA) research network concluded an ovarian cancer study and released the dataset to the public. The TCGA dataset possesses large sample size, comprehensive molecular profiles, and clinical outcome information; however, because of the unknown molecular subtypes in ovarian cancer and the great diversity of adjuvant treatments TCGA patients went through, studying chemotherapeutic response using the TCGA data is difficult. Additionally, factors such as sample batches, patient ages, and tumor stages further confound or suppress the identification of relevant genes, and thus the biological functions and disease mechanisms. RESULTS: To address these issues, herein we propose an analysis procedure designed to reduce suppression effect by focusing on a specific chemotherapeutic treatment, and to remove confounding effects such as batch effect, patient's age, and tumor stages. The proposed procedure starts with a batch effect adjustment, followed by a rigorous sample selection process. Then, the gene expression, copy number, and methylation profiles from the TCGA ovarian cancer dataset are analyzed using a semi-supervised clustering method combined with a novel scoring function. As a result, two molecular classifications, one with poor copy number profiles and one with poor methylation profiles, enriched with unfavorable scores are identified. Compared with the samples enriched with favorable scores, these two classifications exhibit poor progression-free survival (PFS) and might be associated with poor chemotherapy response specifically to the combination of paclitaxel and carboplatin. Significant genes and biological processes are detected subsequently using classical statistical approaches and enrichment analysis. CONCLUSIONS: The proposed procedure for the reduction of confounding and suppression effects and the semi-supervised clustering method are essential steps to identify genes associated with the chemotherapeutic response.
format Online
Article
Text
id pubmed-3481440
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34814402012-11-02 Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer Hsu, Fang-Han Serpedin, Erchin Hsiao, Tzu-Hung Bishop, Alexander JR Dougherty, Edward R Chen, Yidong BMC Genomics Research BACKGROUND: Despite initial response in adjuvant chemotherapy, ovarian cancer patients treated with the combination of paclitaxel and carboplatin frequently suffer from recurrence after few cycles of treatment, and the underlying mechanisms causing the chemoresistance remain unclear. Recently, The Cancer Genome Atlas (TCGA) research network concluded an ovarian cancer study and released the dataset to the public. The TCGA dataset possesses large sample size, comprehensive molecular profiles, and clinical outcome information; however, because of the unknown molecular subtypes in ovarian cancer and the great diversity of adjuvant treatments TCGA patients went through, studying chemotherapeutic response using the TCGA data is difficult. Additionally, factors such as sample batches, patient ages, and tumor stages further confound or suppress the identification of relevant genes, and thus the biological functions and disease mechanisms. RESULTS: To address these issues, herein we propose an analysis procedure designed to reduce suppression effect by focusing on a specific chemotherapeutic treatment, and to remove confounding effects such as batch effect, patient's age, and tumor stages. The proposed procedure starts with a batch effect adjustment, followed by a rigorous sample selection process. Then, the gene expression, copy number, and methylation profiles from the TCGA ovarian cancer dataset are analyzed using a semi-supervised clustering method combined with a novel scoring function. As a result, two molecular classifications, one with poor copy number profiles and one with poor methylation profiles, enriched with unfavorable scores are identified. Compared with the samples enriched with favorable scores, these two classifications exhibit poor progression-free survival (PFS) and might be associated with poor chemotherapy response specifically to the combination of paclitaxel and carboplatin. Significant genes and biological processes are detected subsequently using classical statistical approaches and enrichment analysis. CONCLUSIONS: The proposed procedure for the reduction of confounding and suppression effects and the semi-supervised clustering method are essential steps to identify genes associated with the chemotherapeutic response. BioMed Central 2012-10-26 /pmc/articles/PMC3481440/ /pubmed/23134756 http://dx.doi.org/10.1186/1471-2164-13-S6-S13 Text en Copyright ©2012 Hsu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Hsu, Fang-Han
Serpedin, Erchin
Hsiao, Tzu-Hung
Bishop, Alexander JR
Dougherty, Edward R
Chen, Yidong
Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title_full Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title_fullStr Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title_full_unstemmed Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title_short Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer
title_sort reducing confounding and suppression effects in tcga data: an integrated analysis of chemotherapy response in ovarian cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481440/
https://www.ncbi.nlm.nih.gov/pubmed/23134756
http://dx.doi.org/10.1186/1471-2164-13-S6-S13
work_keys_str_mv AT hsufanghan reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer
AT serpedinerchin reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer
AT hsiaotzuhung reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer
AT bishopalexanderjr reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer
AT doughertyedwardr reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer
AT chenyidong reducingconfoundingandsuppressioneffectsintcgadataanintegratedanalysisofchemotherapyresponseinovariancancer