Cargando…
CrossLink: a novel method for cross-condition classification of cancer subtypes
BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the ge...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001207/ https://www.ncbi.nlm.nih.gov/pubmed/27556419 http://dx.doi.org/10.1186/s12864-016-2903-z |
_version_ | 1782450430349934592 |
---|---|
author | Ma, Chifeng Sastry, Konduru S. Flore, Mario Gehani, Salah Al-Bozom, Issam Feng, Yusheng Serpedin, Erchin Chouchane, Lotfi Chen, Yidong Huang, Yufei |
author_facet | Ma, Chifeng Sastry, Konduru S. Flore, Mario Gehani, Salah Al-Bozom, Issam Feng, Yusheng Serpedin, Erchin Chouchane, Lotfi Chen, Yidong Huang, Yufei |
author_sort | Ma, Chifeng |
collection | PubMed |
description | BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. METHODS: To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. RESULTS: We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. CONCLUSIONS: A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms. |
format | Online Article Text |
id | pubmed-5001207 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50012072016-09-06 CrossLink: a novel method for cross-condition classification of cancer subtypes Ma, Chifeng Sastry, Konduru S. Flore, Mario Gehani, Salah Al-Bozom, Issam Feng, Yusheng Serpedin, Erchin Chouchane, Lotfi Chen, Yidong Huang, Yufei BMC Genomics Research BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. METHODS: To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. RESULTS: We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. CONCLUSIONS: A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms. BioMed Central 2016-08-22 /pmc/articles/PMC5001207/ /pubmed/27556419 http://dx.doi.org/10.1186/s12864-016-2903-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Ma, Chifeng Sastry, Konduru S. Flore, Mario Gehani, Salah Al-Bozom, Issam Feng, Yusheng Serpedin, Erchin Chouchane, Lotfi Chen, Yidong Huang, Yufei CrossLink: a novel method for cross-condition classification of cancer subtypes |
title | CrossLink: a novel method for cross-condition classification of cancer subtypes |
title_full | CrossLink: a novel method for cross-condition classification of cancer subtypes |
title_fullStr | CrossLink: a novel method for cross-condition classification of cancer subtypes |
title_full_unstemmed | CrossLink: a novel method for cross-condition classification of cancer subtypes |
title_short | CrossLink: a novel method for cross-condition classification of cancer subtypes |
title_sort | crosslink: a novel method for cross-condition classification of cancer subtypes |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001207/ https://www.ncbi.nlm.nih.gov/pubmed/27556419 http://dx.doi.org/10.1186/s12864-016-2903-z |
work_keys_str_mv | AT machifeng crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT sastrykondurus crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT floremario crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT gehanisalah crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT albozomissam crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT fengyusheng crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT serpedinerchin crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT chouchanelotfi crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT chenyidong crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes AT huangyufei crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes |