Cargando…

CrossLink: a novel method for cross-condition classification of cancer subtypes

BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Chifeng, Sastry, Konduru S., Flore, Mario, Gehani, Salah, Al-Bozom, Issam, Feng, Yusheng, Serpedin, Erchin, Chouchane, Lotfi, Chen, Yidong, Huang, Yufei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001207/
https://www.ncbi.nlm.nih.gov/pubmed/27556419
http://dx.doi.org/10.1186/s12864-016-2903-z
_version_ 1782450430349934592
author Ma, Chifeng
Sastry, Konduru S.
Flore, Mario
Gehani, Salah
Al-Bozom, Issam
Feng, Yusheng
Serpedin, Erchin
Chouchane, Lotfi
Chen, Yidong
Huang, Yufei
author_facet Ma, Chifeng
Sastry, Konduru S.
Flore, Mario
Gehani, Salah
Al-Bozom, Issam
Feng, Yusheng
Serpedin, Erchin
Chouchane, Lotfi
Chen, Yidong
Huang, Yufei
author_sort Ma, Chifeng
collection PubMed
description BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. METHODS: To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. RESULTS: We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. CONCLUSIONS: A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms.
format Online
Article
Text
id pubmed-5001207
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50012072016-09-06 CrossLink: a novel method for cross-condition classification of cancer subtypes Ma, Chifeng Sastry, Konduru S. Flore, Mario Gehani, Salah Al-Bozom, Issam Feng, Yusheng Serpedin, Erchin Chouchane, Lotfi Chen, Yidong Huang, Yufei BMC Genomics Research BACKGROUND: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. METHODS: To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. RESULTS: We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. CONCLUSIONS: A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms. BioMed Central 2016-08-22 /pmc/articles/PMC5001207/ /pubmed/27556419 http://dx.doi.org/10.1186/s12864-016-2903-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ma, Chifeng
Sastry, Konduru S.
Flore, Mario
Gehani, Salah
Al-Bozom, Issam
Feng, Yusheng
Serpedin, Erchin
Chouchane, Lotfi
Chen, Yidong
Huang, Yufei
CrossLink: a novel method for cross-condition classification of cancer subtypes
title CrossLink: a novel method for cross-condition classification of cancer subtypes
title_full CrossLink: a novel method for cross-condition classification of cancer subtypes
title_fullStr CrossLink: a novel method for cross-condition classification of cancer subtypes
title_full_unstemmed CrossLink: a novel method for cross-condition classification of cancer subtypes
title_short CrossLink: a novel method for cross-condition classification of cancer subtypes
title_sort crosslink: a novel method for cross-condition classification of cancer subtypes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001207/
https://www.ncbi.nlm.nih.gov/pubmed/27556419
http://dx.doi.org/10.1186/s12864-016-2903-z
work_keys_str_mv AT machifeng crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT sastrykondurus crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT floremario crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT gehanisalah crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT albozomissam crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT fengyusheng crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT serpedinerchin crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT chouchanelotfi crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT chenyidong crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes
AT huangyufei crosslinkanovelmethodforcrossconditionclassificationofcancersubtypes