Cargando…

An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models

BACKGROUND: Cancer is a heterogeneous disease caused by genomic aberrations and characterized by significant variability in clinical outcomes and response to therapies. Several subtypes of common cancers have been identified based on alterations of individual cancer genes, such as HER2, EGFR, and ot...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Xin, Zhang, Ke, Van Sant, Charles, Coon, John, Semizarov, Dimitri
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2901344/
https://www.ncbi.nlm.nih.gov/pubmed/20569491
http://dx.doi.org/10.1186/1755-8794-3-23
_version_ 1782183678729781248
author Lu, Xin
Zhang, Ke
Van Sant, Charles
Coon, John
Semizarov, Dimitri
author_facet Lu, Xin
Zhang, Ke
Van Sant, Charles
Coon, John
Semizarov, Dimitri
author_sort Lu, Xin
collection PubMed
description BACKGROUND: Cancer is a heterogeneous disease caused by genomic aberrations and characterized by significant variability in clinical outcomes and response to therapies. Several subtypes of common cancers have been identified based on alterations of individual cancer genes, such as HER2, EGFR, and others. However, cancer is a complex disease driven by the interaction of multiple genes, so the copy number status of individual genes is not sufficient to define cancer subtypes and predict responses to treatments. A classification based on genome-wide copy number patterns would be better suited for this purpose. METHOD: To develop a more comprehensive cancer taxonomy based on genome-wide patterns of copy number abnormalities, we designed an unsupervised classification algorithm that identifies genomic subgroups of tumors. This algorithm is based on a modified genomic Non-negative Matrix Factorization (gNMF) algorithm and includes several additional components, namely a pilot hierarchical clustering procedure to determine the number of clusters, a multiple random initiation scheme, a new stop criterion for the core gNMF, as well as a 10-fold cross-validation stability test for quality assessment. RESULT: We applied our algorithm to identify genomic subgroups of three major cancer types: non-small cell lung carcinoma (NSCLC), colorectal cancer (CRC), and malignant melanoma. High-density SNP array datasets for patient tumors and established cell lines were used to define genomic subclasses of the diseases and identify cell lines representative of each genomic subtype. The algorithm was compared with several traditional clustering methods and showed improved performance. To validate our genomic taxonomy of NSCLC, we correlated the genomic classification with disease outcomes. Overall survival time and time to recurrence were shown to differ significantly between the genomic subtypes. CONCLUSIONS: We developed an algorithm for cancer classification based on genome-wide patterns of copy number aberrations and demonstrated its superiority to existing clustering methods. The algorithm was applied to define genomic subgroups of three cancer types and identify cell lines representative of these subgroups. Our data enabled the assembly of representative cell line panels for testing drug candidates.
format Text
id pubmed-2901344
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29013442010-07-10 An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models Lu, Xin Zhang, Ke Van Sant, Charles Coon, John Semizarov, Dimitri BMC Med Genomics Research article BACKGROUND: Cancer is a heterogeneous disease caused by genomic aberrations and characterized by significant variability in clinical outcomes and response to therapies. Several subtypes of common cancers have been identified based on alterations of individual cancer genes, such as HER2, EGFR, and others. However, cancer is a complex disease driven by the interaction of multiple genes, so the copy number status of individual genes is not sufficient to define cancer subtypes and predict responses to treatments. A classification based on genome-wide copy number patterns would be better suited for this purpose. METHOD: To develop a more comprehensive cancer taxonomy based on genome-wide patterns of copy number abnormalities, we designed an unsupervised classification algorithm that identifies genomic subgroups of tumors. This algorithm is based on a modified genomic Non-negative Matrix Factorization (gNMF) algorithm and includes several additional components, namely a pilot hierarchical clustering procedure to determine the number of clusters, a multiple random initiation scheme, a new stop criterion for the core gNMF, as well as a 10-fold cross-validation stability test for quality assessment. RESULT: We applied our algorithm to identify genomic subgroups of three major cancer types: non-small cell lung carcinoma (NSCLC), colorectal cancer (CRC), and malignant melanoma. High-density SNP array datasets for patient tumors and established cell lines were used to define genomic subclasses of the diseases and identify cell lines representative of each genomic subtype. The algorithm was compared with several traditional clustering methods and showed improved performance. To validate our genomic taxonomy of NSCLC, we correlated the genomic classification with disease outcomes. Overall survival time and time to recurrence were shown to differ significantly between the genomic subtypes. CONCLUSIONS: We developed an algorithm for cancer classification based on genome-wide patterns of copy number aberrations and demonstrated its superiority to existing clustering methods. The algorithm was applied to define genomic subgroups of three cancer types and identify cell lines representative of these subgroups. Our data enabled the assembly of representative cell line panels for testing drug candidates. BioMed Central 2010-06-22 /pmc/articles/PMC2901344/ /pubmed/20569491 http://dx.doi.org/10.1186/1755-8794-3-23 Text en Copyright ©2010 Lu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Lu, Xin
Zhang, Ke
Van Sant, Charles
Coon, John
Semizarov, Dimitri
An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title_full An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title_fullStr An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title_full_unstemmed An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title_short An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
title_sort algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2901344/
https://www.ncbi.nlm.nih.gov/pubmed/20569491
http://dx.doi.org/10.1186/1755-8794-3-23
work_keys_str_mv AT luxin analgorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT zhangke analgorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT vansantcharles analgorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT coonjohn analgorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT semizarovdimitri analgorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT luxin algorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT zhangke algorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT vansantcharles algorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT coonjohn algorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels
AT semizarovdimitri algorithmforclassifyingtumorsbasedongenomicaberrationsandselectingrepresentativetumormodels