Cargando…

Classification of human cancers based on DNA copy number amplification modeling

BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification p...

Descripción completa

Detalles Bibliográficos
Autores principales: Myllykangas, Samuel, Tikka, Jarkko, Böhling, Tom, Knuutila, Sakari, Hollmén, Jaakko
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2397431/
https://www.ncbi.nlm.nih.gov/pubmed/18477412
http://dx.doi.org/10.1186/1755-8794-1-15
_version_ 1782155624716435456
author Myllykangas, Samuel
Tikka, Jarkko
Böhling, Tom
Knuutila, Sakari
Hollmén, Jaakko
author_facet Myllykangas, Samuel
Tikka, Jarkko
Böhling, Tom
Knuutila, Sakari
Hollmén, Jaakko
author_sort Myllykangas, Samuel
collection PubMed
description BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification patterns, explore the biological and clinical fundamentals behind their amplification-pattern based classification, and understand the characteristics in human genomic architecture that associate with amplification mechanisms. METHODS: We applied a machine learning approach to model DNA copy number amplifications using a data set of binary amplification records at chromosome sub-band resolution from 4400 cases that represent 82 cancer types. Amplification data was fused with background data: clinical, histological and biological classifications, and cytogenetic annotations. Statistical hypothesis testing was used to mine associations between the data sets. RESULTS: Probabilistic clustering of each chromosome identified 111 amplification models and divided the cancer cases into clusters. The distribution of classification terms in the amplification-model based clustering of cancer cases revealed cancer classes that were associated with specific DNA copy number amplification models. Amplification patterns – finite or bounded descriptions of the ranges of the amplifications in the chromosome – were extracted from the clustered data and expressed according to the original cytogenetic nomenclature. This was achieved by maximal frequent itemset mining using the cluster-specific data sets. The boundaries of amplification patterns were shown to be enriched with fragile sites, telomeres, centromeres, and light chromosome bands. CONCLUSIONS: Our results demonstrate that amplifications are non-random chromosomal changes and specifically selected in tumor tissue microenvironment. Furthermore, statistical evidence showed that specific chromosomal features co-localize with amplification breakpoints and link them in the amplification process.
format Text
id pubmed-2397431
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23974312008-05-29 Classification of human cancers based on DNA copy number amplification modeling Myllykangas, Samuel Tikka, Jarkko Böhling, Tom Knuutila, Sakari Hollmén, Jaakko BMC Med Genomics Research Article BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification patterns, explore the biological and clinical fundamentals behind their amplification-pattern based classification, and understand the characteristics in human genomic architecture that associate with amplification mechanisms. METHODS: We applied a machine learning approach to model DNA copy number amplifications using a data set of binary amplification records at chromosome sub-band resolution from 4400 cases that represent 82 cancer types. Amplification data was fused with background data: clinical, histological and biological classifications, and cytogenetic annotations. Statistical hypothesis testing was used to mine associations between the data sets. RESULTS: Probabilistic clustering of each chromosome identified 111 amplification models and divided the cancer cases into clusters. The distribution of classification terms in the amplification-model based clustering of cancer cases revealed cancer classes that were associated with specific DNA copy number amplification models. Amplification patterns – finite or bounded descriptions of the ranges of the amplifications in the chromosome – were extracted from the clustered data and expressed according to the original cytogenetic nomenclature. This was achieved by maximal frequent itemset mining using the cluster-specific data sets. The boundaries of amplification patterns were shown to be enriched with fragile sites, telomeres, centromeres, and light chromosome bands. CONCLUSIONS: Our results demonstrate that amplifications are non-random chromosomal changes and specifically selected in tumor tissue microenvironment. Furthermore, statistical evidence showed that specific chromosomal features co-localize with amplification breakpoints and link them in the amplification process. BioMed Central 2008-05-14 /pmc/articles/PMC2397431/ /pubmed/18477412 http://dx.doi.org/10.1186/1755-8794-1-15 Text en Copyright © 2008 Myllykangas et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Myllykangas, Samuel
Tikka, Jarkko
Böhling, Tom
Knuutila, Sakari
Hollmén, Jaakko
Classification of human cancers based on DNA copy number amplification modeling
title Classification of human cancers based on DNA copy number amplification modeling
title_full Classification of human cancers based on DNA copy number amplification modeling
title_fullStr Classification of human cancers based on DNA copy number amplification modeling
title_full_unstemmed Classification of human cancers based on DNA copy number amplification modeling
title_short Classification of human cancers based on DNA copy number amplification modeling
title_sort classification of human cancers based on dna copy number amplification modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2397431/
https://www.ncbi.nlm.nih.gov/pubmed/18477412
http://dx.doi.org/10.1186/1755-8794-1-15
work_keys_str_mv AT myllykangassamuel classificationofhumancancersbasedondnacopynumberamplificationmodeling
AT tikkajarkko classificationofhumancancersbasedondnacopynumberamplificationmodeling
AT bohlingtom classificationofhumancancersbasedondnacopynumberamplificationmodeling
AT knuutilasakari classificationofhumancancersbasedondnacopynumberamplificationmodeling
AT hollmenjaakko classificationofhumancancersbasedondnacopynumberamplificationmodeling