Cargando…
Classification of human cancers based on DNA copy number amplification modeling
BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification p...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2397431/ https://www.ncbi.nlm.nih.gov/pubmed/18477412 http://dx.doi.org/10.1186/1755-8794-1-15 |
_version_ | 1782155624716435456 |
---|---|
author | Myllykangas, Samuel Tikka, Jarkko Böhling, Tom Knuutila, Sakari Hollmén, Jaakko |
author_facet | Myllykangas, Samuel Tikka, Jarkko Böhling, Tom Knuutila, Sakari Hollmén, Jaakko |
author_sort | Myllykangas, Samuel |
collection | PubMed |
description | BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification patterns, explore the biological and clinical fundamentals behind their amplification-pattern based classification, and understand the characteristics in human genomic architecture that associate with amplification mechanisms. METHODS: We applied a machine learning approach to model DNA copy number amplifications using a data set of binary amplification records at chromosome sub-band resolution from 4400 cases that represent 82 cancer types. Amplification data was fused with background data: clinical, histological and biological classifications, and cytogenetic annotations. Statistical hypothesis testing was used to mine associations between the data sets. RESULTS: Probabilistic clustering of each chromosome identified 111 amplification models and divided the cancer cases into clusters. The distribution of classification terms in the amplification-model based clustering of cancer cases revealed cancer classes that were associated with specific DNA copy number amplification models. Amplification patterns – finite or bounded descriptions of the ranges of the amplifications in the chromosome – were extracted from the clustered data and expressed according to the original cytogenetic nomenclature. This was achieved by maximal frequent itemset mining using the cluster-specific data sets. The boundaries of amplification patterns were shown to be enriched with fragile sites, telomeres, centromeres, and light chromosome bands. CONCLUSIONS: Our results demonstrate that amplifications are non-random chromosomal changes and specifically selected in tumor tissue microenvironment. Furthermore, statistical evidence showed that specific chromosomal features co-localize with amplification breakpoints and link them in the amplification process. |
format | Text |
id | pubmed-2397431 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23974312008-05-29 Classification of human cancers based on DNA copy number amplification modeling Myllykangas, Samuel Tikka, Jarkko Böhling, Tom Knuutila, Sakari Hollmén, Jaakko BMC Med Genomics Research Article BACKGROUND: DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification patterns, explore the biological and clinical fundamentals behind their amplification-pattern based classification, and understand the characteristics in human genomic architecture that associate with amplification mechanisms. METHODS: We applied a machine learning approach to model DNA copy number amplifications using a data set of binary amplification records at chromosome sub-band resolution from 4400 cases that represent 82 cancer types. Amplification data was fused with background data: clinical, histological and biological classifications, and cytogenetic annotations. Statistical hypothesis testing was used to mine associations between the data sets. RESULTS: Probabilistic clustering of each chromosome identified 111 amplification models and divided the cancer cases into clusters. The distribution of classification terms in the amplification-model based clustering of cancer cases revealed cancer classes that were associated with specific DNA copy number amplification models. Amplification patterns – finite or bounded descriptions of the ranges of the amplifications in the chromosome – were extracted from the clustered data and expressed according to the original cytogenetic nomenclature. This was achieved by maximal frequent itemset mining using the cluster-specific data sets. The boundaries of amplification patterns were shown to be enriched with fragile sites, telomeres, centromeres, and light chromosome bands. CONCLUSIONS: Our results demonstrate that amplifications are non-random chromosomal changes and specifically selected in tumor tissue microenvironment. Furthermore, statistical evidence showed that specific chromosomal features co-localize with amplification breakpoints and link them in the amplification process. BioMed Central 2008-05-14 /pmc/articles/PMC2397431/ /pubmed/18477412 http://dx.doi.org/10.1186/1755-8794-1-15 Text en Copyright © 2008 Myllykangas et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Myllykangas, Samuel Tikka, Jarkko Böhling, Tom Knuutila, Sakari Hollmén, Jaakko Classification of human cancers based on DNA copy number amplification modeling |
title | Classification of human cancers based on DNA copy number amplification modeling |
title_full | Classification of human cancers based on DNA copy number amplification modeling |
title_fullStr | Classification of human cancers based on DNA copy number amplification modeling |
title_full_unstemmed | Classification of human cancers based on DNA copy number amplification modeling |
title_short | Classification of human cancers based on DNA copy number amplification modeling |
title_sort | classification of human cancers based on dna copy number amplification modeling |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2397431/ https://www.ncbi.nlm.nih.gov/pubmed/18477412 http://dx.doi.org/10.1186/1755-8794-1-15 |
work_keys_str_mv | AT myllykangassamuel classificationofhumancancersbasedondnacopynumberamplificationmodeling AT tikkajarkko classificationofhumancancersbasedondnacopynumberamplificationmodeling AT bohlingtom classificationofhumancancersbasedondnacopynumberamplificationmodeling AT knuutilasakari classificationofhumancancersbasedondnacopynumberamplificationmodeling AT hollmenjaakko classificationofhumancancersbasedondnacopynumberamplificationmodeling |