Cargando…

A Method for Generating New Datasets Based on Copy Number for Cancer Analysis

New data sources for the analysis of cancer data are rapidly supplementing the large number of gene-expression markers used for current methods of analysis. Significant among these new sources are copy number variation (CNV) datasets, which typically enumerate several hundred thousand CNVs distribut...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Shinuk, Kon, Mark, Kang, Hyunsik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407403/
https://www.ncbi.nlm.nih.gov/pubmed/25949998
http://dx.doi.org/10.1155/2015/467514
_version_ 1782367902961238016
author Kim, Shinuk
Kon, Mark
Kang, Hyunsik
author_facet Kim, Shinuk
Kon, Mark
Kang, Hyunsik
author_sort Kim, Shinuk
collection PubMed
description New data sources for the analysis of cancer data are rapidly supplementing the large number of gene-expression markers used for current methods of analysis. Significant among these new sources are copy number variation (CNV) datasets, which typically enumerate several hundred thousand CNVs distributed throughout the genome. Several useful algorithms allow systems-level analyses of such datasets. However, these rich data sources have not yet been analyzed as deeply as gene-expression data. To address this issue, the extensive toolsets used for analyzing expression data in cancerous and noncancerous tissue (e.g., gene set enrichment analysis and phenotype prediction) could be redirected to extract a great deal of predictive information from CNV data, in particular those derived from cancers. Here we present a software package capable of preprocessing standard Agilent copy number datasets into a form to which essentially all expression analysis tools can be applied. We illustrate the use of this toolset in predicting the survival time of patients with ovarian cancer or glioblastoma multiforme and also provide an analysis of gene- and pathway-level deletions in these two types of cancer.
format Online
Article
Text
id pubmed-4407403
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-44074032015-05-06 A Method for Generating New Datasets Based on Copy Number for Cancer Analysis Kim, Shinuk Kon, Mark Kang, Hyunsik Biomed Res Int Research Article New data sources for the analysis of cancer data are rapidly supplementing the large number of gene-expression markers used for current methods of analysis. Significant among these new sources are copy number variation (CNV) datasets, which typically enumerate several hundred thousand CNVs distributed throughout the genome. Several useful algorithms allow systems-level analyses of such datasets. However, these rich data sources have not yet been analyzed as deeply as gene-expression data. To address this issue, the extensive toolsets used for analyzing expression data in cancerous and noncancerous tissue (e.g., gene set enrichment analysis and phenotype prediction) could be redirected to extract a great deal of predictive information from CNV data, in particular those derived from cancers. Here we present a software package capable of preprocessing standard Agilent copy number datasets into a form to which essentially all expression analysis tools can be applied. We illustrate the use of this toolset in predicting the survival time of patients with ovarian cancer or glioblastoma multiforme and also provide an analysis of gene- and pathway-level deletions in these two types of cancer. Hindawi Publishing Corporation 2015 2015-04-08 /pmc/articles/PMC4407403/ /pubmed/25949998 http://dx.doi.org/10.1155/2015/467514 Text en Copyright © 2015 Shinuk Kim et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kim, Shinuk
Kon, Mark
Kang, Hyunsik
A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title_full A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title_fullStr A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title_full_unstemmed A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title_short A Method for Generating New Datasets Based on Copy Number for Cancer Analysis
title_sort method for generating new datasets based on copy number for cancer analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407403/
https://www.ncbi.nlm.nih.gov/pubmed/25949998
http://dx.doi.org/10.1155/2015/467514
work_keys_str_mv AT kimshinuk amethodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis
AT konmark amethodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis
AT kanghyunsik amethodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis
AT kimshinuk methodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis
AT konmark methodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis
AT kanghyunsik methodforgeneratingnewdatasetsbasedoncopynumberforcanceranalysis