Cargando…

Genome-wide identification of significant aberrations in cancer genome

BACKGROUND: Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberrat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Xiguo, Yu, Guoqiang, Hou, Xuchu, Shih, Ie-Ming, Clarke, Robert, Zhang, Junying, Hoffman, Eric P, Wang, Roger R, Zhang, Zhen, Wang, Yue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428679/
https://www.ncbi.nlm.nih.gov/pubmed/22839576
http://dx.doi.org/10.1186/1471-2164-13-342
_version_ 1782241726934548480
author Yuan, Xiguo
Yu, Guoqiang
Hou, Xuchu
Shih, Ie-Ming
Clarke, Robert
Zhang, Junying
Hoffman, Eric P
Wang, Roger R
Zhang, Zhen
Wang, Yue
author_facet Yuan, Xiguo
Yu, Guoqiang
Hou, Xuchu
Shih, Ie-Ming
Clarke, Robert
Zhang, Junying
Hoffman, Eric P
Wang, Roger R
Zhang, Zhen
Wang, Yue
author_sort Yuan, Xiguo
collection PubMed
description BACKGROUND: Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. RESULTS: We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. CONCLUSIONS: Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open–source and platform-independent SAIC software is implemented using C++, together with R scripts for data formatting and Perl scripts for user interfacing, and it is easy to install and efficient to use. The source code and documentation are freely available at http://www.cbil.ece.vt.edu/software.htm.
format Online
Article
Text
id pubmed-3428679
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34286792012-08-30 Genome-wide identification of significant aberrations in cancer genome Yuan, Xiguo Yu, Guoqiang Hou, Xuchu Shih, Ie-Ming Clarke, Robert Zhang, Junying Hoffman, Eric P Wang, Roger R Zhang, Zhen Wang, Yue BMC Genomics Methodology Article BACKGROUND: Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. RESULTS: We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. CONCLUSIONS: Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open–source and platform-independent SAIC software is implemented using C++, together with R scripts for data formatting and Perl scripts for user interfacing, and it is easy to install and efficient to use. The source code and documentation are freely available at http://www.cbil.ece.vt.edu/software.htm. BioMed Central 2012-07-27 /pmc/articles/PMC3428679/ /pubmed/22839576 http://dx.doi.org/10.1186/1471-2164-13-342 Text en Copyright ©2012 Yuan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Yuan, Xiguo
Yu, Guoqiang
Hou, Xuchu
Shih, Ie-Ming
Clarke, Robert
Zhang, Junying
Hoffman, Eric P
Wang, Roger R
Zhang, Zhen
Wang, Yue
Genome-wide identification of significant aberrations in cancer genome
title Genome-wide identification of significant aberrations in cancer genome
title_full Genome-wide identification of significant aberrations in cancer genome
title_fullStr Genome-wide identification of significant aberrations in cancer genome
title_full_unstemmed Genome-wide identification of significant aberrations in cancer genome
title_short Genome-wide identification of significant aberrations in cancer genome
title_sort genome-wide identification of significant aberrations in cancer genome
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428679/
https://www.ncbi.nlm.nih.gov/pubmed/22839576
http://dx.doi.org/10.1186/1471-2164-13-342
work_keys_str_mv AT yuanxiguo genomewideidentificationofsignificantaberrationsincancergenome
AT yuguoqiang genomewideidentificationofsignificantaberrationsincancergenome
AT houxuchu genomewideidentificationofsignificantaberrationsincancergenome
AT shihieming genomewideidentificationofsignificantaberrationsincancergenome
AT clarkerobert genomewideidentificationofsignificantaberrationsincancergenome
AT zhangjunying genomewideidentificationofsignificantaberrationsincancergenome
AT hoffmanericp genomewideidentificationofsignificantaberrationsincancergenome
AT wangrogerr genomewideidentificationofsignificantaberrationsincancergenome
AT zhangzhen genomewideidentificationofsignificantaberrationsincancergenome
AT wangyue genomewideidentificationofsignificantaberrationsincancergenome