Cargando…

TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile

The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set wit...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yen-Tsung, Hsu, Thomas, Christiani, David C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4218657/
https://www.ncbi.nlm.nih.gov/pubmed/25452685
http://dx.doi.org/10.4137/CIN.S13978
_version_ 1782342453352726528
author Huang, Yen-Tsung
Hsu, Thomas
Christiani, David C
author_facet Huang, Yen-Tsung
Hsu, Thomas
Christiani, David C
author_sort Huang, Yen-Tsung
collection PubMed
description The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of X(2) distributions that can be obtained using permutation with scaled X(2) approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed a genome-wide copy number data from 264 patients of non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (<2.8 × 10(−5)), including the PTEN pathway (7.8 × 10(−7)), the gene set up-regulated under heat shock (3.6 × 10(−6)), the gene sets involved in the immune profile for rejection of kidney transplantation (9.2 × 10(−6)) and for transcriptional control of leukocytes (2.2 × 10(−5)), and the ganglioside biosynthesis pathway (2.7 × 10(−5)). In conclusion, we present a new method for pathway analyses of copy number data, and causal mechanisms of the five pathways require further study.
format Online
Article
Text
id pubmed-4218657
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-42186572014-12-01 TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile Huang, Yen-Tsung Hsu, Thomas Christiani, David C Cancer Inform Original Research The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of X(2) distributions that can be obtained using permutation with scaled X(2) approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed a genome-wide copy number data from 264 patients of non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (<2.8 × 10(−5)), including the PTEN pathway (7.8 × 10(−7)), the gene set up-regulated under heat shock (3.6 × 10(−6)), the gene sets involved in the immune profile for rejection of kidney transplantation (9.2 × 10(−6)) and for transcriptional control of leukocytes (2.2 × 10(−5)), and the ganglioside biosynthesis pathway (2.7 × 10(−5)). In conclusion, we present a new method for pathway analyses of copy number data, and causal mechanisms of the five pathways require further study. Libertas Academica 2014-10-15 /pmc/articles/PMC4218657/ /pubmed/25452685 http://dx.doi.org/10.4137/CIN.S13978 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Original Research
Huang, Yen-Tsung
Hsu, Thomas
Christiani, David C
TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title_full TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title_fullStr TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title_full_unstemmed TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title_short TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile
title_sort tegs-cn: a statistical method for pathway analysis of genome-wide copy number profile
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4218657/
https://www.ncbi.nlm.nih.gov/pubmed/25452685
http://dx.doi.org/10.4137/CIN.S13978
work_keys_str_mv AT huangyentsung tegscnastatisticalmethodforpathwayanalysisofgenomewidecopynumberprofile
AT hsuthomas tegscnastatisticalmethodforpathwayanalysisofgenomewidecopynumberprofile
AT christianidavidc tegscnastatisticalmethodforpathwayanalysisofgenomewidecopynumberprofile