Cargando…
Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies
BACKGROUND: Bladder cancer is common disease with a complex etiology that is likely due to many different genetic and environmental factors. The goal of this study was to embrace this complexity using a bioinformatics analysis pipeline designed to use machine learning to measure synergistic interact...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5154053/ https://www.ncbi.nlm.nih.gov/pubmed/27999618 http://dx.doi.org/10.1186/s13040-016-0119-z |
_version_ | 1782474807493787648 |
---|---|
author | Cheng, Samantha Andrew, Angeline S. Andrews, Peter C. Moore, Jason H. |
author_facet | Cheng, Samantha Andrew, Angeline S. Andrews, Peter C. Moore, Jason H. |
author_sort | Cheng, Samantha |
collection | PubMed |
description | BACKGROUND: Bladder cancer is common disease with a complex etiology that is likely due to many different genetic and environmental factors. The goal of this study was to embrace this complexity using a bioinformatics analysis pipeline designed to use machine learning to measure synergistic interactions between single nucleotide polymorphisms (SNPs) in two genome-wide association studies (GWAS) and then to assess their enrichment within functional groups defined by Gene Ontology. The significance of the results was evaluated using permutation testing and those results that replicated between the two GWAS data sets were reported. RESULTS: In the first step of our bioinformatics pipeline, we estimated the pairwise synergistic effects of SNPs on bladder cancer risk in both GWAS data sets using Multifactor Dimensionality Reduction (MDR) machine learning method that is designed specifically for this purpose. Statistical significance was assessed using a 1000-fold permutation test. Each single SNP was assigned a p-value based on its strongest pairwise association. Each SNP was then mapped to one or more genes using a window of 500 kb upstream and downstream from each gene boundary. This window was chosen to capture as many regulatory variants as possible. Using Exploratory Visual Analysis (EVA), we then carried out a gene set enrichment analysis at the gene level to identify those genes with an overabundance of significant SNPs relative to the size of their mapped regions. Each gene was assigned to a biological functional group defined by Gene Ontology (GO). We next used EVA to evaluate the overabundance of significant genes in biological functional groups. Our study yielded one GO category, carboxy-lysase activity (GO:0016831), that was significant in analyses from both GWAS data sets. Interestingly, only the gamma-glutamyl carboxylase (GGCX) gene from this GO group was significant in both the detection and replication data, highlighting the complexity of the pathway-level effects on risk. The GGCX gene is expressed in the bladder, but has not been previously associated with bladder cancer in univariate GWAS. However, there is some experimental evidence that carboxy-lysase activity might play a role in cancer and that genes in this pathway should be explored as drug targets. This study provides a genetic basis for that observation. CONCLUSIONS: Our machine learning analysis of genetic associations in two GWAS for bladder cancer identified numerous associations with pairs of SNPs. Gene set enrichment analysis found aggregation of risk-associated SNPs in genes and significant genes in GO functional groups. This study supports a role for decarboxylase protein complexes in bladder cancer susceptibility. Previous research has implicated decarboxylases in bladder cancer etiology; however, the genes that we found to be significant in the detection and replication data are not known to have direct influence on bladder cancer, suggesting some novel hypotheses. This study highlights the need for a complex systems approach to the genetic and genomic analysis of common diseases such as cancer. |
format | Online Article Text |
id | pubmed-5154053 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51540532016-12-20 Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies Cheng, Samantha Andrew, Angeline S. Andrews, Peter C. Moore, Jason H. BioData Min Short Report BACKGROUND: Bladder cancer is common disease with a complex etiology that is likely due to many different genetic and environmental factors. The goal of this study was to embrace this complexity using a bioinformatics analysis pipeline designed to use machine learning to measure synergistic interactions between single nucleotide polymorphisms (SNPs) in two genome-wide association studies (GWAS) and then to assess their enrichment within functional groups defined by Gene Ontology. The significance of the results was evaluated using permutation testing and those results that replicated between the two GWAS data sets were reported. RESULTS: In the first step of our bioinformatics pipeline, we estimated the pairwise synergistic effects of SNPs on bladder cancer risk in both GWAS data sets using Multifactor Dimensionality Reduction (MDR) machine learning method that is designed specifically for this purpose. Statistical significance was assessed using a 1000-fold permutation test. Each single SNP was assigned a p-value based on its strongest pairwise association. Each SNP was then mapped to one or more genes using a window of 500 kb upstream and downstream from each gene boundary. This window was chosen to capture as many regulatory variants as possible. Using Exploratory Visual Analysis (EVA), we then carried out a gene set enrichment analysis at the gene level to identify those genes with an overabundance of significant SNPs relative to the size of their mapped regions. Each gene was assigned to a biological functional group defined by Gene Ontology (GO). We next used EVA to evaluate the overabundance of significant genes in biological functional groups. Our study yielded one GO category, carboxy-lysase activity (GO:0016831), that was significant in analyses from both GWAS data sets. Interestingly, only the gamma-glutamyl carboxylase (GGCX) gene from this GO group was significant in both the detection and replication data, highlighting the complexity of the pathway-level effects on risk. The GGCX gene is expressed in the bladder, but has not been previously associated with bladder cancer in univariate GWAS. However, there is some experimental evidence that carboxy-lysase activity might play a role in cancer and that genes in this pathway should be explored as drug targets. This study provides a genetic basis for that observation. CONCLUSIONS: Our machine learning analysis of genetic associations in two GWAS for bladder cancer identified numerous associations with pairs of SNPs. Gene set enrichment analysis found aggregation of risk-associated SNPs in genes and significant genes in GO functional groups. This study supports a role for decarboxylase protein complexes in bladder cancer susceptibility. Previous research has implicated decarboxylases in bladder cancer etiology; however, the genes that we found to be significant in the detection and replication data are not known to have direct influence on bladder cancer, suggesting some novel hypotheses. This study highlights the need for a complex systems approach to the genetic and genomic analysis of common diseases such as cancer. BioMed Central 2016-12-12 /pmc/articles/PMC5154053/ /pubmed/27999618 http://dx.doi.org/10.1186/s13040-016-0119-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Short Report Cheng, Samantha Andrew, Angeline S. Andrews, Peter C. Moore, Jason H. Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title | Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title_full | Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title_fullStr | Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title_full_unstemmed | Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title_short | Complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
title_sort | complex systems analysis of bladder cancer susceptibility reveals a role for decarboxylase activity in two genome-wide association studies |
topic | Short Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5154053/ https://www.ncbi.nlm.nih.gov/pubmed/27999618 http://dx.doi.org/10.1186/s13040-016-0119-z |
work_keys_str_mv | AT chengsamantha complexsystemsanalysisofbladdercancersusceptibilityrevealsarolefordecarboxylaseactivityintwogenomewideassociationstudies AT andrewangelines complexsystemsanalysisofbladdercancersusceptibilityrevealsarolefordecarboxylaseactivityintwogenomewideassociationstudies AT andrewspeterc complexsystemsanalysisofbladdercancersusceptibilityrevealsarolefordecarboxylaseactivityintwogenomewideassociationstudies AT moorejasonh complexsystemsanalysisofbladdercancersusceptibilityrevealsarolefordecarboxylaseactivityintwogenomewideassociationstudies |