Cargando…
FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibu...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646274/ https://www.ncbi.nlm.nih.gov/pubmed/19061490 http://dx.doi.org/10.1186/gb-2008-9-12-r170 |
_version_ | 1782164835204595712 |
---|---|
author | Chen, Rong Morgan, Alex A Dudley, Joel Deshpande, Tarangini Li, Li Kodama, Keiichi Chiang, Annie P Butte, Atul J |
author_facet | Chen, Rong Morgan, Alex A Dudley, Joel Deshpande, Tarangini Li, Li Kodama, Keiichi Chiang, Annie P Butte, Atul J |
author_sort | Chen, Rong |
collection | PubMed |
description | BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs. RESULTS: We analyzed all human microarray studies from the Gene Expression Omnibus, and calculated the observed frequency of differential expression, which we called differential expression ratio, for every human gene. Analysis conducted in a comprehensive list of curated disease genes revealed a positive association between differential expression ratio values and the likelihood of harboring disease-associated variants. By considering highly differentially expressed genes, we were able to rediscover disease genes with 79% specificity and 37% sensitivity. We successfully distinguished true disease genes from false positives in multiple GWASs for multiple diseases. We then derived a list of functionally interpolating SNPs (fitSNPs) to analyze the top seven loci of Wellcome Trust Case Control Consortium type 1 diabetes mellitus GWASs, rediscovered all type 1 diabetes mellitus genes, and predicted a novel gene (KIAA1109) for an unexplained locus 4q27. We suggest that fitSNPs would work equally well for both Mendelian and complex diseases (being more effective for cancer) and proposed candidate genes to sequence for their association with 597 syndromes with unknown molecular basis. CONCLUSIONS: Our study demonstrates that highly differentially expressed genes are more likely to harbor disease-associated DNA variants. FitSNPs can serve as an effective tool to systematically prioritize candidate SNPs from GWASs. |
format | Text |
id | pubmed-2646274 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26462742009-02-23 FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease Chen, Rong Morgan, Alex A Dudley, Joel Deshpande, Tarangini Li, Li Kodama, Keiichi Chiang, Annie P Butte, Atul J Genome Biol Research BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs. RESULTS: We analyzed all human microarray studies from the Gene Expression Omnibus, and calculated the observed frequency of differential expression, which we called differential expression ratio, for every human gene. Analysis conducted in a comprehensive list of curated disease genes revealed a positive association between differential expression ratio values and the likelihood of harboring disease-associated variants. By considering highly differentially expressed genes, we were able to rediscover disease genes with 79% specificity and 37% sensitivity. We successfully distinguished true disease genes from false positives in multiple GWASs for multiple diseases. We then derived a list of functionally interpolating SNPs (fitSNPs) to analyze the top seven loci of Wellcome Trust Case Control Consortium type 1 diabetes mellitus GWASs, rediscovered all type 1 diabetes mellitus genes, and predicted a novel gene (KIAA1109) for an unexplained locus 4q27. We suggest that fitSNPs would work equally well for both Mendelian and complex diseases (being more effective for cancer) and proposed candidate genes to sequence for their association with 597 syndromes with unknown molecular basis. CONCLUSIONS: Our study demonstrates that highly differentially expressed genes are more likely to harbor disease-associated DNA variants. FitSNPs can serve as an effective tool to systematically prioritize candidate SNPs from GWASs. BioMed Central 2008 2008-12-05 /pmc/articles/PMC2646274/ /pubmed/19061490 http://dx.doi.org/10.1186/gb-2008-9-12-r170 Text en Copyright © 2008 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Chen, Rong Morgan, Alex A Dudley, Joel Deshpande, Tarangini Li, Li Kodama, Keiichi Chiang, Annie P Butte, Atul J FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title | FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title_full | FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title_fullStr | FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title_full_unstemmed | FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title_short | FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease |
title_sort | fitsnps: highly differentially expressed genes are more likely to have variants associated with disease |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646274/ https://www.ncbi.nlm.nih.gov/pubmed/19061490 http://dx.doi.org/10.1186/gb-2008-9-12-r170 |
work_keys_str_mv | AT chenrong fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT morganalexa fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT dudleyjoel fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT deshpandetarangini fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT lili fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT kodamakeiichi fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT chianganniep fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease AT butteatulj fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease |