Cargando…

FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease

BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibu...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Rong, Morgan, Alex A, Dudley, Joel, Deshpande, Tarangini, Li, Li, Kodama, Keiichi, Chiang, Annie P, Butte, Atul J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646274/
https://www.ncbi.nlm.nih.gov/pubmed/19061490
http://dx.doi.org/10.1186/gb-2008-9-12-r170
_version_ 1782164835204595712
author Chen, Rong
Morgan, Alex A
Dudley, Joel
Deshpande, Tarangini
Li, Li
Kodama, Keiichi
Chiang, Annie P
Butte, Atul J
author_facet Chen, Rong
Morgan, Alex A
Dudley, Joel
Deshpande, Tarangini
Li, Li
Kodama, Keiichi
Chiang, Annie P
Butte, Atul J
author_sort Chen, Rong
collection PubMed
description BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs. RESULTS: We analyzed all human microarray studies from the Gene Expression Omnibus, and calculated the observed frequency of differential expression, which we called differential expression ratio, for every human gene. Analysis conducted in a comprehensive list of curated disease genes revealed a positive association between differential expression ratio values and the likelihood of harboring disease-associated variants. By considering highly differentially expressed genes, we were able to rediscover disease genes with 79% specificity and 37% sensitivity. We successfully distinguished true disease genes from false positives in multiple GWASs for multiple diseases. We then derived a list of functionally interpolating SNPs (fitSNPs) to analyze the top seven loci of Wellcome Trust Case Control Consortium type 1 diabetes mellitus GWASs, rediscovered all type 1 diabetes mellitus genes, and predicted a novel gene (KIAA1109) for an unexplained locus 4q27. We suggest that fitSNPs would work equally well for both Mendelian and complex diseases (being more effective for cancer) and proposed candidate genes to sequence for their association with 597 syndromes with unknown molecular basis. CONCLUSIONS: Our study demonstrates that highly differentially expressed genes are more likely to harbor disease-associated DNA variants. FitSNPs can serve as an effective tool to systematically prioritize candidate SNPs from GWASs.
format Text
id pubmed-2646274
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26462742009-02-23 FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease Chen, Rong Morgan, Alex A Dudley, Joel Deshpande, Tarangini Li, Li Kodama, Keiichi Chiang, Annie P Butte, Atul J Genome Biol Research BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs. RESULTS: We analyzed all human microarray studies from the Gene Expression Omnibus, and calculated the observed frequency of differential expression, which we called differential expression ratio, for every human gene. Analysis conducted in a comprehensive list of curated disease genes revealed a positive association between differential expression ratio values and the likelihood of harboring disease-associated variants. By considering highly differentially expressed genes, we were able to rediscover disease genes with 79% specificity and 37% sensitivity. We successfully distinguished true disease genes from false positives in multiple GWASs for multiple diseases. We then derived a list of functionally interpolating SNPs (fitSNPs) to analyze the top seven loci of Wellcome Trust Case Control Consortium type 1 diabetes mellitus GWASs, rediscovered all type 1 diabetes mellitus genes, and predicted a novel gene (KIAA1109) for an unexplained locus 4q27. We suggest that fitSNPs would work equally well for both Mendelian and complex diseases (being more effective for cancer) and proposed candidate genes to sequence for their association with 597 syndromes with unknown molecular basis. CONCLUSIONS: Our study demonstrates that highly differentially expressed genes are more likely to harbor disease-associated DNA variants. FitSNPs can serve as an effective tool to systematically prioritize candidate SNPs from GWASs. BioMed Central 2008 2008-12-05 /pmc/articles/PMC2646274/ /pubmed/19061490 http://dx.doi.org/10.1186/gb-2008-9-12-r170 Text en Copyright © 2008 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Chen, Rong
Morgan, Alex A
Dudley, Joel
Deshpande, Tarangini
Li, Li
Kodama, Keiichi
Chiang, Annie P
Butte, Atul J
FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title_full FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title_fullStr FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title_full_unstemmed FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title_short FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease
title_sort fitsnps: highly differentially expressed genes are more likely to have variants associated with disease
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646274/
https://www.ncbi.nlm.nih.gov/pubmed/19061490
http://dx.doi.org/10.1186/gb-2008-9-12-r170
work_keys_str_mv AT chenrong fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT morganalexa fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT dudleyjoel fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT deshpandetarangini fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT lili fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT kodamakeiichi fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT chianganniep fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease
AT butteatulj fitsnpshighlydifferentiallyexpressedgenesaremorelikelytohavevariantsassociatedwithdisease