Cargando…

Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies

BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems...

Descripción completa

Detalles Bibliográficos
Autores principales: Teber, Erdahl T, Liu, Jason Y, Ballouz, Sara, Fatkin, Diane, Wouters, Merridee A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648789/
https://www.ncbi.nlm.nih.gov/pubmed/19208173
http://dx.doi.org/10.1186/1471-2105-10-S1-S69
_version_ 1782164988702490624
author Teber, Erdahl T
Liu, Jason Y
Ballouz, Sara
Fatkin, Diane
Wouters, Merridee A
author_facet Teber, Erdahl T
Liu, Jason Y
Ballouz, Sara
Fatkin, Diane
Wouters, Merridee A
author_sort Teber, Erdahl T
collection PubMed
description BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems to predict disease genes in intervals previously associated with type 2 diabetes by benchmarking their performance against genes implicated by recent genome-wide association studies. RESULTS: Using a search space of 9556 genes, all but one of the systems pruned the genome in favour of genes associated with moderate to highly significant SNPs. Of the 11 genes associated with highly significant SNPs identified by the genome-wide association studies, eight were flagged as likely candidates by at least one of the prediction systems. A list of candidates produced by a previous consensus approach did not match any of the genes implicated by 706 moderate to highly significant SNPs flagged by the genome-wide association studies. We prioritized genes associated with medium significance SNPs. CONCLUSION: The study appraises the relative success of several candidate gene prediction systems against independent genetic data. Even when confronted with challengingly large intervals, the candidate gene prediction systems can successfully select likely disease genes. Furthermore, they can be used to filter statistically less-well-supported genetic data to select more likely candidates. We suggest consensus approaches fail because they penalize novel predictions made from independent underlying databases. To realize their full potential further work needs to be done on prioritization and annotation of genes.
format Text
id pubmed-2648789
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487892009-03-03 Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies Teber, Erdahl T Liu, Jason Y Ballouz, Sara Fatkin, Diane Wouters, Merridee A BMC Bioinformatics Research BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems to predict disease genes in intervals previously associated with type 2 diabetes by benchmarking their performance against genes implicated by recent genome-wide association studies. RESULTS: Using a search space of 9556 genes, all but one of the systems pruned the genome in favour of genes associated with moderate to highly significant SNPs. Of the 11 genes associated with highly significant SNPs identified by the genome-wide association studies, eight were flagged as likely candidates by at least one of the prediction systems. A list of candidates produced by a previous consensus approach did not match any of the genes implicated by 706 moderate to highly significant SNPs flagged by the genome-wide association studies. We prioritized genes associated with medium significance SNPs. CONCLUSION: The study appraises the relative success of several candidate gene prediction systems against independent genetic data. Even when confronted with challengingly large intervals, the candidate gene prediction systems can successfully select likely disease genes. Furthermore, they can be used to filter statistically less-well-supported genetic data to select more likely candidates. We suggest consensus approaches fail because they penalize novel predictions made from independent underlying databases. To realize their full potential further work needs to be done on prioritization and annotation of genes. BioMed Central 2009-01-30 /pmc/articles/PMC2648789/ /pubmed/19208173 http://dx.doi.org/10.1186/1471-2105-10-S1-S69 Text en Copyright © 2009 Teber et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Teber, Erdahl T
Liu, Jason Y
Ballouz, Sara
Fatkin, Diane
Wouters, Merridee A
Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title_full Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title_fullStr Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title_full_unstemmed Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title_short Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
title_sort comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648789/
https://www.ncbi.nlm.nih.gov/pubmed/19208173
http://dx.doi.org/10.1186/1471-2105-10-S1-S69
work_keys_str_mv AT tebererdahlt comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies
AT liujasony comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies
AT ballouzsara comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies
AT fatkindiane comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies
AT woutersmerrideea comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies