Cargando…
Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies
BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648789/ https://www.ncbi.nlm.nih.gov/pubmed/19208173 http://dx.doi.org/10.1186/1471-2105-10-S1-S69 |
_version_ | 1782164988702490624 |
---|---|
author | Teber, Erdahl T Liu, Jason Y Ballouz, Sara Fatkin, Diane Wouters, Merridee A |
author_facet | Teber, Erdahl T Liu, Jason Y Ballouz, Sara Fatkin, Diane Wouters, Merridee A |
author_sort | Teber, Erdahl T |
collection | PubMed |
description | BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems to predict disease genes in intervals previously associated with type 2 diabetes by benchmarking their performance against genes implicated by recent genome-wide association studies. RESULTS: Using a search space of 9556 genes, all but one of the systems pruned the genome in favour of genes associated with moderate to highly significant SNPs. Of the 11 genes associated with highly significant SNPs identified by the genome-wide association studies, eight were flagged as likely candidates by at least one of the prediction systems. A list of candidates produced by a previous consensus approach did not match any of the genes implicated by 706 moderate to highly significant SNPs flagged by the genome-wide association studies. We prioritized genes associated with medium significance SNPs. CONCLUSION: The study appraises the relative success of several candidate gene prediction systems against independent genetic data. Even when confronted with challengingly large intervals, the candidate gene prediction systems can successfully select likely disease genes. Furthermore, they can be used to filter statistically less-well-supported genetic data to select more likely candidates. We suggest consensus approaches fail because they penalize novel predictions made from independent underlying databases. To realize their full potential further work needs to be done on prioritization and annotation of genes. |
format | Text |
id | pubmed-2648789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26487892009-03-03 Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies Teber, Erdahl T Liu, Jason Y Ballouz, Sara Fatkin, Diane Wouters, Merridee A BMC Bioinformatics Research BACKGROUND: Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems to predict disease genes in intervals previously associated with type 2 diabetes by benchmarking their performance against genes implicated by recent genome-wide association studies. RESULTS: Using a search space of 9556 genes, all but one of the systems pruned the genome in favour of genes associated with moderate to highly significant SNPs. Of the 11 genes associated with highly significant SNPs identified by the genome-wide association studies, eight were flagged as likely candidates by at least one of the prediction systems. A list of candidates produced by a previous consensus approach did not match any of the genes implicated by 706 moderate to highly significant SNPs flagged by the genome-wide association studies. We prioritized genes associated with medium significance SNPs. CONCLUSION: The study appraises the relative success of several candidate gene prediction systems against independent genetic data. Even when confronted with challengingly large intervals, the candidate gene prediction systems can successfully select likely disease genes. Furthermore, they can be used to filter statistically less-well-supported genetic data to select more likely candidates. We suggest consensus approaches fail because they penalize novel predictions made from independent underlying databases. To realize their full potential further work needs to be done on prioritization and annotation of genes. BioMed Central 2009-01-30 /pmc/articles/PMC2648789/ /pubmed/19208173 http://dx.doi.org/10.1186/1471-2105-10-S1-S69 Text en Copyright © 2009 Teber et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Teber, Erdahl T Liu, Jason Y Ballouz, Sara Fatkin, Diane Wouters, Merridee A Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title | Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title_full | Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title_fullStr | Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title_full_unstemmed | Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title_short | Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
title_sort | comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648789/ https://www.ncbi.nlm.nih.gov/pubmed/19208173 http://dx.doi.org/10.1186/1471-2105-10-S1-S69 |
work_keys_str_mv | AT tebererdahlt comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies AT liujasony comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies AT ballouzsara comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies AT fatkindiane comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies AT woutersmerrideea comparisonofautomatedcandidategenepredictionsystemsusinggenesimplicatedintype2diabetesbygenomewideassociationstudies |