Cargando…
Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes ar...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143442/ http://dx.doi.org/10.1186/s13326-016-0054-4 |
_version_ | 1782472941405995008 |
---|---|
author | Bello, Susan M. Eppig, Janan T. |
author_facet | Bello, Susan M. Eppig, Janan T. |
author_sort | Bello, Susan M. |
collection | PubMed |
description | BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes are caused by a mutation in a particular gene can be a laborious and time-consuming process. METHODS: At Mouse Genome Informatics (MGI, www.informatics.jax.org), we have developed a gene annotation derivation algorithm that computes gene-to-phenotype and gene-to-disease annotations from our existing corpus of annotations to genotypes. This algorithm differentiates between simple genotypes with causative mutations in a single gene and more complex genotypes where mutations in multiple genes may contribute to the phenotype. As part of the process, alleles functioning as tools (e.g., reporters, recombinases) are filtered out. RESULTS: Using this algorithm derived gene-to-phenotype and gene-to-disease annotations were created for 16,000 and 2100 mouse markers, respectively, starting from over 57,900 and 4800 genotypes with at least one phenotype and disease annotation, respectively. CONCLUSIONS: Implementation of this algorithm provides consistent and accurate gene annotations across MGI and provides a vital time-savings relative to manual annotation by curators. |
format | Online Article Text |
id | pubmed-5143442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51434422016-12-15 Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions Bello, Susan M. Eppig, Janan T. J Biomed Semantics Research BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes are caused by a mutation in a particular gene can be a laborious and time-consuming process. METHODS: At Mouse Genome Informatics (MGI, www.informatics.jax.org), we have developed a gene annotation derivation algorithm that computes gene-to-phenotype and gene-to-disease annotations from our existing corpus of annotations to genotypes. This algorithm differentiates between simple genotypes with causative mutations in a single gene and more complex genotypes where mutations in multiple genes may contribute to the phenotype. As part of the process, alleles functioning as tools (e.g., reporters, recombinases) are filtered out. RESULTS: Using this algorithm derived gene-to-phenotype and gene-to-disease annotations were created for 16,000 and 2100 mouse markers, respectively, starting from over 57,900 and 4800 genotypes with at least one phenotype and disease annotation, respectively. CONCLUSIONS: Implementation of this algorithm provides consistent and accurate gene annotations across MGI and provides a vital time-savings relative to manual annotation by curators. BioMed Central 2016-05-20 /pmc/articles/PMC5143442/ http://dx.doi.org/10.1186/s13326-016-0054-4 Text en © Bello and Eppig. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Bello, Susan M. Eppig, Janan T. Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title | Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title_full | Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title_fullStr | Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title_full_unstemmed | Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title_short | Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions |
title_sort | inferring gene-to-phenotype and gene-to-disease relationships at mouse genome informatics: challenges and solutions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143442/ http://dx.doi.org/10.1186/s13326-016-0054-4 |
work_keys_str_mv | AT bellosusanm inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions AT eppigjanant inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions AT inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions |