Cargando…

Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions

BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Bello, Susan M., Eppig, Janan T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143442/
http://dx.doi.org/10.1186/s13326-016-0054-4
_version_ 1782472941405995008
author Bello, Susan M.
Eppig, Janan T.
author_facet Bello, Susan M.
Eppig, Janan T.
author_sort Bello, Susan M.
collection PubMed
description BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes are caused by a mutation in a particular gene can be a laborious and time-consuming process. METHODS: At Mouse Genome Informatics (MGI, www.informatics.jax.org), we have developed a gene annotation derivation algorithm that computes gene-to-phenotype and gene-to-disease annotations from our existing corpus of annotations to genotypes. This algorithm differentiates between simple genotypes with causative mutations in a single gene and more complex genotypes where mutations in multiple genes may contribute to the phenotype. As part of the process, alleles functioning as tools (e.g., reporters, recombinases) are filtered out. RESULTS: Using this algorithm derived gene-to-phenotype and gene-to-disease annotations were created for 16,000 and 2100 mouse markers, respectively, starting from over 57,900 and 4800 genotypes with at least one phenotype and disease annotation, respectively. CONCLUSIONS: Implementation of this algorithm provides consistent and accurate gene annotations across MGI and provides a vital time-savings relative to manual annotation by curators.
format Online
Article
Text
id pubmed-5143442
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51434422016-12-15 Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions Bello, Susan M. Eppig, Janan T. J Biomed Semantics Research BACKGROUND: Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes are caused by a mutation in a particular gene can be a laborious and time-consuming process. METHODS: At Mouse Genome Informatics (MGI, www.informatics.jax.org), we have developed a gene annotation derivation algorithm that computes gene-to-phenotype and gene-to-disease annotations from our existing corpus of annotations to genotypes. This algorithm differentiates between simple genotypes with causative mutations in a single gene and more complex genotypes where mutations in multiple genes may contribute to the phenotype. As part of the process, alleles functioning as tools (e.g., reporters, recombinases) are filtered out. RESULTS: Using this algorithm derived gene-to-phenotype and gene-to-disease annotations were created for 16,000 and 2100 mouse markers, respectively, starting from over 57,900 and 4800 genotypes with at least one phenotype and disease annotation, respectively. CONCLUSIONS: Implementation of this algorithm provides consistent and accurate gene annotations across MGI and provides a vital time-savings relative to manual annotation by curators. BioMed Central 2016-05-20 /pmc/articles/PMC5143442/ http://dx.doi.org/10.1186/s13326-016-0054-4 Text en © Bello and Eppig. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bello, Susan M.
Eppig, Janan T.
Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title_full Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title_fullStr Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title_full_unstemmed Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title_short Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
title_sort inferring gene-to-phenotype and gene-to-disease relationships at mouse genome informatics: challenges and solutions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143442/
http://dx.doi.org/10.1186/s13326-016-0054-4
work_keys_str_mv AT bellosusanm inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions
AT eppigjanant inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions
AT inferringgenetophenotypeandgenetodiseaserelationshipsatmousegenomeinformaticschallengesandsolutions