Cargando…

The influence of disease categories on gene candidate predictions from model organism phenotypes

BACKGROUND: The molecular etiology is still to be identified for about half of the currently described Mendelian diseases in humans, thereby hindering efforts to find treatments or preventive measures. Advances, such as new sequencing technologies, have led to increasing amounts of data becoming ava...

Descripción completa

Detalles Bibliográficos
Autores principales: Oellrich, Anika, Koehler, Sebastian, Washington, Nicole, Mungall, Chris, Lewis, Suzanna, Haendel, Melissa, Robinson, Peter N, Smedley, Damian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108905/
https://www.ncbi.nlm.nih.gov/pubmed/25093073
http://dx.doi.org/10.1186/2041-1480-5-S1-S4
_version_ 1782327807297191936
author Oellrich, Anika
Koehler, Sebastian
Washington, Nicole
Mungall, Chris
Lewis, Suzanna
Haendel, Melissa
Robinson, Peter N
Smedley, Damian
author_facet Oellrich, Anika
Koehler, Sebastian
Washington, Nicole
Mungall, Chris
Lewis, Suzanna
Haendel, Melissa
Robinson, Peter N
Smedley, Damian
author_sort Oellrich, Anika
collection PubMed
description BACKGROUND: The molecular etiology is still to be identified for about half of the currently described Mendelian diseases in humans, thereby hindering efforts to find treatments or preventive measures. Advances, such as new sequencing technologies, have led to increasing amounts of data becoming available with which to address the problem of identifying disease genes. Therefore, automated methods are needed that reliably predict disease gene candidates based on available data. We have recently developed Exomiser as a tool for identifying causative variants from exome analysis results by filtering and prioritising using a number of criteria including the phenotype similarity between the disease and mouse mutants involving the gene candidates. Initial investigations revealed a variation in performance for different medical categories of disease, due in part to a varying contribution of the phenotype scoring component. RESULTS: In this study, we further analyse the performance of our cross-species phenotype matching algorithm, and examine in more detail the reasons why disease gene filtering based on phenotype data works better for certain disease categories than others. We found that in addition to misleading phenotype alignments between species, some disease categories are still more amenable to automated predictions than others, and that this often ties in with community perceptions on how well the organism works as model. CONCLUSIONS: In conclusion, our automated disease gene candidate predictions are highly dependent on the organism used for the predictions and the disease category being studied. Future work on computational disease gene prediction using phenotype data would benefit from methods that take into account the disease category and the source of model organism data.
format Online
Article
Text
id pubmed-4108905
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41089052014-08-04 The influence of disease categories on gene candidate predictions from model organism phenotypes Oellrich, Anika Koehler, Sebastian Washington, Nicole Mungall, Chris Lewis, Suzanna Haendel, Melissa Robinson, Peter N Smedley, Damian J Biomed Semantics Proceedings BACKGROUND: The molecular etiology is still to be identified for about half of the currently described Mendelian diseases in humans, thereby hindering efforts to find treatments or preventive measures. Advances, such as new sequencing technologies, have led to increasing amounts of data becoming available with which to address the problem of identifying disease genes. Therefore, automated methods are needed that reliably predict disease gene candidates based on available data. We have recently developed Exomiser as a tool for identifying causative variants from exome analysis results by filtering and prioritising using a number of criteria including the phenotype similarity between the disease and mouse mutants involving the gene candidates. Initial investigations revealed a variation in performance for different medical categories of disease, due in part to a varying contribution of the phenotype scoring component. RESULTS: In this study, we further analyse the performance of our cross-species phenotype matching algorithm, and examine in more detail the reasons why disease gene filtering based on phenotype data works better for certain disease categories than others. We found that in addition to misleading phenotype alignments between species, some disease categories are still more amenable to automated predictions than others, and that this often ties in with community perceptions on how well the organism works as model. CONCLUSIONS: In conclusion, our automated disease gene candidate predictions are highly dependent on the organism used for the predictions and the disease category being studied. Future work on computational disease gene prediction using phenotype data would benefit from methods that take into account the disease category and the source of model organism data. BioMed Central 2014-06-03 /pmc/articles/PMC4108905/ /pubmed/25093073 http://dx.doi.org/10.1186/2041-1480-5-S1-S4 Text en Copyright © 2014 Oellrich et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Oellrich, Anika
Koehler, Sebastian
Washington, Nicole
Mungall, Chris
Lewis, Suzanna
Haendel, Melissa
Robinson, Peter N
Smedley, Damian
The influence of disease categories on gene candidate predictions from model organism phenotypes
title The influence of disease categories on gene candidate predictions from model organism phenotypes
title_full The influence of disease categories on gene candidate predictions from model organism phenotypes
title_fullStr The influence of disease categories on gene candidate predictions from model organism phenotypes
title_full_unstemmed The influence of disease categories on gene candidate predictions from model organism phenotypes
title_short The influence of disease categories on gene candidate predictions from model organism phenotypes
title_sort influence of disease categories on gene candidate predictions from model organism phenotypes
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108905/
https://www.ncbi.nlm.nih.gov/pubmed/25093073
http://dx.doi.org/10.1186/2041-1480-5-S1-S4
work_keys_str_mv AT oellrichanika theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT koehlersebastian theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT washingtonnicole theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT mungallchris theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT lewissuzanna theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT haendelmelissa theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT robinsonpetern theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT smedleydamian theinfluenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT oellrichanika influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT koehlersebastian influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT washingtonnicole influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT mungallchris influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT lewissuzanna influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT haendelmelissa influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT robinsonpetern influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes
AT smedleydamian influenceofdiseasecategoriesongenecandidatepredictionsfrommodelorganismphenotypes