Cargando…

Beegle: from literature mining to disease-gene discovery

Disease-gene identification is a challenging process that has multiple applications within functional genomics and personalized medicine. Typically, this process involves both finding genes known to be associated with the disease (through literature search) and carrying out preliminary experiments o...

Descripción completa

Detalles Bibliográficos
Autores principales: ElShal, Sarah, Tranchevent, Léon-Charles, Sifrim, Alejandro, Ardeshirdavani, Amin, Davis, Jesse, Moreau, Yves
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4737179/
https://www.ncbi.nlm.nih.gov/pubmed/26384564
http://dx.doi.org/10.1093/nar/gkv905
_version_ 1782413434260815872
author ElShal, Sarah
Tranchevent, Léon-Charles
Sifrim, Alejandro
Ardeshirdavani, Amin
Davis, Jesse
Moreau, Yves
author_facet ElShal, Sarah
Tranchevent, Léon-Charles
Sifrim, Alejandro
Ardeshirdavani, Amin
Davis, Jesse
Moreau, Yves
author_sort ElShal, Sarah
collection PubMed
description Disease-gene identification is a challenging process that has multiple applications within functional genomics and personalized medicine. Typically, this process involves both finding genes known to be associated with the disease (through literature search) and carrying out preliminary experiments or screens (e.g. linkage or association studies, copy number analyses, expression profiling) to determine a set of promising candidates for experimental validation. This requires extensive time and monetary resources. We describe Beegle, an online search and discovery engine that attempts to simplify this process by automating the typical approaches. It starts by mining the literature to quickly extract a set of genes known to be linked with a given query, then it integrates the learning methodology of Endeavour (a gene prioritization tool) to train a genomic model and rank a set of candidate genes to generate novel hypotheses. In a realistic evaluation setup, Beegle has an average recall of 84% in the top 100 returned genes as a search engine, which improves the discovery engine by 12.6% in the top 5% prioritized genes. Beegle is publicly available at http://beegle.esat.kuleuven.be/.
format Online
Article
Text
id pubmed-4737179
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47371792016-02-03 Beegle: from literature mining to disease-gene discovery ElShal, Sarah Tranchevent, Léon-Charles Sifrim, Alejandro Ardeshirdavani, Amin Davis, Jesse Moreau, Yves Nucleic Acids Res Methods Online Disease-gene identification is a challenging process that has multiple applications within functional genomics and personalized medicine. Typically, this process involves both finding genes known to be associated with the disease (through literature search) and carrying out preliminary experiments or screens (e.g. linkage or association studies, copy number analyses, expression profiling) to determine a set of promising candidates for experimental validation. This requires extensive time and monetary resources. We describe Beegle, an online search and discovery engine that attempts to simplify this process by automating the typical approaches. It starts by mining the literature to quickly extract a set of genes known to be linked with a given query, then it integrates the learning methodology of Endeavour (a gene prioritization tool) to train a genomic model and rank a set of candidate genes to generate novel hypotheses. In a realistic evaluation setup, Beegle has an average recall of 84% in the top 100 returned genes as a search engine, which improves the discovery engine by 12.6% in the top 5% prioritized genes. Beegle is publicly available at http://beegle.esat.kuleuven.be/. Oxford University Press 2016-01-29 2015-09-17 /pmc/articles/PMC4737179/ /pubmed/26384564 http://dx.doi.org/10.1093/nar/gkv905 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
ElShal, Sarah
Tranchevent, Léon-Charles
Sifrim, Alejandro
Ardeshirdavani, Amin
Davis, Jesse
Moreau, Yves
Beegle: from literature mining to disease-gene discovery
title Beegle: from literature mining to disease-gene discovery
title_full Beegle: from literature mining to disease-gene discovery
title_fullStr Beegle: from literature mining to disease-gene discovery
title_full_unstemmed Beegle: from literature mining to disease-gene discovery
title_short Beegle: from literature mining to disease-gene discovery
title_sort beegle: from literature mining to disease-gene discovery
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4737179/
https://www.ncbi.nlm.nih.gov/pubmed/26384564
http://dx.doi.org/10.1093/nar/gkv905
work_keys_str_mv AT elshalsarah beeglefromliteratureminingtodiseasegenediscovery
AT trancheventleoncharles beeglefromliteratureminingtodiseasegenediscovery
AT sifrimalejandro beeglefromliteratureminingtodiseasegenediscovery
AT ardeshirdavaniamin beeglefromliteratureminingtodiseasegenediscovery
AT davisjesse beeglefromliteratureminingtodiseasegenediscovery
AT moreauyves beeglefromliteratureminingtodiseasegenediscovery