Cargando…
Génie: literature-based gene prioritization at multi genomic scale
Biomedical literature is traditionally used as a way to inform scientists of the relevance of genes in relation to a research topic. However many genes, especially from poorly studied organisms, are not discussed in the literature. Moreover, a manual and comprehensive summarization of the literature...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125729/ https://www.ncbi.nlm.nih.gov/pubmed/21609954 http://dx.doi.org/10.1093/nar/gkr246 |
_version_ | 1782207244981501952 |
---|---|
author | Fontaine, Jean-Fred Priller, Florian Barbosa-Silva, Adriano Andrade-Navarro, Miguel A. |
author_facet | Fontaine, Jean-Fred Priller, Florian Barbosa-Silva, Adriano Andrade-Navarro, Miguel A. |
author_sort | Fontaine, Jean-Fred |
collection | PubMed |
description | Biomedical literature is traditionally used as a way to inform scientists of the relevance of genes in relation to a research topic. However many genes, especially from poorly studied organisms, are not discussed in the literature. Moreover, a manual and comprehensive summarization of the literature attached to the genes of an organism is in general impossible due to the high number of genes and abstracts involved. We introduce the novel Génie algorithm that overcomes these problems by evaluating the literature attached to all genes in a genome and to their orthologs according to a selected topic. Génie showed high precision (up to 100%) and the best performance in comparison to other algorithms in most of the benchmarks, especially when high sensitivity was required. Moreover, the prioritization of zebrafish genes involved in heart development, using human and mouse orthologs, showed high enrichment in differentially expressed genes from microarray experiments. The Génie web server supports hundreds of species, millions of genes and offers novel functionalities. Common run times below a minute, even when analyzing the human genome with hundreds of thousands of literature records, allows the use of Génie in routine lab work. Availability: http://cbdm.mdc-berlin.de/tools/genie/. |
format | Online Article Text |
id | pubmed-3125729 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31257292011-07-05 Génie: literature-based gene prioritization at multi genomic scale Fontaine, Jean-Fred Priller, Florian Barbosa-Silva, Adriano Andrade-Navarro, Miguel A. Nucleic Acids Res Articles Biomedical literature is traditionally used as a way to inform scientists of the relevance of genes in relation to a research topic. However many genes, especially from poorly studied organisms, are not discussed in the literature. Moreover, a manual and comprehensive summarization of the literature attached to the genes of an organism is in general impossible due to the high number of genes and abstracts involved. We introduce the novel Génie algorithm that overcomes these problems by evaluating the literature attached to all genes in a genome and to their orthologs according to a selected topic. Génie showed high precision (up to 100%) and the best performance in comparison to other algorithms in most of the benchmarks, especially when high sensitivity was required. Moreover, the prioritization of zebrafish genes involved in heart development, using human and mouse orthologs, showed high enrichment in differentially expressed genes from microarray experiments. The Génie web server supports hundreds of species, millions of genes and offers novel functionalities. Common run times below a minute, even when analyzing the human genome with hundreds of thousands of literature records, allows the use of Génie in routine lab work. Availability: http://cbdm.mdc-berlin.de/tools/genie/. Oxford University Press 2011-07-01 2011-05-23 /pmc/articles/PMC3125729/ /pubmed/21609954 http://dx.doi.org/10.1093/nar/gkr246 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Fontaine, Jean-Fred Priller, Florian Barbosa-Silva, Adriano Andrade-Navarro, Miguel A. Génie: literature-based gene prioritization at multi genomic scale |
title | Génie: literature-based gene prioritization at multi genomic scale |
title_full | Génie: literature-based gene prioritization at multi genomic scale |
title_fullStr | Génie: literature-based gene prioritization at multi genomic scale |
title_full_unstemmed | Génie: literature-based gene prioritization at multi genomic scale |
title_short | Génie: literature-based gene prioritization at multi genomic scale |
title_sort | génie: literature-based gene prioritization at multi genomic scale |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125729/ https://www.ncbi.nlm.nih.gov/pubmed/21609954 http://dx.doi.org/10.1093/nar/gkr246 |
work_keys_str_mv | AT fontainejeanfred genieliteraturebasedgeneprioritizationatmultigenomicscale AT prillerflorian genieliteraturebasedgeneprioritizationatmultigenomicscale AT barbosasilvaadriano genieliteraturebasedgeneprioritizationatmultigenomicscale AT andradenavarromiguela genieliteraturebasedgeneprioritizationatmultigenomicscale |