Cargando…

Unsupervised gene function extraction using semantic vectors

Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method...

Descripción completa

Detalles Bibliográficos
Autores principales: Emadzadeh, Ehsan, Nikfarjam, Azadeh, Ginn, Rachel E., Gonzalez, Graciela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4160099/
https://www.ncbi.nlm.nih.gov/pubmed/25209025
http://dx.doi.org/10.1093/database/bau084
_version_ 1782334338107441152
author Emadzadeh, Ehsan
Nikfarjam, Azadeh
Ginn, Rachel E.
Gonzalez, Graciela
author_facet Emadzadeh, Ehsan
Nikfarjam, Azadeh
Ginn, Rachel E.
Gonzalez, Graciela
author_sort Emadzadeh, Ehsan
collection PubMed
description Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method for the BioCreative IV GO shared task (subtask b), focused on finding gene function terms [Gene Ontology (GO) terms] for different genes in an article. The proposed open-IE approach is based on distributional semantic similarity over the GO terms. The method does not require annotated data for training, which makes it highly generalizable. We achieve an F-measure of 0.26 on the test-set in the official submission for BioCreative-GO shared task, the third highest F-measure among the seven participants in the shared task. Database URL: https://code.google.com/p/rainbow-nlp/
format Online
Article
Text
id pubmed-4160099
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41600992014-09-11 Unsupervised gene function extraction using semantic vectors Emadzadeh, Ehsan Nikfarjam, Azadeh Ginn, Rachel E. Gonzalez, Graciela Database (Oxford) Original Article Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method for the BioCreative IV GO shared task (subtask b), focused on finding gene function terms [Gene Ontology (GO) terms] for different genes in an article. The proposed open-IE approach is based on distributional semantic similarity over the GO terms. The method does not require annotated data for training, which makes it highly generalizable. We achieve an F-measure of 0.26 on the test-set in the official submission for BioCreative-GO shared task, the third highest F-measure among the seven participants in the shared task. Database URL: https://code.google.com/p/rainbow-nlp/ Oxford University Press 2014-09-10 /pmc/articles/PMC4160099/ /pubmed/25209025 http://dx.doi.org/10.1093/database/bau084 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Emadzadeh, Ehsan
Nikfarjam, Azadeh
Ginn, Rachel E.
Gonzalez, Graciela
Unsupervised gene function extraction using semantic vectors
title Unsupervised gene function extraction using semantic vectors
title_full Unsupervised gene function extraction using semantic vectors
title_fullStr Unsupervised gene function extraction using semantic vectors
title_full_unstemmed Unsupervised gene function extraction using semantic vectors
title_short Unsupervised gene function extraction using semantic vectors
title_sort unsupervised gene function extraction using semantic vectors
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4160099/
https://www.ncbi.nlm.nih.gov/pubmed/25209025
http://dx.doi.org/10.1093/database/bau084
work_keys_str_mv AT emadzadehehsan unsupervisedgenefunctionextractionusingsemanticvectors
AT nikfarjamazadeh unsupervisedgenefunctionextractionusingsemanticvectors
AT ginnrachele unsupervisedgenefunctionextractionusingsemanticvectors
AT gonzalezgraciela unsupervisedgenefunctionextractionusingsemanticvectors