Cargando…
Unsupervised gene function extraction using semantic vectors
Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4160099/ https://www.ncbi.nlm.nih.gov/pubmed/25209025 http://dx.doi.org/10.1093/database/bau084 |
_version_ | 1782334338107441152 |
---|---|
author | Emadzadeh, Ehsan Nikfarjam, Azadeh Ginn, Rachel E. Gonzalez, Graciela |
author_facet | Emadzadeh, Ehsan Nikfarjam, Azadeh Ginn, Rachel E. Gonzalez, Graciela |
author_sort | Emadzadeh, Ehsan |
collection | PubMed |
description | Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method for the BioCreative IV GO shared task (subtask b), focused on finding gene function terms [Gene Ontology (GO) terms] for different genes in an article. The proposed open-IE approach is based on distributional semantic similarity over the GO terms. The method does not require annotated data for training, which makes it highly generalizable. We achieve an F-measure of 0.26 on the test-set in the official submission for BioCreative-GO shared task, the third highest F-measure among the seven participants in the shared task. Database URL: https://code.google.com/p/rainbow-nlp/ |
format | Online Article Text |
id | pubmed-4160099 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41600992014-09-11 Unsupervised gene function extraction using semantic vectors Emadzadeh, Ehsan Nikfarjam, Azadeh Ginn, Rachel E. Gonzalez, Graciela Database (Oxford) Original Article Finding gene functions discussed in the literature is an important task of information extraction (IE) from biomedical documents. Automated computational methodologies can significantly reduce the need for manual curation and improve quality of other related IE systems. We propose an open-IE method for the BioCreative IV GO shared task (subtask b), focused on finding gene function terms [Gene Ontology (GO) terms] for different genes in an article. The proposed open-IE approach is based on distributional semantic similarity over the GO terms. The method does not require annotated data for training, which makes it highly generalizable. We achieve an F-measure of 0.26 on the test-set in the official submission for BioCreative-GO shared task, the third highest F-measure among the seven participants in the shared task. Database URL: https://code.google.com/p/rainbow-nlp/ Oxford University Press 2014-09-10 /pmc/articles/PMC4160099/ /pubmed/25209025 http://dx.doi.org/10.1093/database/bau084 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Emadzadeh, Ehsan Nikfarjam, Azadeh Ginn, Rachel E. Gonzalez, Graciela Unsupervised gene function extraction using semantic vectors |
title | Unsupervised gene function extraction using semantic vectors |
title_full | Unsupervised gene function extraction using semantic vectors |
title_fullStr | Unsupervised gene function extraction using semantic vectors |
title_full_unstemmed | Unsupervised gene function extraction using semantic vectors |
title_short | Unsupervised gene function extraction using semantic vectors |
title_sort | unsupervised gene function extraction using semantic vectors |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4160099/ https://www.ncbi.nlm.nih.gov/pubmed/25209025 http://dx.doi.org/10.1093/database/bau084 |
work_keys_str_mv | AT emadzadehehsan unsupervisedgenefunctionextractionusingsemanticvectors AT nikfarjamazadeh unsupervisedgenefunctionextractionusingsemanticvectors AT ginnrachele unsupervisedgenefunctionextractionusingsemanticvectors AT gonzalezgraciela unsupervisedgenefunctionextractionusingsemanticvectors |