Cargando…

Information extraction from full text scientific articles: Where are the keywords?

BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions aris...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Parantu K, Perez-Iratxeta, Carolina, Bork, Peer, Andrade, Miguel A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC166134/
https://www.ncbi.nlm.nih.gov/pubmed/12775220
http://dx.doi.org/10.1186/1471-2105-4-20
_version_ 1782120849079271424
author Shah, Parantu K
Perez-Iratxeta, Carolina
Bork, Peer
Andrade, Miguel A
author_facet Shah, Parantu K
Perez-Iratxeta, Carolina
Bork, Peer
Andrade, Miguel A
author_sort Shah, Parantu K
collection PubMed
description BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions arise as to whether the effort of scanning full text articles is worthy, or whether the information that can be extracted from the different sections of an article can be relevant. RESULTS: In this work we addressed those questions showing that the keyword content of the different sections of a standard scientific article (abstract, introduction, methods, results, and discussion) is very heterogeneous. CONCLUSIONS: Although the abstract contains the best ratio of keywords per total of words, other sections of the article may be a better source of biologically relevant data.
format Text
id pubmed-166134
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1661342003-07-26 Information extraction from full text scientific articles: Where are the keywords? Shah, Parantu K Perez-Iratxeta, Carolina Bork, Peer Andrade, Miguel A BMC Bioinformatics Methodology Article BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions arise as to whether the effort of scanning full text articles is worthy, or whether the information that can be extracted from the different sections of an article can be relevant. RESULTS: In this work we addressed those questions showing that the keyword content of the different sections of a standard scientific article (abstract, introduction, methods, results, and discussion) is very heterogeneous. CONCLUSIONS: Although the abstract contains the best ratio of keywords per total of words, other sections of the article may be a better source of biologically relevant data. BioMed Central 2003-05-29 /pmc/articles/PMC166134/ /pubmed/12775220 http://dx.doi.org/10.1186/1471-2105-4-20 Text en Copyright © 2003 Shah et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Shah, Parantu K
Perez-Iratxeta, Carolina
Bork, Peer
Andrade, Miguel A
Information extraction from full text scientific articles: Where are the keywords?
title Information extraction from full text scientific articles: Where are the keywords?
title_full Information extraction from full text scientific articles: Where are the keywords?
title_fullStr Information extraction from full text scientific articles: Where are the keywords?
title_full_unstemmed Information extraction from full text scientific articles: Where are the keywords?
title_short Information extraction from full text scientific articles: Where are the keywords?
title_sort information extraction from full text scientific articles: where are the keywords?
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC166134/
https://www.ncbi.nlm.nih.gov/pubmed/12775220
http://dx.doi.org/10.1186/1471-2105-4-20
work_keys_str_mv AT shahparantuk informationextractionfromfulltextscientificarticleswherearethekeywords
AT pereziratxetacarolina informationextractionfromfulltextscientificarticleswherearethekeywords
AT borkpeer informationextractionfromfulltextscientificarticleswherearethekeywords
AT andrademiguela informationextractionfromfulltextscientificarticleswherearethekeywords