Cargando…
Information extraction from full text scientific articles: Where are the keywords?
BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions aris...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2003
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC166134/ https://www.ncbi.nlm.nih.gov/pubmed/12775220 http://dx.doi.org/10.1186/1471-2105-4-20 |
_version_ | 1782120849079271424 |
---|---|
author | Shah, Parantu K Perez-Iratxeta, Carolina Bork, Peer Andrade, Miguel A |
author_facet | Shah, Parantu K Perez-Iratxeta, Carolina Bork, Peer Andrade, Miguel A |
author_sort | Shah, Parantu K |
collection | PubMed |
description | BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions arise as to whether the effort of scanning full text articles is worthy, or whether the information that can be extracted from the different sections of an article can be relevant. RESULTS: In this work we addressed those questions showing that the keyword content of the different sections of a standard scientific article (abstract, introduction, methods, results, and discussion) is very heterogeneous. CONCLUSIONS: Although the abstract contains the best ratio of keywords per total of words, other sections of the article may be a better source of biologically relevant data. |
format | Text |
id | pubmed-166134 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1661342003-07-26 Information extraction from full text scientific articles: Where are the keywords? Shah, Parantu K Perez-Iratxeta, Carolina Bork, Peer Andrade, Miguel A BMC Bioinformatics Methodology Article BACKGROUND: To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions arise as to whether the effort of scanning full text articles is worthy, or whether the information that can be extracted from the different sections of an article can be relevant. RESULTS: In this work we addressed those questions showing that the keyword content of the different sections of a standard scientific article (abstract, introduction, methods, results, and discussion) is very heterogeneous. CONCLUSIONS: Although the abstract contains the best ratio of keywords per total of words, other sections of the article may be a better source of biologically relevant data. BioMed Central 2003-05-29 /pmc/articles/PMC166134/ /pubmed/12775220 http://dx.doi.org/10.1186/1471-2105-4-20 Text en Copyright © 2003 Shah et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Methodology Article Shah, Parantu K Perez-Iratxeta, Carolina Bork, Peer Andrade, Miguel A Information extraction from full text scientific articles: Where are the keywords? |
title | Information extraction from full text scientific articles: Where are the keywords? |
title_full | Information extraction from full text scientific articles: Where are the keywords? |
title_fullStr | Information extraction from full text scientific articles: Where are the keywords? |
title_full_unstemmed | Information extraction from full text scientific articles: Where are the keywords? |
title_short | Information extraction from full text scientific articles: Where are the keywords? |
title_sort | information extraction from full text scientific articles: where are the keywords? |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC166134/ https://www.ncbi.nlm.nih.gov/pubmed/12775220 http://dx.doi.org/10.1186/1471-2105-4-20 |
work_keys_str_mv | AT shahparantuk informationextractionfromfulltextscientificarticleswherearethekeywords AT pereziratxetacarolina informationextractionfromfulltextscientificarticleswherearethekeywords AT borkpeer informationextractionfromfulltextscientificarticleswherearethekeywords AT andrademiguela informationextractionfromfulltextscientificarticleswherearethekeywords |