Cargando…

Database Citation in Full Text Biomedical Articles

Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current pattern...

Descripción completa

Detalles Bibliográficos
Autores principales: Kafkas, Şenay, Kim, Jee-Hyub, McEntyre, Johanna R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667078/
https://www.ncbi.nlm.nih.gov/pubmed/23734176
http://dx.doi.org/10.1371/journal.pone.0063184
_version_ 1782271439663005696
author Kafkas, Şenay
Kim, Jee-Hyub
McEntyre, Johanna R.
author_facet Kafkas, Şenay
Kim, Jee-Hyub
McEntyre, Johanna R.
author_sort Kafkas, Şenay
collection PubMed
description Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.
format Online
Article
Text
id pubmed-3667078
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36670782013-06-03 Database Citation in Full Text Biomedical Articles Kafkas, Şenay Kim, Jee-Hyub McEntyre, Johanna R. PLoS One Research Article Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services. Public Library of Science 2013-05-29 /pmc/articles/PMC3667078/ /pubmed/23734176 http://dx.doi.org/10.1371/journal.pone.0063184 Text en © 2013 Kafkas et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kafkas, Şenay
Kim, Jee-Hyub
McEntyre, Johanna R.
Database Citation in Full Text Biomedical Articles
title Database Citation in Full Text Biomedical Articles
title_full Database Citation in Full Text Biomedical Articles
title_fullStr Database Citation in Full Text Biomedical Articles
title_full_unstemmed Database Citation in Full Text Biomedical Articles
title_short Database Citation in Full Text Biomedical Articles
title_sort database citation in full text biomedical articles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667078/
https://www.ncbi.nlm.nih.gov/pubmed/23734176
http://dx.doi.org/10.1371/journal.pone.0063184
work_keys_str_mv AT kafkassenay databasecitationinfulltextbiomedicalarticles
AT kimjeehyub databasecitationinfulltextbiomedicalarticles
AT mcentyrejohannar databasecitationinfulltextbiomedicalarticles