Cargando…
Database Citation in Full Text Biomedical Articles
Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current pattern...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667078/ https://www.ncbi.nlm.nih.gov/pubmed/23734176 http://dx.doi.org/10.1371/journal.pone.0063184 |
_version_ | 1782271439663005696 |
---|---|
author | Kafkas, Şenay Kim, Jee-Hyub McEntyre, Johanna R. |
author_facet | Kafkas, Şenay Kim, Jee-Hyub McEntyre, Johanna R. |
author_sort | Kafkas, Şenay |
collection | PubMed |
description | Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services. |
format | Online Article Text |
id | pubmed-3667078 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-36670782013-06-03 Database Citation in Full Text Biomedical Articles Kafkas, Şenay Kim, Jee-Hyub McEntyre, Johanna R. PLoS One Research Article Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services. Public Library of Science 2013-05-29 /pmc/articles/PMC3667078/ /pubmed/23734176 http://dx.doi.org/10.1371/journal.pone.0063184 Text en © 2013 Kafkas et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Kafkas, Şenay Kim, Jee-Hyub McEntyre, Johanna R. Database Citation in Full Text Biomedical Articles |
title | Database Citation in Full Text Biomedical Articles |
title_full | Database Citation in Full Text Biomedical Articles |
title_fullStr | Database Citation in Full Text Biomedical Articles |
title_full_unstemmed | Database Citation in Full Text Biomedical Articles |
title_short | Database Citation in Full Text Biomedical Articles |
title_sort | database citation in full text biomedical articles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667078/ https://www.ncbi.nlm.nih.gov/pubmed/23734176 http://dx.doi.org/10.1371/journal.pone.0063184 |
work_keys_str_mv | AT kafkassenay databasecitationinfulltextbiomedicalarticles AT kimjeehyub databasecitationinfulltextbiomedicalarticles AT mcentyrejohannar databasecitationinfulltextbiomedicalarticles |