Cargando…

Structured digital tables on the Semantic Web: toward a structured digital literature

In parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high-quality information from the corpus of scientific literature has been hampered by the lack of machine-interpretable content, despite text-mining...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheung, Kei-Hoi, Samwald, Matthias, Auerbach, Raymond K, Gerstein, Mark B
Formato: Texto
Lenguaje:English
Publicado: Nature Publishing Group 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2950080/
https://www.ncbi.nlm.nih.gov/pubmed/20739925
http://dx.doi.org/10.1038/msb.2010.45
_version_ 1782187622890733568
author Cheung, Kei-Hoi
Samwald, Matthias
Auerbach, Raymond K
Gerstein, Mark B
author_facet Cheung, Kei-Hoi
Samwald, Matthias
Auerbach, Raymond K
Gerstein, Mark B
author_sort Cheung, Kei-Hoi
collection PubMed
description In parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high-quality information from the corpus of scientific literature has been hampered by the lack of machine-interpretable content, despite text-mining advances. To address this, we propose creating a structured digital table as part of an overall effort in developing machine-readable, structured digital literature. In particular, we envision transforming publication tables into standardized triples using Semantic Web approaches. We identify three canonical types of tables (conveying information about properties, networks, and concept hierarchies) and show how more complex tables can be built from these basic types. We envision that authors would create tables initially using the structured triples for canonical types and then have them visually rendered for publication, and we present examples for converting representative tables into triples. Finally, we discuss how ‘stub' versions of structured digital tables could be a useful bridge for connecting together the literature with databases, allowing the former to more precisely document the later.
format Text
id pubmed-2950080
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-29500802010-10-05 Structured digital tables on the Semantic Web: toward a structured digital literature Cheung, Kei-Hoi Samwald, Matthias Auerbach, Raymond K Gerstein, Mark B Mol Syst Biol Perspectives In parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high-quality information from the corpus of scientific literature has been hampered by the lack of machine-interpretable content, despite text-mining advances. To address this, we propose creating a structured digital table as part of an overall effort in developing machine-readable, structured digital literature. In particular, we envision transforming publication tables into standardized triples using Semantic Web approaches. We identify three canonical types of tables (conveying information about properties, networks, and concept hierarchies) and show how more complex tables can be built from these basic types. We envision that authors would create tables initially using the structured triples for canonical types and then have them visually rendered for publication, and we present examples for converting representative tables into triples. Finally, we discuss how ‘stub' versions of structured digital tables could be a useful bridge for connecting together the literature with databases, allowing the former to more precisely document the later. Nature Publishing Group 2010-08-24 /pmc/articles/PMC2950080/ /pubmed/20739925 http://dx.doi.org/10.1038/msb.2010.45 Text en Copyright © 2010, EMBO and Macmillan Publishers Limited http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Noncommercial Share Alike 3.0 Unported License, which allows readers to alter, transform, or build upon the article and then distribute the resulting work under the same or similar license to this one. The work must be attributed back to the original author and commercial use is not permitted without specific permission.
spellingShingle Perspectives
Cheung, Kei-Hoi
Samwald, Matthias
Auerbach, Raymond K
Gerstein, Mark B
Structured digital tables on the Semantic Web: toward a structured digital literature
title Structured digital tables on the Semantic Web: toward a structured digital literature
title_full Structured digital tables on the Semantic Web: toward a structured digital literature
title_fullStr Structured digital tables on the Semantic Web: toward a structured digital literature
title_full_unstemmed Structured digital tables on the Semantic Web: toward a structured digital literature
title_short Structured digital tables on the Semantic Web: toward a structured digital literature
title_sort structured digital tables on the semantic web: toward a structured digital literature
topic Perspectives
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2950080/
https://www.ncbi.nlm.nih.gov/pubmed/20739925
http://dx.doi.org/10.1038/msb.2010.45
work_keys_str_mv AT cheungkeihoi structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature
AT samwaldmatthias structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature
AT auerbachraymondk structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature
AT gersteinmarkb structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature