Cargando…

CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources

In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understand...

Descripción completa

Detalles Bibliográficos
Autores principales: Cenikj, Gjorgjina, Valenčič, Eva, Ispirova, Gordana, Ogrinc, Matevž, Stojanov, Riste, Korošec, Peter, Cavalli, Ermanno, Seljak, Barbara Koroušić, Eftimov, Tome
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757992/
https://www.ncbi.nlm.nih.gov/pubmed/36526439
http://dx.doi.org/10.1093/database/baac107
_version_ 1784851944610725888
author Cenikj, Gjorgjina
Valenčič, Eva
Ispirova, Gordana
Ogrinc, Matevž
Stojanov, Riste
Korošec, Peter
Cavalli, Ermanno
Seljak, Barbara Koroušić
Eftimov, Tome
author_facet Cenikj, Gjorgjina
Valenčič, Eva
Ispirova, Gordana
Ogrinc, Matevž
Stojanov, Riste
Korošec, Peter
Cavalli, Ermanno
Seljak, Barbara Koroušić
Eftimov, Tome
author_sort Cenikj, Gjorgjina
collection PubMed
description In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understanding the health information, together with a large amount of health-related data. However, despite a large number of available resources and work done in the health and environmental domains, there is a lack of semantic resources that can be utilized in the food and nutrition domain, as well as their interconnections. For this purpose, in a European Food Safety Authority–funded project CAFETERIA, we have developed the first annotated corpus of 500 scientific abstracts that consists of 6407 annotated food entities with regard to Hansard taxonomy, 4299 for FoodOn and 3623 for SNOMED-CT. The CafeteriaSA corpus will enable the further development of natural language processing methods for food information extraction from textual data that will allow extracting food information from scientific textual data. Database URL: https://zenodo.org/record/6683798#.Y49wIezMJJF
format Online
Article
Text
id pubmed-9757992
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97579922022-12-19 CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources Cenikj, Gjorgjina Valenčič, Eva Ispirova, Gordana Ogrinc, Matevž Stojanov, Riste Korošec, Peter Cavalli, Ermanno Seljak, Barbara Koroušić Eftimov, Tome Database (Oxford) Original Article In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understanding the health information, together with a large amount of health-related data. However, despite a large number of available resources and work done in the health and environmental domains, there is a lack of semantic resources that can be utilized in the food and nutrition domain, as well as their interconnections. For this purpose, in a European Food Safety Authority–funded project CAFETERIA, we have developed the first annotated corpus of 500 scientific abstracts that consists of 6407 annotated food entities with regard to Hansard taxonomy, 4299 for FoodOn and 3623 for SNOMED-CT. The CafeteriaSA corpus will enable the further development of natural language processing methods for food information extraction from textual data that will allow extracting food information from scientific textual data. Database URL: https://zenodo.org/record/6683798#.Y49wIezMJJF Oxford University Press 2022-12-16 /pmc/articles/PMC9757992/ /pubmed/36526439 http://dx.doi.org/10.1093/database/baac107 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Article
Cenikj, Gjorgjina
Valenčič, Eva
Ispirova, Gordana
Ogrinc, Matevž
Stojanov, Riste
Korošec, Peter
Cavalli, Ermanno
Seljak, Barbara Koroušić
Eftimov, Tome
CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title_full CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title_fullStr CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title_full_unstemmed CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title_short CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
title_sort cafeteriasa corpus: scientific abstracts annotated across different food semantic resources
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757992/
https://www.ncbi.nlm.nih.gov/pubmed/36526439
http://dx.doi.org/10.1093/database/baac107
work_keys_str_mv AT cenikjgjorgjina cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT valenciceva cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT ispirovagordana cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT ogrincmatevz cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT stojanovriste cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT korosecpeter cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT cavalliermanno cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT seljakbarbarakorousic cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources
AT eftimovtome cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources