Cargando…
CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources
In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understand...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757992/ https://www.ncbi.nlm.nih.gov/pubmed/36526439 http://dx.doi.org/10.1093/database/baac107 |
_version_ | 1784851944610725888 |
---|---|
author | Cenikj, Gjorgjina Valenčič, Eva Ispirova, Gordana Ogrinc, Matevž Stojanov, Riste Korošec, Peter Cavalli, Ermanno Seljak, Barbara Koroušić Eftimov, Tome |
author_facet | Cenikj, Gjorgjina Valenčič, Eva Ispirova, Gordana Ogrinc, Matevž Stojanov, Riste Korošec, Peter Cavalli, Ermanno Seljak, Barbara Koroušić Eftimov, Tome |
author_sort | Cenikj, Gjorgjina |
collection | PubMed |
description | In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understanding the health information, together with a large amount of health-related data. However, despite a large number of available resources and work done in the health and environmental domains, there is a lack of semantic resources that can be utilized in the food and nutrition domain, as well as their interconnections. For this purpose, in a European Food Safety Authority–funded project CAFETERIA, we have developed the first annotated corpus of 500 scientific abstracts that consists of 6407 annotated food entities with regard to Hansard taxonomy, 4299 for FoodOn and 3623 for SNOMED-CT. The CafeteriaSA corpus will enable the further development of natural language processing methods for food information extraction from textual data that will allow extracting food information from scientific textual data. Database URL: https://zenodo.org/record/6683798#.Y49wIezMJJF |
format | Online Article Text |
id | pubmed-9757992 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97579922022-12-19 CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources Cenikj, Gjorgjina Valenčič, Eva Ispirova, Gordana Ogrinc, Matevž Stojanov, Riste Korošec, Peter Cavalli, Ermanno Seljak, Barbara Koroušić Eftimov, Tome Database (Oxford) Original Article In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understanding the health information, together with a large amount of health-related data. However, despite a large number of available resources and work done in the health and environmental domains, there is a lack of semantic resources that can be utilized in the food and nutrition domain, as well as their interconnections. For this purpose, in a European Food Safety Authority–funded project CAFETERIA, we have developed the first annotated corpus of 500 scientific abstracts that consists of 6407 annotated food entities with regard to Hansard taxonomy, 4299 for FoodOn and 3623 for SNOMED-CT. The CafeteriaSA corpus will enable the further development of natural language processing methods for food information extraction from textual data that will allow extracting food information from scientific textual data. Database URL: https://zenodo.org/record/6683798#.Y49wIezMJJF Oxford University Press 2022-12-16 /pmc/articles/PMC9757992/ /pubmed/36526439 http://dx.doi.org/10.1093/database/baac107 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Article Cenikj, Gjorgjina Valenčič, Eva Ispirova, Gordana Ogrinc, Matevž Stojanov, Riste Korošec, Peter Cavalli, Ermanno Seljak, Barbara Koroušić Eftimov, Tome CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title | CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title_full | CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title_fullStr | CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title_full_unstemmed | CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title_short | CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources |
title_sort | cafeteriasa corpus: scientific abstracts annotated across different food semantic resources |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9757992/ https://www.ncbi.nlm.nih.gov/pubmed/36526439 http://dx.doi.org/10.1093/database/baac107 |
work_keys_str_mv | AT cenikjgjorgjina cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT valenciceva cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT ispirovagordana cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT ogrincmatevz cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT stojanovriste cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT korosecpeter cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT cavalliermanno cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT seljakbarbarakorousic cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources AT eftimovtome cafeteriasacorpusscientificabstractsannotatedacrossdifferentfoodsemanticresources |