
YummyData: providing high-quality open life science data

Many life science datasets are now available via Linked Data technologies, meaning that they are represented in a common format (the Resource Description Framework), and are accessible via standard APIs (SPARQL endpoints). While this is an important step toward developing an interoperable bioinforma...

Bibliographic Details
Main Authors:	Yamamoto, Yasunori, Yamaguchi, Atsuko, Splendiani, Andrea
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2018
Subjects:	Original Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5846286/ https://www.ncbi.nlm.nih.gov/pubmed/29688370 http://dx.doi.org/10.1093/database/bay022

author	Yamamoto, Yasunori Yamaguchi, Atsuko Splendiani, Andrea
collection	PubMed
description	Many life science datasets are now available via Linked Data technologies, meaning that they are represented in a common format (the Resource Description Framework), and are accessible via standard APIs (SPARQL endpoints). While this is an important step toward developing an interoperable bioinformatics data landscape, it also creates a new set of obstacles, as it is often difficult for researchers to find the datasets they need. Different providers frequently offer the same datasets, with different levels of support: as well as having more or less up-to-date data, some providers add metadata to describe the content, structures, and ontologies of the stored datasets while others do not. We currently lack a place where researchers can go to easily assess datasets from different providers in terms of metrics such as service stability or metadata richness. We also lack a space for collecting feedback and improving data providers’ awareness of user needs. To address this issue, we have developed YummyData, which consists of two components. One periodically polls a curated list of SPARQL endpoints, monitoring the states of their Linked Data implementations and content. The other presents the information measured for the endpoints and provides a forum for discussion and feedback. YummyData is designed to improve the findability and reusability of life science datasets provided as Linked Data and to foster its adoption. It is freely accessible at http://yummydata.org/. Database URL: http://yummydata.org/
format	Online Article Text
id	pubmed-5846286
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-5846286 2018-03-21 YummyData: providing high-quality open life science data. Yamamoto, Yasunori; Yamaguchi, Atsuko; Splendiani, Andrea. Database (Oxford). Original Article. Oxford University Press 2018-03-09 /pmc/articles/PMC5846286/ /pubmed/29688370 http://dx.doi.org/10.1093/database/bay022 Text en © The Author(s) 2018. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title	YummyData: providing high-quality open life science data
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5846286/ https://www.ncbi.nlm.nih.gov/pubmed/29688370 http://dx.doi.org/10.1093/database/bay022
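
The abstract describes a component that periodically polls a curated list of SPARQL endpoints and records their state. The sketch below illustrates one way such a liveness check could look; it is not YummyData's actual implementation. It assumes each endpoint accepts a query through the standard `query` GET parameter, and every name in it (`PollResult`, `poll_endpoint`, `availability`, the trivial `ASK` probe) is hypothetical and chosen only for illustration.

```python
import time
import urllib.error
import urllib.parse
import urllib.request
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional

# Assumption: a trivial ASK query is enough to tell whether an endpoint answers.
ASK_QUERY = "ASK { ?s ?p ?o }"


@dataclass
class PollResult:
    endpoint: str
    alive: bool
    elapsed_seconds: float
    error: Optional[str] = None


def build_query_url(endpoint: str, query: str = ASK_QUERY) -> str:
    """Append the URL-encoded query as the `query` GET parameter."""
    separator = "&" if "?" in endpoint else "?"
    return endpoint + separator + urllib.parse.urlencode({"query": query})


def http_fetch(url: str, timeout: float = 10.0) -> int:
    """Send the request and return the HTTP status code."""
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


def poll_endpoint(endpoint: str, fetch: Callable[[str], int] = http_fetch) -> PollResult:
    """Probe one endpoint once, recording liveness and response time."""
    started = time.monotonic()
    try:
        status = fetch(build_query_url(endpoint))
        alive = 200 <= status < 300
        error = None if alive else f"HTTP {status}"
    except OSError as exc:  # URLError, HTTPError and timeouts all derive from OSError
        alive, error = False, str(exc)
    return PollResult(endpoint, alive, time.monotonic() - started, error)


def poll_all(endpoints: Iterable[str], fetch: Callable[[str], int] = http_fetch) -> List[PollResult]:
    """Probe every endpoint in the curated list, one after another."""
    return [poll_endpoint(endpoint, fetch) for endpoint in endpoints]


def availability(results: Iterable[PollResult]) -> float:
    """Fraction of probes that succeeded (0.0 for an empty list)."""
    results = list(results)
    if not results:
        return 0.0
    return sum(result.alive for result in results) / len(results)
```

Scheduling is left out on purpose: a real monitor would call `poll_all` at a fixed interval (for example from a cron job) and store the results over time, so that a stability metric such as `availability` can be computed per endpoint across many probes. The `fetch` parameter exists so the network call can be replaced with a stub when testing.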
