
CMS Data Provenance: CERN open data portal data curation automation

The CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes accompanying software and documentation needed to understand and analyze the data. The portal a...


Bibliographic Details
Main author: Almomani, Osama
Language: eng
Published: 2022
Subjects: Physics in General
Online access: http://cds.cern.ch/record/2834494
_version_ 1780975595827494912
author Almomani, Osama
author_facet Almomani, Osama
author_sort Almomani, Osama
collection CERN
description The CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes the accompanying software and documentation needed to understand and analyze the data. The portal adheres to established global standards in data preservation and Open Science: the products are shared under open licenses and are issued with a Digital Object Identifier (DOI) to make them citable objects.[1] The portal hosts a wide range of datasets from the main CERN experiments: ALICE, ATLAS, CMS, LHCb, and OPERA. The datasets are very large (gigabytes to terabytes), and the data curation scripts generate only metadata, as JSON records that go directly to the portal. The data curation scripts are a collection of data ingestion and curation tools used to prepare the datasets’ metadata, software, and any accompanying material for public open data releases on the CERN Open Data portal.[2]
id cern-2834494
institution European Organization for Nuclear Research
language eng
publishDate 2022
record_format invenio
spelling cern-2834494 2022-09-26T19:27:40Z http://cds.cern.ch/record/2834494 eng Almomani, Osama CMS Data Provenance: CERN open data portal data curation automation Physics in General The CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes the accompanying software and documentation needed to understand and analyze the data. The portal adheres to established global standards in data preservation and Open Science: the products are shared under open licenses and are issued with a Digital Object Identifier (DOI) to make them citable objects.[1] The portal hosts a wide range of datasets from the main CERN experiments: ALICE, ATLAS, CMS, LHCb, and OPERA. The datasets are very large (gigabytes to terabytes), and the data curation scripts generate only metadata, as JSON records that go directly to the portal. The data curation scripts are a collection of data ingestion and curation tools used to prepare the datasets’ metadata, software, and any accompanying material for public open data releases on the CERN Open Data portal.[2] CERN-STUDENTS-Note-2022-174 oai:cds.cern.ch:2834494 2022-09-26
spellingShingle Physics in General
Almomani, Osama
CMS Data Provenance: CERN open data portal data curation automation
title CMS Data Provenance: CERN open data portal data curation automation
title_full CMS Data Provenance: CERN open data portal data curation automation
title_fullStr CMS Data Provenance: CERN open data portal data curation automation
title_full_unstemmed CMS Data Provenance: CERN open data portal data curation automation
title_short CMS Data Provenance: CERN open data portal data curation automation
title_sort cms data provenance: cern open data portal data curation automation
topic Physics in General
url http://cds.cern.ch/record/2834494
work_keys_str_mv AT almomaniosama cmsdataprovenancecernopendataportaldatacurationautomation