Cargando…
CMS Data Provenance: CERN open data portal data curation automation
The CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes accompanying software and documentation needed to understand and analyze the data. The portal a...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2834494 |
_version_ | 1780975595827494912 |
---|---|
author | Almomani, Osama |
author_facet | Almomani, Osama |
author_sort | Almomani, Osama |
collection | CERN |
description | The CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes accompanying software and documentation needed to understand and analyze the data. The portal adheres to established global standards in data preservation and Open Science: the products are shared under open licenses; they are issued with a Digital Object Identifier (DOI) to make them citable objects.[1] The portal has a wide range of datasets coming from CERN main experiments ALICE, ATLAS, CMS, LHCb and OPERA. The datasets are very huge in size -Gigabytes to Terabytes-, and only metadata information is generated as JSON records that go directly to the portal by the data curation scripts. data curation scripts contain a collection of data ingestion and curation tools used to prepare the datasets’ metadata, software, and any accompanying material for public open data releases on the CERN Open Data portal.[2] |
id | cern-2834494 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28344942022-09-26T19:27:40Zhttp://cds.cern.ch/record/2834494engAlmomani, OsamaCMS Data Provenance: CERN open data portal data curation automationPhysics in GeneralThe CERN Open Data portal is the access point to a growing range of data produced through the research performed at CERN. It disseminates the preserved output from various research activities and includes accompanying software and documentation needed to understand and analyze the data. The portal adheres to established global standards in data preservation and Open Science: the products are shared under open licenses; they are issued with a Digital Object Identifier (DOI) to make them citable objects.[1] The portal has a wide range of datasets coming from CERN main experiments ALICE, ATLAS, CMS, LHCb and OPERA. The datasets are very huge in size -Gigabytes to Terabytes-, and only metadata information is generated as JSON records that go directly to the portal by the data curation scripts. data curation scripts contain a collection of data ingestion and curation tools used to prepare the datasets’ metadata, software, and any accompanying material for public open data releases on the CERN Open Data portal.[2]CERN-STUDENTS-Note-2022-174oai:cds.cern.ch:28344942022-09-26 |
spellingShingle | Physics in General Almomani, Osama CMS Data Provenance: CERN open data portal data curation automation |
title | CMS Data Provenance: CERN open data portal data curation automation |
title_full | CMS Data Provenance: CERN open data portal data curation automation |
title_fullStr | CMS Data Provenance: CERN open data portal data curation automation |
title_full_unstemmed | CMS Data Provenance: CERN open data portal data curation automation |
title_short | CMS Data Provenance: CERN open data portal data curation automation |
title_sort | cms data provenance: cern open data portal data curation automation |
topic | Physics in General |
url | http://cds.cern.ch/record/2834494 |
work_keys_str_mv | AT almomaniosama cmsdataprovenancecernopendataportaldatacurationautomation |