Cargando…
Data Knowledge Base for HENP Scientific Collaborations
Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such a...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/1085/3/032013 http://cds.cern.ch/record/2669215 |
_version_ | 1780962212816355328 |
---|---|
author | Aulov, V A Golosova, M V Grigorieva, M A Klimentov, A A Padolski, S Wenaus, T |
author_facet | Aulov, V A Golosova, M V Grigorieva, M A Klimentov, A A Padolski, S Wenaus, T |
author_sort | Aulov, V A |
collection | CERN |
description | Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D; project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publications |
id | oai-inspirehep.net-1699835 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2018 |
record_format | invenio |
spelling | oai-inspirehep.net-16998352021-02-09T10:05:50Zdoi:10.1088/1742-6596/1085/3/032013http://cds.cern.ch/record/2669215engAulov, V AGolosova, M VGrigorieva, M AKlimentov, A APadolski, SWenaus, TData Knowledge Base for HENP Scientific CollaborationsComputing and ComputersContemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D; project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publicationsoai:inspirehep.net:16998352018 |
spellingShingle | Computing and Computers Aulov, V A Golosova, M V Grigorieva, M A Klimentov, A A Padolski, S Wenaus, T Data Knowledge Base for HENP Scientific Collaborations |
title | Data Knowledge Base for HENP Scientific Collaborations |
title_full | Data Knowledge Base for HENP Scientific Collaborations |
title_fullStr | Data Knowledge Base for HENP Scientific Collaborations |
title_full_unstemmed | Data Knowledge Base for HENP Scientific Collaborations |
title_short | Data Knowledge Base for HENP Scientific Collaborations |
title_sort | data knowledge base for henp scientific collaborations |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/1085/3/032013 http://cds.cern.ch/record/2669215 |
work_keys_str_mv | AT aulovva dataknowledgebaseforhenpscientificcollaborations AT golosovamv dataknowledgebaseforhenpscientificcollaborations AT grigorievama dataknowledgebaseforhenpscientificcollaborations AT klimentovaa dataknowledgebaseforhenpscientificcollaborations AT padolskis dataknowledgebaseforhenpscientificcollaborations AT wenaust dataknowledgebaseforhenpscientificcollaborations |