Cargando…
Open data science: technical and cultural aspects
Research in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information....
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2005
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/908236 |
_version_ | 1780908823667539968 |
---|---|
author | Murray-Rust, Peter |
author_facet | Murray-Rust, Peter |
author_sort | Murray-Rust, Peter |
collection | CERN |
description | Research in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information. Fields like bioinformatics, astronomy, physics, and earth/environmental sciences routinely use such data as primary research input. Much of this is now carried out by machines which harvest data from multiple sources in dynamic and iterative ways, validate, filter compute and republish it. The current publication process and legal infrastructure is now a serious hindrance to this. Most STM data are never published and the re-usability of those that are is often unclear as authors and publishers give no explicit permission. However almost all authors intend that published data (non-copyrightable “facts”) are for the re-use of and redissemination to the STM community and the world in general. Many publishers agree with this, but most do not actively support the effective publication of data, through disinterest or the lack of a viable business proves. Some, however, appear to assert ownership and control over factual data, debarring robots and charging for access. The new technology offers enormous scope for different models for the publication and use of Open STM data and some will be demonstrated. To develop the necessary culture for this, SPARC has generously agreed to provide a discussion list (SPARC-OpenData) on which PM-R will be the first moderator. |
id | cern-908236 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2005 |
record_format | invenio |
spelling | cern-9082362022-11-02T22:21:25Zhttp://cds.cern.ch/record/908236engMurray-Rust, PeterOpen data science: technical and cultural aspectsInformation Transfer and ManagementResearch in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information. Fields like bioinformatics, astronomy, physics, and earth/environmental sciences routinely use such data as primary research input. Much of this is now carried out by machines which harvest data from multiple sources in dynamic and iterative ways, validate, filter compute and republish it. The current publication process and legal infrastructure is now a serious hindrance to this. Most STM data are never published and the re-usability of those that are is often unclear as authors and publishers give no explicit permission. However almost all authors intend that published data (non-copyrightable “facts”) are for the re-use of and redissemination to the STM community and the world in general. Many publishers agree with this, but most do not actively support the effective publication of data, through disinterest or the lack of a viable business proves. Some, however, appear to assert ownership and control over factual data, debarring robots and charging for access. The new technology offers enormous scope for different models for the publication and use of Open STM data and some will be demonstrated. To develop the necessary culture for this, SPARC has generously agreed to provide a discussion list (SPARC-OpenData) on which PM-R will be the first moderator.oai:cds.cern.ch:9082362005-10-22 |
spellingShingle | Information Transfer and Management Murray-Rust, Peter Open data science: technical and cultural aspects |
title | Open data science: technical and cultural aspects |
title_full | Open data science: technical and cultural aspects |
title_fullStr | Open data science: technical and cultural aspects |
title_full_unstemmed | Open data science: technical and cultural aspects |
title_short | Open data science: technical and cultural aspects |
title_sort | open data science: technical and cultural aspects |
topic | Information Transfer and Management |
url | http://cds.cern.ch/record/908236 |
work_keys_str_mv | AT murrayrustpeter opendatasciencetechnicalandculturalaspects |