Cargando…

Open data science: technical and cultural aspects

Research in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information....

Descripción completa

Detalles Bibliográficos
Autor principal: Murray-Rust, Peter
Lenguaje:eng
Publicado: 2005
Materias:
Acceso en línea:http://cds.cern.ch/record/908236
_version_ 1780908823667539968
author Murray-Rust, Peter
author_facet Murray-Rust, Peter
author_sort Murray-Rust, Peter
collection CERN
description Research in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information. Fields like bioinformatics, astronomy, physics, and earth/environmental sciences routinely use such data as primary research input. Much of this is now carried out by machines which harvest data from multiple sources in dynamic and iterative ways, validate, filter compute and republish it. The current publication process and legal infrastructure is now a serious hindrance to this. Most STM data are never published and the re-usability of those that are is often unclear as authors and publishers give no explicit permission. However almost all authors intend that published data (non-copyrightable “facts”) are for the re-use of and redissemination to the STM community and the world in general. Many publishers agree with this, but most do not actively support the effective publication of data, through disinterest or the lack of a viable business proves. Some, however, appear to assert ownership and control over factual data, debarring robots and charging for access. The new technology offers enormous scope for different models for the publication and use of Open STM data and some will be demonstrated. To develop the necessary culture for this, SPARC has generously agreed to provide a discussion list (SPARC-OpenData) on which PM-R will be the first moderator.
id cern-908236
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2005
record_format invenio
spelling cern-9082362022-11-02T22:21:25Zhttp://cds.cern.ch/record/908236engMurray-Rust, PeterOpen data science: technical and cultural aspectsInformation Transfer and ManagementResearch in STM fields routinely generates and requires large amounts of data in electronic form. The growth of scientific research using infrastructures such as the Grid, UK's eScience programme and cyber infrastructure requires the re-use, repurposing and redissemination of this information. Fields like bioinformatics, astronomy, physics, and earth/environmental sciences routinely use such data as primary research input. Much of this is now carried out by machines which harvest data from multiple sources in dynamic and iterative ways, validate, filter compute and republish it. The current publication process and legal infrastructure is now a serious hindrance to this. Most STM data are never published and the re-usability of those that are is often unclear as authors and publishers give no explicit permission. However almost all authors intend that published data (non-copyrightable “facts”) are for the re-use of and redissemination to the STM community and the world in general. Many publishers agree with this, but most do not actively support the effective publication of data, through disinterest or the lack of a viable business proves. Some, however, appear to assert ownership and control over factual data, debarring robots and charging for access. The new technology offers enormous scope for different models for the publication and use of Open STM data and some will be demonstrated. To develop the necessary culture for this, SPARC has generously agreed to provide a discussion list (SPARC-OpenData) on which PM-R will be the first moderator.oai:cds.cern.ch:9082362005-10-22
spellingShingle Information Transfer and Management
Murray-Rust, Peter
Open data science: technical and cultural aspects
title Open data science: technical and cultural aspects
title_full Open data science: technical and cultural aspects
title_fullStr Open data science: technical and cultural aspects
title_full_unstemmed Open data science: technical and cultural aspects
title_short Open data science: technical and cultural aspects
title_sort open data science: technical and cultural aspects
topic Information Transfer and Management
url http://cds.cern.ch/record/908236
work_keys_str_mv AT murrayrustpeter opendatasciencetechnicalandculturalaspects