Cargando…

Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data

The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Alex M, Williams, Antony J, Ekins, Sean
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4369291/
https://www.ncbi.nlm.nih.gov/pubmed/25798198
http://dx.doi.org/10.1186/s13321-015-0057-7
_version_ 1782362746204979200
author Clark, Alex M
Williams, Antony J
Ekins, Sean
author_facet Clark, Alex M
Williams, Antony J
Ekins, Sean
author_sort Clark, Alex M
collection PubMed
description The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We propose that this trend be accompanied by a thorough examination of data sharing priorities. We argue that the most significant immediate benefactor of open data is in fact chemical algorithms, which are capable of absorbing vast quantities of data, and using it to present concise insights to working chemists, on a scale that could not be achieved by traditional publication methods. Making this goal practically achievable will require a paradigm shift in the way individual scientists translate their data into digital form, since most contemporary methods of data entry are designed for presentation to humans rather than consumption by machine learning algorithms. We discuss some of the complex issues involved in fixing current methods, as well as some of the immediate benefits that can be gained when open data is published correctly using unambiguous machine readable formats. [Figure: see text]
format Online
Article
Text
id pubmed-4369291
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-43692912015-03-23 Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data Clark, Alex M Williams, Antony J Ekins, Sean J Cheminform Research Article The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We propose that this trend be accompanied by a thorough examination of data sharing priorities. We argue that the most significant immediate benefactor of open data is in fact chemical algorithms, which are capable of absorbing vast quantities of data, and using it to present concise insights to working chemists, on a scale that could not be achieved by traditional publication methods. Making this goal practically achievable will require a paradigm shift in the way individual scientists translate their data into digital form, since most contemporary methods of data entry are designed for presentation to humans rather than consumption by machine learning algorithms. We discuss some of the complex issues involved in fixing current methods, as well as some of the immediate benefits that can be gained when open data is published correctly using unambiguous machine readable formats. [Figure: see text] Springer International Publishing 2015-03-22 /pmc/articles/PMC4369291/ /pubmed/25798198 http://dx.doi.org/10.1186/s13321-015-0057-7 Text en © Clark et al.; licensee Springer. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Clark, Alex M
Williams, Antony J
Ekins, Sean
Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title_full Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title_fullStr Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title_full_unstemmed Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title_short Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
title_sort machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4369291/
https://www.ncbi.nlm.nih.gov/pubmed/25798198
http://dx.doi.org/10.1186/s13321-015-0057-7
work_keys_str_mv AT clarkalexm machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata
AT williamsantonyj machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata
AT ekinssean machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata