Cargando…
Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data
The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4369291/ https://www.ncbi.nlm.nih.gov/pubmed/25798198 http://dx.doi.org/10.1186/s13321-015-0057-7 |
_version_ | 1782362746204979200 |
---|---|
author | Clark, Alex M Williams, Antony J Ekins, Sean |
author_facet | Clark, Alex M Williams, Antony J Ekins, Sean |
author_sort | Clark, Alex M |
collection | PubMed |
description | The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We propose that this trend be accompanied by a thorough examination of data sharing priorities. We argue that the most significant immediate benefactor of open data is in fact chemical algorithms, which are capable of absorbing vast quantities of data, and using it to present concise insights to working chemists, on a scale that could not be achieved by traditional publication methods. Making this goal practically achievable will require a paradigm shift in the way individual scientists translate their data into digital form, since most contemporary methods of data entry are designed for presentation to humans rather than consumption by machine learning algorithms. We discuss some of the complex issues involved in fixing current methods, as well as some of the immediate benefits that can be gained when open data is published correctly using unambiguous machine readable formats. [Figure: see text] |
format | Online Article Text |
id | pubmed-4369291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-43692912015-03-23 Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data Clark, Alex M Williams, Antony J Ekins, Sean J Cheminform Research Article The current rise in the use of open lab notebook techniques means that there are an increasing number of scientists who make chemical information freely and openly available to the entire community as a series of micropublications that are released shortly after the conclusion of each experiment. We propose that this trend be accompanied by a thorough examination of data sharing priorities. We argue that the most significant immediate benefactor of open data is in fact chemical algorithms, which are capable of absorbing vast quantities of data, and using it to present concise insights to working chemists, on a scale that could not be achieved by traditional publication methods. Making this goal practically achievable will require a paradigm shift in the way individual scientists translate their data into digital form, since most contemporary methods of data entry are designed for presentation to humans rather than consumption by machine learning algorithms. We discuss some of the complex issues involved in fixing current methods, as well as some of the immediate benefits that can be gained when open data is published correctly using unambiguous machine readable formats. [Figure: see text] Springer International Publishing 2015-03-22 /pmc/articles/PMC4369291/ /pubmed/25798198 http://dx.doi.org/10.1186/s13321-015-0057-7 Text en © Clark et al.; licensee Springer. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Article Clark, Alex M Williams, Antony J Ekins, Sean Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title | Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title_full | Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title_fullStr | Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title_full_unstemmed | Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title_short | Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
title_sort | machines first, humans second: on the importance of algorithmic interpretation of open chemistry data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4369291/ https://www.ncbi.nlm.nih.gov/pubmed/25798198 http://dx.doi.org/10.1186/s13321-015-0057-7 |
work_keys_str_mv | AT clarkalexm machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata AT williamsantonyj machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata AT ekinssean machinesfirsthumanssecondontheimportanceofalgorithmicinterpretationofopenchemistrydata |