Cargando…

The Biological Object Notation (BON): a structured file format for biological data

The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is bein...

Descripción completa

Detalles Bibliográficos
Autores principales: Buchmann, Jan P., Fourment, Mathieu, Holmes, Edward C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018389/
https://www.ncbi.nlm.nih.gov/pubmed/29941895
http://dx.doi.org/10.1038/s41598-018-28016-6
_version_ 1783334938589790208
author Buchmann, Jan P.
Fourment, Mathieu
Holmes, Edward C.
author_facet Buchmann, Jan P.
Fourment, Mathieu
Holmes, Edward C.
author_sort Buchmann, Jan P.
collection PubMed
description The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications.
format Online
Article
Text
id pubmed-6018389
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60183892018-07-06 The Biological Object Notation (BON): a structured file format for biological data Buchmann, Jan P. Fourment, Mathieu Holmes, Edward C. Sci Rep Article The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications. Nature Publishing Group UK 2018-06-25 /pmc/articles/PMC6018389/ /pubmed/29941895 http://dx.doi.org/10.1038/s41598-018-28016-6 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Buchmann, Jan P.
Fourment, Mathieu
Holmes, Edward C.
The Biological Object Notation (BON): a structured file format for biological data
title The Biological Object Notation (BON): a structured file format for biological data
title_full The Biological Object Notation (BON): a structured file format for biological data
title_fullStr The Biological Object Notation (BON): a structured file format for biological data
title_full_unstemmed The Biological Object Notation (BON): a structured file format for biological data
title_short The Biological Object Notation (BON): a structured file format for biological data
title_sort biological object notation (bon): a structured file format for biological data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018389/
https://www.ncbi.nlm.nih.gov/pubmed/29941895
http://dx.doi.org/10.1038/s41598-018-28016-6
work_keys_str_mv AT buchmannjanp thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata
AT fourmentmathieu thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata
AT holmesedwardc thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata
AT buchmannjanp biologicalobjectnotationbonastructuredfileformatforbiologicaldata
AT fourmentmathieu biologicalobjectnotationbonastructuredfileformatforbiologicaldata
AT holmesedwardc biologicalobjectnotationbonastructuredfileformatforbiologicaldata