Cargando…
The Biological Object Notation (BON): a structured file format for biological data
The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is bein...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018389/ https://www.ncbi.nlm.nih.gov/pubmed/29941895 http://dx.doi.org/10.1038/s41598-018-28016-6 |
_version_ | 1783334938589790208 |
---|---|
author | Buchmann, Jan P. Fourment, Mathieu Holmes, Edward C. |
author_facet | Buchmann, Jan P. Fourment, Mathieu Holmes, Edward C. |
author_sort | Buchmann, Jan P. |
collection | PubMed |
description | The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications. |
format | Online Article Text |
id | pubmed-6018389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-60183892018-07-06 The Biological Object Notation (BON): a structured file format for biological data Buchmann, Jan P. Fourment, Mathieu Holmes, Edward C. Sci Rep Article The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications. Nature Publishing Group UK 2018-06-25 /pmc/articles/PMC6018389/ /pubmed/29941895 http://dx.doi.org/10.1038/s41598-018-28016-6 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Buchmann, Jan P. Fourment, Mathieu Holmes, Edward C. The Biological Object Notation (BON): a structured file format for biological data |
title | The Biological Object Notation (BON): a structured file format for biological data |
title_full | The Biological Object Notation (BON): a structured file format for biological data |
title_fullStr | The Biological Object Notation (BON): a structured file format for biological data |
title_full_unstemmed | The Biological Object Notation (BON): a structured file format for biological data |
title_short | The Biological Object Notation (BON): a structured file format for biological data |
title_sort | biological object notation (bon): a structured file format for biological data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018389/ https://www.ncbi.nlm.nih.gov/pubmed/29941895 http://dx.doi.org/10.1038/s41598-018-28016-6 |
work_keys_str_mv | AT buchmannjanp thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata AT fourmentmathieu thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata AT holmesedwardc thebiologicalobjectnotationbonastructuredfileformatforbiologicaldata AT buchmannjanp biologicalobjectnotationbonastructuredfileformatforbiologicaldata AT fourmentmathieu biologicalobjectnotationbonastructuredfileformatforbiologicaldata AT holmesedwardc biologicalobjectnotationbonastructuredfileformatforbiologicaldata |