Cargando…
Extension of the sasCIF format and its applications for data processing and deposition
Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Union of Crystallography
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762569/ https://www.ncbi.nlm.nih.gov/pubmed/26937242 http://dx.doi.org/10.1107/S1600576715024942 |
_version_ | 1782417129971122176 |
---|---|
author | Kachala, Michael Westbrook, John Svergun, Dmitri |
author_facet | Kachala, Michael Westbrook, John Svergun, Dmitri |
author_sort | Kachala, Michael |
collection | PubMed |
description | Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To organize the data and models and make them accessible to the community, the Task Forces on SAS and hybrid methods for the International Union of Crystallography and the Worldwide Protein Data Bank envisage developing a federated approach to SAS data and model archiving. Within the framework of this approach, the existing databases may exchange information and provide independent but synchronized entries to users. At present, ways of exchanging information between the various SAS databases are not established, leading to possible duplication and incompatibility of entries, and limiting the opportunities for data-driven research for SAS users. In this work, a solution is developed to resolve these issues and provide a universal exchange format for the community, based on the use of the widely adopted crystallographic information framework (CIF). The previous version of the sasCIF format, implemented as an extension of the core CIF dictionary, has been available since 2000 to facilitate SAS data exchange between laboratories. The sasCIF format has now been extended to describe comprehensively the necessary experimental information, results and models, including relevant metadata for SAS data analysis and for deposition into a database. Processing tools for these files (sasCIFtools) have been developed, and these are available both as standalone open-source programs and integrated into the SAS Biological Data Bank, allowing the export and import of data entries as sasCIF files. Software modules to save the relevant information directly from beamline data-processing pipelines in sasCIF format are also developed. This update of sasCIF and the relevant tools are an important step in the standardization of the way SAS data are presented and exchanged, to make the results easily accessible to users and to promote further the application of SAS in the structural biology community. |
format | Online Article Text |
id | pubmed-4762569 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | International Union of Crystallography |
record_format | MEDLINE/PubMed |
spelling | pubmed-47625692016-03-02 Extension of the sasCIF format and its applications for data processing and deposition Kachala, Michael Westbrook, John Svergun, Dmitri J Appl Crystallogr CIF applications Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To organize the data and models and make them accessible to the community, the Task Forces on SAS and hybrid methods for the International Union of Crystallography and the Worldwide Protein Data Bank envisage developing a federated approach to SAS data and model archiving. Within the framework of this approach, the existing databases may exchange information and provide independent but synchronized entries to users. At present, ways of exchanging information between the various SAS databases are not established, leading to possible duplication and incompatibility of entries, and limiting the opportunities for data-driven research for SAS users. In this work, a solution is developed to resolve these issues and provide a universal exchange format for the community, based on the use of the widely adopted crystallographic information framework (CIF). The previous version of the sasCIF format, implemented as an extension of the core CIF dictionary, has been available since 2000 to facilitate SAS data exchange between laboratories. The sasCIF format has now been extended to describe comprehensively the necessary experimental information, results and models, including relevant metadata for SAS data analysis and for deposition into a database. Processing tools for these files (sasCIFtools) have been developed, and these are available both as standalone open-source programs and integrated into the SAS Biological Data Bank, allowing the export and import of data entries as sasCIF files. Software modules to save the relevant information directly from beamline data-processing pipelines in sasCIF format are also developed. This update of sasCIF and the relevant tools are an important step in the standardization of the way SAS data are presented and exchanged, to make the results easily accessible to users and to promote further the application of SAS in the structural biology community. International Union of Crystallography 2016-02-01 /pmc/articles/PMC4762569/ /pubmed/26937242 http://dx.doi.org/10.1107/S1600576715024942 Text en © Michael Kachala et al. 2016 http://creativecommons.org/licenses/by/2.0/uk/ This is an open-access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited. |
spellingShingle | CIF applications Kachala, Michael Westbrook, John Svergun, Dmitri Extension of the sasCIF format and its applications for data processing and deposition |
title | Extension of the sasCIF format and its applications for data processing and deposition |
title_full | Extension of the sasCIF format and its applications for data processing and deposition |
title_fullStr | Extension of the sasCIF format and its applications for data processing and deposition |
title_full_unstemmed | Extension of the sasCIF format and its applications for data processing and deposition |
title_short | Extension of the sasCIF format and its applications for data processing and deposition |
title_sort | extension of the sascif format and its applications for data processing and deposition |
topic | CIF applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762569/ https://www.ncbi.nlm.nih.gov/pubmed/26937242 http://dx.doi.org/10.1107/S1600576715024942 |
work_keys_str_mv | AT kachalamichael extensionofthesascifformatanditsapplicationsfordataprocessinganddeposition AT westbrookjohn extensionofthesascifformatanditsapplicationsfordataprocessinganddeposition AT svergundmitri extensionofthesascifformatanditsapplicationsfordataprocessinganddeposition |