Cargando…

MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures

Recent advances in experimental techniques have led to a rapid growth in complexity, size, and number of macromolecular structures that are made available through the Protein Data Bank. This creates a challenge for macromolecular visualization and analysis. Macromolecular structure files, such as PD...

Descripción completa

Detalles Bibliográficos
Autores principales: Bradley, Anthony R., Rose, Alexander S., Pavelka, Antonín, Valasatava, Yana, Duarte, Jose M., Prlić, Andreas, Rose, Peter W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5473584/
https://www.ncbi.nlm.nih.gov/pubmed/28574982
http://dx.doi.org/10.1371/journal.pcbi.1005575
_version_ 1783244314774601728
author Bradley, Anthony R.
Rose, Alexander S.
Pavelka, Antonín
Valasatava, Yana
Duarte, Jose M.
Prlić, Andreas
Rose, Peter W.
author_facet Bradley, Anthony R.
Rose, Alexander S.
Pavelka, Antonín
Valasatava, Yana
Duarte, Jose M.
Prlić, Andreas
Rose, Peter W.
author_sort Bradley, Anthony R.
collection PubMed
description Recent advances in experimental techniques have led to a rapid growth in complexity, size, and number of macromolecular structures that are made available through the Protein Data Bank. This creates a challenge for macromolecular visualization and analysis. Macromolecular structure files, such as PDB or PDBx/mmCIF files can be slow to transfer, parse, and hard to incorporate into third-party software tools. Here, we present a new binary and compressed data representation, the MacroMolecular Transmission Format, MMTF, as well as software implementations in several languages that have been developed around it, which address these issues. We describe the new format and its APIs and demonstrate that it is several times faster to parse, and about a quarter of the file size of the current standard format, PDBx/mmCIF. As a consequence of the new data representation, it is now possible to visualize structures with millions of atoms in a web browser, keep the whole PDB archive in memory or parse it within few minutes on average computers, which opens up a new way of thinking how to design and implement efficient algorithms in structural bioinformatics. The PDB archive is available in MMTF file format through web services and data that are updated on a weekly basis.
format Online
Article
Text
id pubmed-5473584
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54735842017-06-26 MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures Bradley, Anthony R. Rose, Alexander S. Pavelka, Antonín Valasatava, Yana Duarte, Jose M. Prlić, Andreas Rose, Peter W. PLoS Comput Biol Research Article Recent advances in experimental techniques have led to a rapid growth in complexity, size, and number of macromolecular structures that are made available through the Protein Data Bank. This creates a challenge for macromolecular visualization and analysis. Macromolecular structure files, such as PDB or PDBx/mmCIF files can be slow to transfer, parse, and hard to incorporate into third-party software tools. Here, we present a new binary and compressed data representation, the MacroMolecular Transmission Format, MMTF, as well as software implementations in several languages that have been developed around it, which address these issues. We describe the new format and its APIs and demonstrate that it is several times faster to parse, and about a quarter of the file size of the current standard format, PDBx/mmCIF. As a consequence of the new data representation, it is now possible to visualize structures with millions of atoms in a web browser, keep the whole PDB archive in memory or parse it within few minutes on average computers, which opens up a new way of thinking how to design and implement efficient algorithms in structural bioinformatics. The PDB archive is available in MMTF file format through web services and data that are updated on a weekly basis. Public Library of Science 2017-06-02 /pmc/articles/PMC5473584/ /pubmed/28574982 http://dx.doi.org/10.1371/journal.pcbi.1005575 Text en © 2017 Bradley et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bradley, Anthony R.
Rose, Alexander S.
Pavelka, Antonín
Valasatava, Yana
Duarte, Jose M.
Prlić, Andreas
Rose, Peter W.
MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title_full MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title_fullStr MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title_full_unstemmed MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title_short MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures
title_sort mmtf—an efficient file format for the transmission, visualization, and analysis of macromolecular structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5473584/
https://www.ncbi.nlm.nih.gov/pubmed/28574982
http://dx.doi.org/10.1371/journal.pcbi.1005575
work_keys_str_mv AT bradleyanthonyr mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT rosealexanders mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT pavelkaantonin mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT valasatavayana mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT duartejosem mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT prlicandreas mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures
AT rosepeterw mmtfanefficientfileformatforthetransmissionvisualizationandanalysisofmacromolecularstructures