Cargando…

localpdb—a Python package to manage protein structures and their annotations

MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ludwiczak, Jan, Winski, Aleksander, Dunin-Horkawicz, Stanislaw
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048648/
https://www.ncbi.nlm.nih.gov/pubmed/35199148
http://dx.doi.org/10.1093/bioinformatics/btac121
_version_ 1784695976709062656
author Ludwiczak, Jan
Winski, Aleksander
Dunin-Horkawicz, Stanislaw
author_facet Ludwiczak, Jan
Winski, Aleksander
Dunin-Horkawicz, Stanislaw
author_sort Ludwiczak, Jan
collection PubMed
description MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional, taxonomic and other annotations. Unfortunately, the functionality of currently available tools for the creation of such datasets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions. RESULTS: To address this problem, we developed localpdb, a versatile Python library for the management of protein structures and their annotations. The library features a flexible plugin system enabling seamless unification of the structural data with diverse auxiliary resources, full version control and powerful functionality of creating highly customized datasets. The localpdb can be used in a wide range of bioinformatic tasks, in particular those involving large-scale protein structural analyses and machine learning. AVAILABILITY AND IMPLEMENTATION: localpdb is freely available at https://github.com/labstructbioinf/localpdb. Documentation along with the usage examples can be accessed at https://labstructbioinf.github.io/localpdb/.
format Online
Article
Text
id pubmed-9048648
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90486482022-04-29 localpdb—a Python package to manage protein structures and their annotations Ludwiczak, Jan Winski, Aleksander Dunin-Horkawicz, Stanislaw Bioinformatics Applications Notes MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional, taxonomic and other annotations. Unfortunately, the functionality of currently available tools for the creation of such datasets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions. RESULTS: To address this problem, we developed localpdb, a versatile Python library for the management of protein structures and their annotations. The library features a flexible plugin system enabling seamless unification of the structural data with diverse auxiliary resources, full version control and powerful functionality of creating highly customized datasets. The localpdb can be used in a wide range of bioinformatic tasks, in particular those involving large-scale protein structural analyses and machine learning. AVAILABILITY AND IMPLEMENTATION: localpdb is freely available at https://github.com/labstructbioinf/localpdb. Documentation along with the usage examples can be accessed at https://labstructbioinf.github.io/localpdb/. Oxford University Press 2022-02-23 /pmc/articles/PMC9048648/ /pubmed/35199148 http://dx.doi.org/10.1093/bioinformatics/btac121 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Ludwiczak, Jan
Winski, Aleksander
Dunin-Horkawicz, Stanislaw
localpdb—a Python package to manage protein structures and their annotations
title localpdb—a Python package to manage protein structures and their annotations
title_full localpdb—a Python package to manage protein structures and their annotations
title_fullStr localpdb—a Python package to manage protein structures and their annotations
title_full_unstemmed localpdb—a Python package to manage protein structures and their annotations
title_short localpdb—a Python package to manage protein structures and their annotations
title_sort localpdb—a python package to manage protein structures and their annotations
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048648/
https://www.ncbi.nlm.nih.gov/pubmed/35199148
http://dx.doi.org/10.1093/bioinformatics/btac121
work_keys_str_mv AT ludwiczakjan localpdbapythonpackagetomanageproteinstructuresandtheirannotations
AT winskialeksander localpdbapythonpackagetomanageproteinstructuresandtheirannotations
AT duninhorkawiczstanislaw localpdbapythonpackagetomanageproteinstructuresandtheirannotations