localpdb—a Python package to manage protein structures and their annotations
MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional,...
Main authors: | Ludwiczak, Jan; Winski, Aleksander; Dunin-Horkawicz, Stanislaw |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2022 |
Subjects: | Applications Notes |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048648/ https://www.ncbi.nlm.nih.gov/pubmed/35199148 http://dx.doi.org/10.1093/bioinformatics/btac121 |
_version_ | 1784695976709062656 |
---|---|
author | Ludwiczak, Jan Winski, Aleksander Dunin-Horkawicz, Stanislaw |
author_facet | Ludwiczak, Jan Winski, Aleksander Dunin-Horkawicz, Stanislaw |
author_sort | Ludwiczak, Jan |
collection | PubMed |
description | MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional, taxonomic and other annotations. Unfortunately, the functionality of currently available tools for the creation of such datasets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions. RESULTS: To address this problem, we developed localpdb, a versatile Python library for the management of protein structures and their annotations. The library features a flexible plugin system enabling seamless unification of the structural data with diverse auxiliary resources, full version control and powerful functionality of creating highly customized datasets. The localpdb can be used in a wide range of bioinformatic tasks, in particular those involving large-scale protein structural analyses and machine learning. AVAILABILITY AND IMPLEMENTATION: localpdb is freely available at https://github.com/labstructbioinf/localpdb. Documentation along with the usage examples can be accessed at https://labstructbioinf.github.io/localpdb/. |
format | Online Article Text |
id | pubmed-9048648 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-90486482022-04-29 localpdb—a Python package to manage protein structures and their annotations Ludwiczak, Jan Winski, Aleksander Dunin-Horkawicz, Stanislaw Bioinformatics Applications Notes MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional, taxonomic and other annotations. Unfortunately, the functionality of currently available tools for the creation of such datasets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions. RESULTS: To address this problem, we developed localpdb, a versatile Python library for the management of protein structures and their annotations. The library features a flexible plugin system enabling seamless unification of the structural data with diverse auxiliary resources, full version control and powerful functionality of creating highly customized datasets. The localpdb can be used in a wide range of bioinformatic tasks, in particular those involving large-scale protein structural analyses and machine learning. AVAILABILITY AND IMPLEMENTATION: localpdb is freely available at https://github.com/labstructbioinf/localpdb. Documentation along with the usage examples can be accessed at https://labstructbioinf.github.io/localpdb/. Oxford University Press 2022-02-23 /pmc/articles/PMC9048648/ /pubmed/35199148 http://dx.doi.org/10.1093/bioinformatics/btac121 Text en © The Author(s) 2022. Published by Oxford University Press. 
https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Ludwiczak, Jan Winski, Aleksander Dunin-Horkawicz, Stanislaw localpdb—a Python package to manage protein structures and their annotations |
title | localpdb—a Python package to manage protein structures and their annotations |
title_full | localpdb—a Python package to manage protein structures and their annotations |
title_fullStr | localpdb—a Python package to manage protein structures and their annotations |
title_full_unstemmed | localpdb—a Python package to manage protein structures and their annotations |
title_short | localpdb—a Python package to manage protein structures and their annotations |
title_sort | localpdb—a python package to manage protein structures and their annotations |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048648/ https://www.ncbi.nlm.nih.gov/pubmed/35199148 http://dx.doi.org/10.1093/bioinformatics/btac121 |
work_keys_str_mv | AT ludwiczakjan localpdbapythonpackagetomanageproteinstructuresandtheirannotations AT winskialeksander localpdbapythonpackagetomanageproteinstructuresandtheirannotations AT duninhorkawiczstanislaw localpdbapythonpackagetomanageproteinstructuresandtheirannotations |