Cargando…

MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis

BACKGROUND: Protein structure data in Protein Data Bank (PDB) are widely used in studies of protein function and evolution and in protein structure prediction. However, there are two main barriers in large-scale usage of PDB data: 1) PDB data are highly redundant in terms of sequence and structure s...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Zhiquan, Zhang, Chao, Xu, Yang, Zeng, Shuai, Zhang, Jingfen, Xu, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304177/
https://www.ncbi.nlm.nih.gov/pubmed/25559128
http://dx.doi.org/10.1186/1471-2164-15-S11-S2
_version_ 1782354049649082368
author He, Zhiquan
Zhang, Chao
Xu, Yang
Zeng, Shuai
Zhang, Jingfen
Xu, Dong
author_facet He, Zhiquan
Zhang, Chao
Xu, Yang
Zeng, Shuai
Zhang, Jingfen
Xu, Dong
author_sort He, Zhiquan
collection PubMed
description BACKGROUND: Protein structure data in Protein Data Bank (PDB) are widely used in studies of protein function and evolution and in protein structure prediction. However, there are two main barriers in large-scale usage of PDB data: 1) PDB data are highly redundant in terms of sequence and structure similarity; and 2) many PDB files have issues due to inconsistency of data and standards as well as missing residues, so that automated retrieval and analysis are often difficult. DESCRIPTION: To address these issues, we have created MUFOLD-DB http://mufold.org/mufolddb.php, a web-based database, to collect and process the weekly PDB files thereby providing users with non-redundant, cleaned and partially-predicted structure data. For each of the non-redundant sequences, we annotate the SCOP domain classification and predict structures of missing regions by loop modelling. In addition, evolutional information, secondary structure, disorder region, and processed three-dimensional structure are computed and visualized to help users better understand the protein. CONCLUSIONS: MUFOLD-DB integrates processed PDB sequence and structure data and multiple computational results, provides a friendly interface for users to retrieve, browse and download these data, and offers several useful functionalities to facilitate users' data operation.
format Online
Article
Text
id pubmed-4304177
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43041772015-02-09 MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis He, Zhiquan Zhang, Chao Xu, Yang Zeng, Shuai Zhang, Jingfen Xu, Dong BMC Genomics Research BACKGROUND: Protein structure data in Protein Data Bank (PDB) are widely used in studies of protein function and evolution and in protein structure prediction. However, there are two main barriers in large-scale usage of PDB data: 1) PDB data are highly redundant in terms of sequence and structure similarity; and 2) many PDB files have issues due to inconsistency of data and standards as well as missing residues, so that automated retrieval and analysis are often difficult. DESCRIPTION: To address these issues, we have created MUFOLD-DB http://mufold.org/mufolddb.php, a web-based database, to collect and process the weekly PDB files thereby providing users with non-redundant, cleaned and partially-predicted structure data. For each of the non-redundant sequences, we annotate the SCOP domain classification and predict structures of missing regions by loop modelling. In addition, evolutional information, secondary structure, disorder region, and processed three-dimensional structure are computed and visualized to help users better understand the protein. CONCLUSIONS: MUFOLD-DB integrates processed PDB sequence and structure data and multiple computational results, provides a friendly interface for users to retrieve, browse and download these data, and offers several useful functionalities to facilitate users' data operation. BioMed Central 2014-12-16 /pmc/articles/PMC4304177/ /pubmed/25559128 http://dx.doi.org/10.1186/1471-2164-15-S11-S2 Text en Copyright © 2014 He et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
He, Zhiquan
Zhang, Chao
Xu, Yang
Zeng, Shuai
Zhang, Jingfen
Xu, Dong
MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title_full MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title_fullStr MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title_full_unstemmed MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title_short MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis
title_sort mufold-db: a processed protein structure database for protein structure prediction and analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304177/
https://www.ncbi.nlm.nih.gov/pubmed/25559128
http://dx.doi.org/10.1186/1471-2164-15-S11-S2
work_keys_str_mv AT hezhiquan mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis
AT zhangchao mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis
AT xuyang mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis
AT zengshuai mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis
AT zhangjingfen mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis
AT xudong mufolddbaprocessedproteinstructuredatabaseforproteinstructurepredictionandanalysis