Cargando…

Complementing sequence-derived features with structural information extracted from fragment libraries for protein structure prediction

BACKGROUND: Fragment libraries play a key role in fragment-assembly based protein structure prediction, where protein fragments are assembled to form a complete three-dimensional structure. Rich and accurate structural information embedded in fragment libraries has not been systematically extracted...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Siyuan, Wang, Tong, Xu, Qijiang, Shao, Bin, Yin, Jian, Liu, Tie-Yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240311/
https://www.ncbi.nlm.nih.gov/pubmed/34182922
http://dx.doi.org/10.1186/s12859-021-04258-6
Descripción
Sumario:BACKGROUND: Fragment libraries play a key role in fragment-assembly based protein structure prediction, where protein fragments are assembled to form a complete three-dimensional structure. Rich and accurate structural information embedded in fragment libraries has not been systematically extracted and used beyond fragment assembly. METHODS: To better leverage the valuable structural information for protein structure prediction, we extracted seven types of structural information from fragment libraries. We broadened the usage of such structural information by transforming fragment libraries into protein-specific potentials for gradient-descent based protein folding and encoding fragment libraries as structural features for protein property prediction. RESULTS: Fragment libraires improved the accuracy of protein folding and outperformed state-of-the-art algorithms with respect to predicted properties, such as torsion angles and inter-residue distances. CONCLUSION: Our work implies that the rich structural information extracted from fragment libraries can complement sequence-derived features to help protein structure prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04258-6.