Cargando…

Designing succinct structural alphabets

Motivation: The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the ‘structural alphabet’ for this segment. Classical approaches...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Shuai Cheng, Bu, Dongbo, Gao, Xin, Xu, Jinbo, Li, Ming
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718643/
https://www.ncbi.nlm.nih.gov/pubmed/18586712
http://dx.doi.org/10.1093/bioinformatics/btn165
_version_ 1782170006454272000
author Li, Shuai Cheng
Bu, Dongbo
Gao, Xin
Xu, Jinbo
Li, Ming
author_facet Li, Shuai Cheng
Bu, Dongbo
Gao, Xin
Xu, Jinbo
Li, Ming
author_sort Li, Shuai Cheng
collection PubMed
description Motivation: The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the ‘structural alphabet’ for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural fragments. In contrast, we utilize more structural information, such as solvent accessibility and contact capacity, for finding structural fragments. Results: Integer linear programming technique is applied to derive the best combination of these sequence and structural information items. This approach generates significantly more accurate and succinct structural alphabets with more than 50% improvement over the previous accuracies. With these novel structural alphabets, we are able to construct more accurate protein structures than the state-of-art ab initio protein structure prediction programs such as ROSETTA. We are also able to reduce the Kolodny's library size by a factor of 8, at the same accuracy. Availability: The online FRazor server is under construction Contact:scli@uwaterloo.ca,mli@uwaterloo.ca, j3xu@tti-c.org
format Text
id pubmed-2718643
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186432009-07-31 Designing succinct structural alphabets Li, Shuai Cheng Bu, Dongbo Gao, Xin Xu, Jinbo Li, Ming Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the ‘structural alphabet’ for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural fragments. In contrast, we utilize more structural information, such as solvent accessibility and contact capacity, for finding structural fragments. Results: Integer linear programming technique is applied to derive the best combination of these sequence and structural information items. This approach generates significantly more accurate and succinct structural alphabets with more than 50% improvement over the previous accuracies. With these novel structural alphabets, we are able to construct more accurate protein structures than the state-of-art ab initio protein structure prediction programs such as ROSETTA. We are also able to reduce the Kolodny's library size by a factor of 8, at the same accuracy. Availability: The online FRazor server is under construction Contact:scli@uwaterloo.ca,mli@uwaterloo.ca, j3xu@tti-c.org Oxford University Press 2008-07-01 /pmc/articles/PMC2718643/ /pubmed/18586712 http://dx.doi.org/10.1093/bioinformatics/btn165 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Li, Shuai Cheng
Bu, Dongbo
Gao, Xin
Xu, Jinbo
Li, Ming
Designing succinct structural alphabets
title Designing succinct structural alphabets
title_full Designing succinct structural alphabets
title_fullStr Designing succinct structural alphabets
title_full_unstemmed Designing succinct structural alphabets
title_short Designing succinct structural alphabets
title_sort designing succinct structural alphabets
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718643/
https://www.ncbi.nlm.nih.gov/pubmed/18586712
http://dx.doi.org/10.1093/bioinformatics/btn165
work_keys_str_mv AT lishuaicheng designingsuccinctstructuralalphabets
AT budongbo designingsuccinctstructuralalphabets
AT gaoxin designingsuccinctstructuralalphabets
AT xujinbo designingsuccinctstructuralalphabets
AT liming designingsuccinctstructuralalphabets