Cargando…
A multi-template combination algorithm for protein comparative modeling
BACKGROUND: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available. RESULTS: Here we develop an effective multi-template combination algorithm for protein comparative modeling....
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2311309/ https://www.ncbi.nlm.nih.gov/pubmed/18366648 http://dx.doi.org/10.1186/1472-6807-8-18 |
_version_ | 1782152560315990016 |
---|---|
author | Cheng, Jianlin |
author_facet | Cheng, Jianlin |
author_sort | Cheng, Jianlin |
collection | PubMed |
description | BACKGROUND: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available. RESULTS: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i.e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multi-template combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10(-4)). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure. CONCLUSION: We have developed a novel multi-template algorithm to improve protein comparative modeling. |
format | Text |
id | pubmed-2311309 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23113092008-04-16 A multi-template combination algorithm for protein comparative modeling Cheng, Jianlin BMC Struct Biol Methodology Article BACKGROUND: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available. RESULTS: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i.e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multi-template combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10(-4)). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure. CONCLUSION: We have developed a novel multi-template algorithm to improve protein comparative modeling. BioMed Central 2008-03-17 /pmc/articles/PMC2311309/ /pubmed/18366648 http://dx.doi.org/10.1186/1472-6807-8-18 Text en Copyright © 2008 Cheng; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Cheng, Jianlin A multi-template combination algorithm for protein comparative modeling |
title | A multi-template combination algorithm for protein comparative modeling |
title_full | A multi-template combination algorithm for protein comparative modeling |
title_fullStr | A multi-template combination algorithm for protein comparative modeling |
title_full_unstemmed | A multi-template combination algorithm for protein comparative modeling |
title_short | A multi-template combination algorithm for protein comparative modeling |
title_sort | multi-template combination algorithm for protein comparative modeling |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2311309/ https://www.ncbi.nlm.nih.gov/pubmed/18366648 http://dx.doi.org/10.1186/1472-6807-8-18 |
work_keys_str_mv | AT chengjianlin amultitemplatecombinationalgorithmforproteincomparativemodeling AT chengjianlin multitemplatecombinationalgorithmforproteincomparativemodeling |