A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction

Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are ben...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Renxiang, Xu, Dong, Yang, Jianyi, Walker, Sara, Zhang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3965362/
https://www.ncbi.nlm.nih.gov/pubmed/24018415
http://dx.doi.org/10.1038/srep02619
_version_ 1782308785161764864
author Yan, Renxiang
Xu, Dong
Yang, Jianyi
Walker, Sara
Zhang, Yang
author_facet Yan, Renxiang
Xu, Dong
Yang, Jianyi
Walker, Sara
Zhang, Yang
author_sort Yan, Renxiang
collection PubMed
description Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold-recognition on a uniform template library. Results demonstrate dominant advantage of profile-profile based methods, which generate models with average TM-score 26.5% higher than sequence-profile methods and 49.8% higher than sequence-sequence alignment methods. There is no obvious difference in results between methods with profiles generated from PSI-BLAST PSSM matrix and hidden Markov models. Accuracy of profile-profile alignments can be further improved by 9.6% or 21.4% when predicted or native structure features are incorporated. Nevertheless, TM-scores from profile-profile methods including experimental structural features are still 37.1% lower than that from TM-align, demonstrating that the fold-recognition problem cannot be solved solely by improving accuracy of structure feature predictions.
format Online
Article
Text
id pubmed-3965362
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-39653622014-04-02 A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction Yan, Renxiang Xu, Dong Yang, Jianyi Walker, Sara Zhang, Yang Sci Rep Article Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold-recognition on a uniform template library. Results demonstrate dominant advantage of profile-profile based methods, which generate models with average TM-score 26.5% higher than sequence-profile methods and 49.8% higher than sequence-sequence alignment methods. There is no obvious difference in results between methods with profiles generated from PSI-BLAST PSSM matrix and hidden Markov models. Accuracy of profile-profile alignments can be further improved by 9.6% or 21.4% when predicted or native structure features are incorporated. Nevertheless, TM-scores from profile-profile methods including experimental structural features are still 37.1% lower than that from TM-align, demonstrating that the fold-recognition problem cannot be solved solely by improving accuracy of structure feature predictions. Nature Publishing Group 2013-09-10 /pmc/articles/PMC3965362/ /pubmed/24018415 http://dx.doi.org/10.1038/srep02619 Text en Copyright © 2013, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-sa/3.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareALike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/
spellingShingle Article
Yan, Renxiang
Xu, Dong
Yang, Jianyi
Walker, Sara
Zhang, Yang
A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title_full A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title_fullStr A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title_full_unstemmed A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title_short A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
title_sort comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3965362/
https://www.ncbi.nlm.nih.gov/pubmed/24018415
http://dx.doi.org/10.1038/srep02619
work_keys_str_mv AT yanrenxiang acomparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT xudong acomparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT yangjianyi acomparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT walkersara acomparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT zhangyang acomparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT yanrenxiang comparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT xudong comparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT yangjianyi comparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT walkersara comparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction
AT zhangyang comparativeassessmentandanalysisof20representativesequencealignmentmethodsforproteinstructureprediction