Cargando…

A conditional neural fields model for protein threading

Motivation: Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (<30%). Results: We present a novel protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Jianzhu, Peng, Jian, Wang, Sheng, Xu, Jinbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371845/
https://www.ncbi.nlm.nih.gov/pubmed/22689779
http://dx.doi.org/10.1093/bioinformatics/bts213
_version_ 1782235269080023040
author Ma, Jianzhu
Peng, Jian
Wang, Sheng
Xu, Jinbo
author_facet Ma, Jianzhu
Peng, Jian
Wang, Sheng
Xu, Jinbo
author_sort Ma, Jianzhu
collection PubMed
description Motivation: Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (<30%). Results: We present a novel protein threading method, CNFpred, which achieves much more accurate sequence–template alignment by employing a probabilistic graphical model called a Conditional Neural Field (CNF), which aligns one protein sequence to its remote template using a non-linear scoring function. This scoring function accounts for correlation among a variety of protein sequence and structure features, makes use of information in the neighborhood of two residues to be aligned, and is thus much more sensitive than the widely used linear or profile-based scoring function. To train this CNF threading model, we employ a novel quality-sensitive method, instead of the standard maximum-likelihood method, to maximize directly the expected quality of the training set. Experimental results show that CNFpred generates significantly better alignments than the best profile-based and threading methods on several public (but small) benchmarks as well as our own large dataset. CNFpred outperforms others regardless of the lengths or classes of proteins, and works particularly well for proteins with sparse sequence profiles due to the effective utilization of structure information. Our methodology can also be adapted to protein sequence alignment. Contact: j3xu@ttic.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3371845
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33718452012-06-11 A conditional neural fields model for protein threading Ma, Jianzhu Peng, Jian Wang, Sheng Xu, Jinbo Bioinformatics Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Motivation: Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (<30%). Results: We present a novel protein threading method, CNFpred, which achieves much more accurate sequence–template alignment by employing a probabilistic graphical model called a Conditional Neural Field (CNF), which aligns one protein sequence to its remote template using a non-linear scoring function. This scoring function accounts for correlation among a variety of protein sequence and structure features, makes use of information in the neighborhood of two residues to be aligned, and is thus much more sensitive than the widely used linear or profile-based scoring function. To train this CNF threading model, we employ a novel quality-sensitive method, instead of the standard maximum-likelihood method, to maximize directly the expected quality of the training set. Experimental results show that CNFpred generates significantly better alignments than the best profile-based and threading methods on several public (but small) benchmarks as well as our own large dataset. CNFpred outperforms others regardless of the lengths or classes of proteins, and works particularly well for proteins with sparse sequence profiles due to the effective utilization of structure information. Our methodology can also be adapted to protein sequence alignment. Contact: j3xu@ttic.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-06-15 2012-06-09 /pmc/articles/PMC3371845/ /pubmed/22689779 http://dx.doi.org/10.1093/bioinformatics/bts213 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
Ma, Jianzhu
Peng, Jian
Wang, Sheng
Xu, Jinbo
A conditional neural fields model for protein threading
title A conditional neural fields model for protein threading
title_full A conditional neural fields model for protein threading
title_fullStr A conditional neural fields model for protein threading
title_full_unstemmed A conditional neural fields model for protein threading
title_short A conditional neural fields model for protein threading
title_sort conditional neural fields model for protein threading
topic Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371845/
https://www.ncbi.nlm.nih.gov/pubmed/22689779
http://dx.doi.org/10.1093/bioinformatics/bts213
work_keys_str_mv AT majianzhu aconditionalneuralfieldsmodelforproteinthreading
AT pengjian aconditionalneuralfieldsmodelforproteinthreading
AT wangsheng aconditionalneuralfieldsmodelforproteinthreading
AT xujinbo aconditionalneuralfieldsmodelforproteinthreading
AT majianzhu conditionalneuralfieldsmodelforproteinthreading
AT pengjian conditionalneuralfieldsmodelforproteinthreading
AT wangsheng conditionalneuralfieldsmodelforproteinthreading
AT xujinbo conditionalneuralfieldsmodelforproteinthreading