Cargando…

DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy

MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be imp...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xiao-yan, Lu, Long-jian, Song, Qi, Yang, Qian-qian, Li, Da-peng, Sun, Jiang-ming, Li, Tong-hua, Cong, Pei-sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623903/
https://www.ncbi.nlm.nih.gov/pubmed/23593247
http://dx.doi.org/10.1371/journal.pone.0060559
Descripción
Sumario:MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved. RESULTS: In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy. AVAILABILITY: The DomHR is available at http://cal.tongji.edu.cn/domain/.