Cargando…

DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy

MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be imp...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xiao-yan, Lu, Long-jian, Song, Qi, Yang, Qian-qian, Li, Da-peng, Sun, Jiang-ming, Li, Tong-hua, Cong, Pei-sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623903/
https://www.ncbi.nlm.nih.gov/pubmed/23593247
http://dx.doi.org/10.1371/journal.pone.0060559
_version_ 1782265992300199936
author Zhang, Xiao-yan
Lu, Long-jian
Song, Qi
Yang, Qian-qian
Li, Da-peng
Sun, Jiang-ming
Li, Tong-hua
Cong, Pei-sheng
author_facet Zhang, Xiao-yan
Lu, Long-jian
Song, Qi
Yang, Qian-qian
Li, Da-peng
Sun, Jiang-ming
Li, Tong-hua
Cong, Pei-sheng
author_sort Zhang, Xiao-yan
collection PubMed
description MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved. RESULTS: In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy. AVAILABILITY: The DomHR is available at http://cal.tongji.edu.cn/domain/.
format Online
Article
Text
id pubmed-3623903
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36239032013-04-16 DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy Zhang, Xiao-yan Lu, Long-jian Song, Qi Yang, Qian-qian Li, Da-peng Sun, Jiang-ming Li, Tong-hua Cong, Pei-sheng PLoS One Research Article MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved. RESULTS: In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy. AVAILABILITY: The DomHR is available at http://cal.tongji.edu.cn/domain/. Public Library of Science 2013-04-11 /pmc/articles/PMC3623903/ /pubmed/23593247 http://dx.doi.org/10.1371/journal.pone.0060559 Text en © 2013 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Xiao-yan
Lu, Long-jian
Song, Qi
Yang, Qian-qian
Li, Da-peng
Sun, Jiang-ming
Li, Tong-hua
Cong, Pei-sheng
DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title_full DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title_fullStr DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title_full_unstemmed DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title_short DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
title_sort domhr: accurately identifying domain boundaries in proteins using a hinge region strategy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623903/
https://www.ncbi.nlm.nih.gov/pubmed/23593247
http://dx.doi.org/10.1371/journal.pone.0060559
work_keys_str_mv AT zhangxiaoyan domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT lulongjian domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT songqi domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT yangqianqian domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT lidapeng domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT sunjiangming domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT litonghua domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy
AT congpeisheng domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy