Cargando…
DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy
MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be imp...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623903/ https://www.ncbi.nlm.nih.gov/pubmed/23593247 http://dx.doi.org/10.1371/journal.pone.0060559 |
_version_ | 1782265992300199936 |
---|---|
author | Zhang, Xiao-yan Lu, Long-jian Song, Qi Yang, Qian-qian Li, Da-peng Sun, Jiang-ming Li, Tong-hua Cong, Pei-sheng |
author_facet | Zhang, Xiao-yan Lu, Long-jian Song, Qi Yang, Qian-qian Li, Da-peng Sun, Jiang-ming Li, Tong-hua Cong, Pei-sheng |
author_sort | Zhang, Xiao-yan |
collection | PubMed |
description | MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved. RESULTS: In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy. AVAILABILITY: The DomHR is available at http://cal.tongji.edu.cn/domain/. |
format | Online Article Text |
id | pubmed-3623903 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-36239032013-04-16 DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy Zhang, Xiao-yan Lu, Long-jian Song, Qi Yang, Qian-qian Li, Da-peng Sun, Jiang-ming Li, Tong-hua Cong, Pei-sheng PLoS One Research Article MOTIVATION: The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved. RESULTS: In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy. AVAILABILITY: The DomHR is available at http://cal.tongji.edu.cn/domain/. Public Library of Science 2013-04-11 /pmc/articles/PMC3623903/ /pubmed/23593247 http://dx.doi.org/10.1371/journal.pone.0060559 Text en © 2013 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zhang, Xiao-yan Lu, Long-jian Song, Qi Yang, Qian-qian Li, Da-peng Sun, Jiang-ming Li, Tong-hua Cong, Pei-sheng DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title | DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title_full | DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title_fullStr | DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title_full_unstemmed | DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title_short | DomHR: Accurately Identifying Domain Boundaries in Proteins Using a Hinge Region Strategy |
title_sort | domhr: accurately identifying domain boundaries in proteins using a hinge region strategy |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623903/ https://www.ncbi.nlm.nih.gov/pubmed/23593247 http://dx.doi.org/10.1371/journal.pone.0060559 |
work_keys_str_mv | AT zhangxiaoyan domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT lulongjian domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT songqi domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT yangqianqian domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT lidapeng domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT sunjiangming domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT litonghua domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy AT congpeisheng domhraccuratelyidentifyingdomainboundariesinproteinsusingahingeregionstrategy |