Cargando…
ThreaDom: extracting protein domain boundary information from multiple threading alignments
Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. T...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694664/ https://www.ncbi.nlm.nih.gov/pubmed/23812990 http://dx.doi.org/10.1093/bioinformatics/btt209 |
_version_ | 1782274884524572672 |
---|---|
author | Xue, Zhidong Xu, Dong Wang, Yan Zhang, Yang |
author_facet | Xue, Zhidong Xu, Dong Wang, Yan Zhang, Yang |
author_sort | Xue, Zhidong |
collection | PubMed |
description | Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions. Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within ±20 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments. Availability: http://zhanglab.ccmb.med.umich.edu/ThreaDom/. Contact: zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3694664 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36946642013-06-27 ThreaDom: extracting protein domain boundary information from multiple threading alignments Xue, Zhidong Xu, Dong Wang, Yan Zhang, Yang Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions. Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within ±20 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments. Availability: http://zhanglab.ccmb.med.umich.edu/ThreaDom/. Contact: zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694664/ /pubmed/23812990 http://dx.doi.org/10.1093/bioinformatics/btt209 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Xue, Zhidong Xu, Dong Wang, Yan Zhang, Yang ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title | ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title_full | ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title_fullStr | ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title_full_unstemmed | ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title_short | ThreaDom: extracting protein domain boundary information from multiple threading alignments |
title_sort | threadom: extracting protein domain boundary information from multiple threading alignments |
topic | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694664/ https://www.ncbi.nlm.nih.gov/pubmed/23812990 http://dx.doi.org/10.1093/bioinformatics/btt209 |
work_keys_str_mv | AT xuezhidong threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments AT xudong threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments AT wangyan threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments AT zhangyang threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments |