Cargando…

ThreaDom: extracting protein domain boundary information from multiple threading alignments

Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Xue, Zhidong, Xu, Dong, Wang, Yan, Zhang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694664/
https://www.ncbi.nlm.nih.gov/pubmed/23812990
http://dx.doi.org/10.1093/bioinformatics/btt209
_version_ 1782274884524572672
author Xue, Zhidong
Xu, Dong
Wang, Yan
Zhang, Yang
author_facet Xue, Zhidong
Xu, Dong
Wang, Yan
Zhang, Yang
author_sort Xue, Zhidong
collection PubMed
description Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions. Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within ±20 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments. Availability: http://zhanglab.ccmb.med.umich.edu/ThreaDom/. Contact: zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3694664
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946642013-06-27 ThreaDom: extracting protein domain boundary information from multiple threading alignments Xue, Zhidong Xu, Dong Wang, Yan Zhang, Yang Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions. Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within ±20 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments. Availability: http://zhanglab.ccmb.med.umich.edu/ThreaDom/. Contact: zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694664/ /pubmed/23812990 http://dx.doi.org/10.1093/bioinformatics/btt209 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Xue, Zhidong
Xu, Dong
Wang, Yan
Zhang, Yang
ThreaDom: extracting protein domain boundary information from multiple threading alignments
title ThreaDom: extracting protein domain boundary information from multiple threading alignments
title_full ThreaDom: extracting protein domain boundary information from multiple threading alignments
title_fullStr ThreaDom: extracting protein domain boundary information from multiple threading alignments
title_full_unstemmed ThreaDom: extracting protein domain boundary information from multiple threading alignments
title_short ThreaDom: extracting protein domain boundary information from multiple threading alignments
title_sort threadom: extracting protein domain boundary information from multiple threading alignments
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694664/
https://www.ncbi.nlm.nih.gov/pubmed/23812990
http://dx.doi.org/10.1093/bioinformatics/btt209
work_keys_str_mv AT xuezhidong threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments
AT xudong threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments
AT wangyan threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments
AT zhangyang threadomextractingproteindomainboundaryinformationfrommultiplethreadingalignments