Cargando…
DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture
We present DAhunter, a web-based server that identifies homologous proteins by comparing domain architectures, the organization of protein domains. A major obstacle in comparison of domain architecture is the existence of ‘promiscuous’ domains, which carry out auxiliary functions and appear in many...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447808/ https://www.ncbi.nlm.nih.gov/pubmed/18411203 http://dx.doi.org/10.1093/nar/gkn172 |
_version_ | 1782157004542836736 |
---|---|
author | Lee, Byungwook Lee, Doheon |
author_facet | Lee, Byungwook Lee, Doheon |
author_sort | Lee, Byungwook |
collection | PubMed |
description | We present DAhunter, a web-based server that identifies homologous proteins by comparing domain architectures, the organization of protein domains. A major obstacle in comparison of domain architecture is the existence of ‘promiscuous’ domains, which carry out auxiliary functions and appear in many unrelated proteins. To distinguish these promiscuous domains from protein domains, we assigned a weight score to each domain extracted from RefSeq proteins, based on its abundance and versatility. A domain's score represents its importance in the ‘protein world’ and is used in the comparison of domain architectures. In scoring domains, DAhunter also considers domain combinations as well as single domains. To measure the similarity of two domain architectures, we developed several methods that are based on algorithms used in information retrieval (the cosine similarity, the Goodman–Kruskal γ function, and domain duplication index) and then combined these into a similarity score. Compared with other domain architecture algorithms, DAhunter is better at identifying homology. The server is available at http://www.dahunter.kr and http://localodom.kobic.re.kr/dahunter/index.htm |
format | Text |
id | pubmed-2447808 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-24478082008-07-09 DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture Lee, Byungwook Lee, Doheon Nucleic Acids Res Articles We present DAhunter, a web-based server that identifies homologous proteins by comparing domain architectures, the organization of protein domains. A major obstacle in comparison of domain architecture is the existence of ‘promiscuous’ domains, which carry out auxiliary functions and appear in many unrelated proteins. To distinguish these promiscuous domains from protein domains, we assigned a weight score to each domain extracted from RefSeq proteins, based on its abundance and versatility. A domain's score represents its importance in the ‘protein world’ and is used in the comparison of domain architectures. In scoring domains, DAhunter also considers domain combinations as well as single domains. To measure the similarity of two domain architectures, we developed several methods that are based on algorithms used in information retrieval (the cosine similarity, the Goodman–Kruskal γ function, and domain duplication index) and then combined these into a similarity score. Compared with other domain architecture algorithms, DAhunter is better at identifying homology. The server is available at http://www.dahunter.kr and http://localodom.kobic.re.kr/dahunter/index.htm Oxford University Press 2008-07-01 2008-04-14 /pmc/articles/PMC2447808/ /pubmed/18411203 http://dx.doi.org/10.1093/nar/gkn172 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Lee, Byungwook Lee, Doheon DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title | DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title_full | DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title_fullStr | DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title_full_unstemmed | DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title_short | DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture |
title_sort | dahunter: a web-based server that identifies homologous proteins by comparing domain architecture |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447808/ https://www.ncbi.nlm.nih.gov/pubmed/18411203 http://dx.doi.org/10.1093/nar/gkn172 |
work_keys_str_mv | AT leebyungwook dahunterawebbasedserverthatidentifieshomologousproteinsbycomparingdomainarchitecture AT leedoheon dahunterawebbasedserverthatidentifieshomologousproteinsbycomparingdomainarchitecture |