Cargando…
Domain fusion analysis by applying relational algebra to protein sequence and domain databases
BACKGROUND: Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and ama...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2003
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC156618/ https://www.ncbi.nlm.nih.gov/pubmed/12734020 http://dx.doi.org/10.1186/1471-2105-4-16 |
_version_ | 1782120785945559040 |
---|---|
author | Truong, Kevin Ikura, Mitsuhiko |
author_facet | Truong, Kevin Ikura, Mitsuhiko |
author_sort | Truong, Kevin |
collection | PubMed |
description | BACKGROUND: Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. RESULTS: This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at . CONCLUSION: As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time. |
format | Text |
id | pubmed-156618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1566182003-06-05 Domain fusion analysis by applying relational algebra to protein sequence and domain databases Truong, Kevin Ikura, Mitsuhiko BMC Bioinformatics Methodology Article BACKGROUND: Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. RESULTS: This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at . CONCLUSION: As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time. BioMed Central 2003-05-06 /pmc/articles/PMC156618/ /pubmed/12734020 http://dx.doi.org/10.1186/1471-2105-4-16 Text en Copyright © 2003 Truong and Ikura; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Methodology Article Truong, Kevin Ikura, Mitsuhiko Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title | Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_full | Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_fullStr | Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_full_unstemmed | Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_short | Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_sort | domain fusion analysis by applying relational algebra to protein sequence and domain databases |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC156618/ https://www.ncbi.nlm.nih.gov/pubmed/12734020 http://dx.doi.org/10.1186/1471-2105-4-16 |
work_keys_str_mv | AT truongkevin domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases AT ikuramitsuhiko domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases |