Cargando…
MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains
BACKGROUND: Many protein interactions, especially those involved in signaling, involve short linear motifs consisting of 5-10 amino acid residues that interact with modular protein domains such as the SH3 binding domains and the kinase catalytic domains. One straightforward way of identifying these...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2882932/ https://www.ncbi.nlm.nih.gov/pubmed/20459839 http://dx.doi.org/10.1186/1471-2105-11-243 |
_version_ | 1782182222366769152 |
---|---|
author | Lam, Hugo YK Kim, Philip M Mok, Janine Tonikian, Raffi Sidhu, Sachdev S Turk, Benjamin E Snyder, Michael Gerstein, Mark B |
author_facet | Lam, Hugo YK Kim, Philip M Mok, Janine Tonikian, Raffi Sidhu, Sachdev S Turk, Benjamin E Snyder, Michael Gerstein, Mark B |
author_sort | Lam, Hugo YK |
collection | PubMed |
description | BACKGROUND: Many protein interactions, especially those involved in signaling, involve short linear motifs consisting of 5-10 amino acid residues that interact with modular protein domains such as the SH3 binding domains and the kinase catalytic domains. One straightforward way of identifying these interactions is by scanning for matches to the motif against all the sequences in a target proteome. However, predicting domain targets by motif sequence alone without considering other genomic and structural information has been shown to be lacking in accuracy. RESULTS: We developed an efficient search algorithm to scan the target proteome for potential domain targets and to increase the accuracy of each hit by integrating a variety of pre-computed features, such as conservation, surface propensity, and disorder. The integration is performed using naïve Bayes and a training set of validated experiments. CONCLUSIONS: By integrating a variety of biologically relevant features to predict domain targets, we demonstrated a notably improved prediction of modular protein domain targets. Combined with emerging high-resolution data of domain specificities, we believe that our approach can assist in the reconstruction of many signaling pathways. |
format | Text |
id | pubmed-2882932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28829322010-06-10 MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains Lam, Hugo YK Kim, Philip M Mok, Janine Tonikian, Raffi Sidhu, Sachdev S Turk, Benjamin E Snyder, Michael Gerstein, Mark B BMC Bioinformatics Methodology article BACKGROUND: Many protein interactions, especially those involved in signaling, involve short linear motifs consisting of 5-10 amino acid residues that interact with modular protein domains such as the SH3 binding domains and the kinase catalytic domains. One straightforward way of identifying these interactions is by scanning for matches to the motif against all the sequences in a target proteome. However, predicting domain targets by motif sequence alone without considering other genomic and structural information has been shown to be lacking in accuracy. RESULTS: We developed an efficient search algorithm to scan the target proteome for potential domain targets and to increase the accuracy of each hit by integrating a variety of pre-computed features, such as conservation, surface propensity, and disorder. The integration is performed using naïve Bayes and a training set of validated experiments. CONCLUSIONS: By integrating a variety of biologically relevant features to predict domain targets, we demonstrated a notably improved prediction of modular protein domain targets. Combined with emerging high-resolution data of domain specificities, we believe that our approach can assist in the reconstruction of many signaling pathways. BioMed Central 2010-05-11 /pmc/articles/PMC2882932/ /pubmed/20459839 http://dx.doi.org/10.1186/1471-2105-11-243 Text en Copyright ©2010 Lam et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology article Lam, Hugo YK Kim, Philip M Mok, Janine Tonikian, Raffi Sidhu, Sachdev S Turk, Benjamin E Snyder, Michael Gerstein, Mark B MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title | MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title_full | MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title_fullStr | MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title_full_unstemmed | MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title_short | MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains |
title_sort | motips: automated motif analysis for predicting targets of modular protein domains |
topic | Methodology article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2882932/ https://www.ncbi.nlm.nih.gov/pubmed/20459839 http://dx.doi.org/10.1186/1471-2105-11-243 |
work_keys_str_mv | AT lamhugoyk motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT kimphilipm motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT mokjanine motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT tonikianraffi motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT sidhusachdevs motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT turkbenjamine motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT snydermichael motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains AT gersteinmarkb motipsautomatedmotifanalysisforpredictingtargetsofmodularproteindomains |