Cargando…

Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets

Motivation: Proteins often recognize their interaction partners on the basis of short linear motifs located in disordered regions on proteins’ surface. Experimental techniques that study such motifs use short peptides to mimic the structural properties of interacting proteins. Continued development...

Descripción completa

Detalles Bibliográficos
Autores principales:	Krejci, Adam, Hupp, Ted R., Lexa, Matej, Vojtesek, Borivoj, Muller, Petr
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2016
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4681989/ https://www.ncbi.nlm.nih.gov/pubmed/26342231 http://dx.doi.org/10.1093/bioinformatics/btv522

_version_	1782405811569426432
author	Krejci, Adam Hupp, Ted R. Lexa, Matej Vojtesek, Borivoj Muller, Petr
author_facet	Krejci, Adam Hupp, Ted R. Lexa, Matej Vojtesek, Borivoj Muller, Petr
author_sort	Krejci, Adam
collection	PubMed
description	Motivation: Proteins often recognize their interaction partners on the basis of short linear motifs located in disordered regions on proteins’ surface. Experimental techniques that study such motifs use short peptides to mimic the structural properties of interacting proteins. Continued development of these methods allows for large-scale screening, resulting in vast amounts of peptide sequences, potentially containing information on multiple protein-protein interactions. Processing of such datasets is a complex but essential task for large-scale studies investigating protein-protein interactions. Results: The software tool presented in this article is able to rapidly identify multiple clusters of sequences carrying shared specificity motifs in massive datasets from various sources and generate multiple sequence alignments of identified clusters. The method was applied on a previously published smaller dataset containing distinct classes of ligands for SH3 domains, as well as on a new, an order of magnitude larger dataset containing epitopes for several monoclonal antibodies. The software successfully identified clusters of sequences mimicking epitopes of antibody targets, as well as secondary clusters revealing that the antibodies accept some deviations from original epitope sequences. Another test indicates that processing of even much larger datasets is computationally feasible. Availability and implementation: Hammock is published under GNU GPL v. 3 license and is freely available as a standalone program (from http://www.recamo.cz/en/software/hammock-cluster-peptides/) or as a tool for the Galaxy toolbox (from https://toolshed.g2.bx.psu.edu/view/hammock/hammock). The source code can be downloaded from https://github.com/hammock-dev/hammock/releases. Contact: muller@mou.cz Supplementary information: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-4681989
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-46819892015-12-18 Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets Krejci, Adam Hupp, Ted R. Lexa, Matej Vojtesek, Borivoj Muller, Petr Bioinformatics Original Papers Motivation: Proteins often recognize their interaction partners on the basis of short linear motifs located in disordered regions on proteins’ surface. Experimental techniques that study such motifs use short peptides to mimic the structural properties of interacting proteins. Continued development of these methods allows for large-scale screening, resulting in vast amounts of peptide sequences, potentially containing information on multiple protein-protein interactions. Processing of such datasets is a complex but essential task for large-scale studies investigating protein-protein interactions. Results: The software tool presented in this article is able to rapidly identify multiple clusters of sequences carrying shared specificity motifs in massive datasets from various sources and generate multiple sequence alignments of identified clusters. The method was applied on a previously published smaller dataset containing distinct classes of ligands for SH3 domains, as well as on a new, an order of magnitude larger dataset containing epitopes for several monoclonal antibodies. The software successfully identified clusters of sequences mimicking epitopes of antibody targets, as well as secondary clusters revealing that the antibodies accept some deviations from original epitope sequences. Another test indicates that processing of even much larger datasets is computationally feasible. Availability and implementation: Hammock is published under GNU GPL v. 3 license and is freely available as a standalone program (from http://www.recamo.cz/en/software/hammock-cluster-peptides/) or as a tool for the Galaxy toolbox (from https://toolshed.g2.bx.psu.edu/view/hammock/hammock). The source code can be downloaded from https://github.com/hammock-dev/hammock/releases. Contact: muller@mou.cz Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-01-01 2015-09-05 /pmc/articles/PMC4681989/ /pubmed/26342231 http://dx.doi.org/10.1093/bioinformatics/btv522 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Krejci, Adam Hupp, Ted R. Lexa, Matej Vojtesek, Borivoj Muller, Petr Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title	Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title_full	Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title_fullStr	Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title_full_unstemmed	Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title_short	Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
title_sort	hammock: a hidden markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4681989/ https://www.ncbi.nlm.nih.gov/pubmed/26342231 http://dx.doi.org/10.1093/bioinformatics/btv522
work_keys_str_mv	AT krejciadam hammockahiddenmarkovmodelbasedpeptideclusteringalgorithmtoidentifyproteininteractionconsensusmotifsinlargedatasets AT hupptedr hammockahiddenmarkovmodelbasedpeptideclusteringalgorithmtoidentifyproteininteractionconsensusmotifsinlargedatasets AT lexamatej hammockahiddenmarkovmodelbasedpeptideclusteringalgorithmtoidentifyproteininteractionconsensusmotifsinlargedatasets AT vojtesekborivoj hammockahiddenmarkovmodelbasedpeptideclusteringalgorithmtoidentifyproteininteractionconsensusmotifsinlargedatasets AT mullerpetr hammockahiddenmarkovmodelbasedpeptideclusteringalgorithmtoidentifyproteininteractionconsensusmotifsinlargedatasets

Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets

Ejemplares similares