Cargando…

GibbsCluster: unsupervised clustering and alignment of peptide sequences

Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand syste...

Descripción completa

Detalles Bibliográficos
Autores principales: Andreatta, Massimo, Alvarez, Bruno, Nielsen, Morten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570237/
https://www.ncbi.nlm.nih.gov/pubmed/28407089
http://dx.doi.org/10.1093/nar/gkx248
_version_ 1783259142172966912
author Andreatta, Massimo
Alvarez, Bruno
Nielsen, Morten
author_facet Andreatta, Massimo
Alvarez, Bruno
Nielsen, Morten
author_sort Andreatta, Massimo
collection PubMed
description Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand system or of the assay used to interrogate it, experimental data often contain multiple sequence motifs. GibbsCluster is a powerful tool for unsupervised motif discovery because it can simultaneously cluster and align peptide data. The GibbsCluster 2.0 presented here is an improved version incorporating insertion and deletions accounting for variations in motif length in the peptide input. In basic terms, the program takes as input a set of peptide sequences and clusters them into meaningful groups. It returns the optimal number of clusters it identified, together with the sequence alignment and sequence motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0.
format Online
Article
Text
id pubmed-5570237
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55702372017-08-29 GibbsCluster: unsupervised clustering and alignment of peptide sequences Andreatta, Massimo Alvarez, Bruno Nielsen, Morten Nucleic Acids Res Web Server Issue Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand system or of the assay used to interrogate it, experimental data often contain multiple sequence motifs. GibbsCluster is a powerful tool for unsupervised motif discovery because it can simultaneously cluster and align peptide data. The GibbsCluster 2.0 presented here is an improved version incorporating insertion and deletions accounting for variations in motif length in the peptide input. In basic terms, the program takes as input a set of peptide sequences and clusters them into meaningful groups. It returns the optimal number of clusters it identified, together with the sequence alignment and sequence motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0. Oxford University Press 2017-07-03 2017-04-12 /pmc/articles/PMC5570237/ /pubmed/28407089 http://dx.doi.org/10.1093/nar/gkx248 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Web Server Issue
Andreatta, Massimo
Alvarez, Bruno
Nielsen, Morten
GibbsCluster: unsupervised clustering and alignment of peptide sequences
title GibbsCluster: unsupervised clustering and alignment of peptide sequences
title_full GibbsCluster: unsupervised clustering and alignment of peptide sequences
title_fullStr GibbsCluster: unsupervised clustering and alignment of peptide sequences
title_full_unstemmed GibbsCluster: unsupervised clustering and alignment of peptide sequences
title_short GibbsCluster: unsupervised clustering and alignment of peptide sequences
title_sort gibbscluster: unsupervised clustering and alignment of peptide sequences
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570237/
https://www.ncbi.nlm.nih.gov/pubmed/28407089
http://dx.doi.org/10.1093/nar/gkx248
work_keys_str_mv AT andreattamassimo gibbsclusterunsupervisedclusteringandalignmentofpeptidesequences
AT alvarezbruno gibbsclusterunsupervisedclusteringandalignmentofpeptidesequences
AT nielsenmorten gibbsclusterunsupervisedclusteringandalignmentofpeptidesequences