Cargando…

Greedy de novo motif discovery to construct motif repositories for bacterial proteomes

BACKGROUND: Bacterial surfaces are complex systems, constructed from membranes, peptidoglycan and, importantly, proteins. The proteins play crucial roles as critical regulators of how the bacterium interacts with and survive in its environment. A full catalog of the motifs in protein families and th...

Descripción completa

Detalles Bibliográficos
Autores principales: Khakzad, Hamed, Malmström, Johan, Malmström, Lars
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471678/
https://www.ncbi.nlm.nih.gov/pubmed/30999854
http://dx.doi.org/10.1186/s12859-019-2686-8
_version_ 1783412079199256576
author Khakzad, Hamed
Malmström, Johan
Malmström, Lars
author_facet Khakzad, Hamed
Malmström, Johan
Malmström, Lars
author_sort Khakzad, Hamed
collection PubMed
description BACKGROUND: Bacterial surfaces are complex systems, constructed from membranes, peptidoglycan and, importantly, proteins. The proteins play crucial roles as critical regulators of how the bacterium interacts with and survive in its environment. A full catalog of the motifs in protein families and their relative conservation grade is a prerequisite to target the protein-protein interaction that bacterial surface protein makes to host proteins. RESULTS: In this paper, we propose a greedy approach to identify conserved motifs in large sequence families iteratively. Each iteration discovers a motif de novo and masks all occurrences of that motif. Remaining unmasked sequences are subjected to the next round of motif detection until no more significant motifs can be found. We demonstrate the utility of the method through the construction of a proteome-wide motif repository for Group A Streptococcus (GAS), a significant human pathogen. GAS produce numerous surface proteins that interact with over 100 human plasma proteins, helping the bacteria to evade the host immune response. We used the repository to find that proteins part of the bacterial surface has motif architectures that differ from intracellular proteins. CONCLUSIONS: We elucidate that the M protein, a coiled-coil homodimer that extends over 500 A from the cell wall, has a motif architecture that differs between various GAS strains. As the M protein is known to bind a variety of different plasma proteins, the results indicate that the different motif architectures are responsible for the quantitative differences of plasma proteins that various strains bind. The speed and applicability of the method enable its application to all major human pathogens. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2686-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6471678
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64716782019-04-24 Greedy de novo motif discovery to construct motif repositories for bacterial proteomes Khakzad, Hamed Malmström, Johan Malmström, Lars BMC Bioinformatics Research BACKGROUND: Bacterial surfaces are complex systems, constructed from membranes, peptidoglycan and, importantly, proteins. The proteins play crucial roles as critical regulators of how the bacterium interacts with and survive in its environment. A full catalog of the motifs in protein families and their relative conservation grade is a prerequisite to target the protein-protein interaction that bacterial surface protein makes to host proteins. RESULTS: In this paper, we propose a greedy approach to identify conserved motifs in large sequence families iteratively. Each iteration discovers a motif de novo and masks all occurrences of that motif. Remaining unmasked sequences are subjected to the next round of motif detection until no more significant motifs can be found. We demonstrate the utility of the method through the construction of a proteome-wide motif repository for Group A Streptococcus (GAS), a significant human pathogen. GAS produce numerous surface proteins that interact with over 100 human plasma proteins, helping the bacteria to evade the host immune response. We used the repository to find that proteins part of the bacterial surface has motif architectures that differ from intracellular proteins. CONCLUSIONS: We elucidate that the M protein, a coiled-coil homodimer that extends over 500 A from the cell wall, has a motif architecture that differs between various GAS strains. As the M protein is known to bind a variety of different plasma proteins, the results indicate that the different motif architectures are responsible for the quantitative differences of plasma proteins that various strains bind. The speed and applicability of the method enable its application to all major human pathogens. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2686-8) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-18 /pmc/articles/PMC6471678/ /pubmed/30999854 http://dx.doi.org/10.1186/s12859-019-2686-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Khakzad, Hamed
Malmström, Johan
Malmström, Lars
Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title_full Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title_fullStr Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title_full_unstemmed Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title_short Greedy de novo motif discovery to construct motif repositories for bacterial proteomes
title_sort greedy de novo motif discovery to construct motif repositories for bacterial proteomes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471678/
https://www.ncbi.nlm.nih.gov/pubmed/30999854
http://dx.doi.org/10.1186/s12859-019-2686-8
work_keys_str_mv AT khakzadhamed greedydenovomotifdiscoverytoconstructmotifrepositoriesforbacterialproteomes
AT malmstromjohan greedydenovomotifdiscoverytoconstructmotifrepositoriesforbacterialproteomes
AT malmstromlars greedydenovomotifdiscoverytoconstructmotifrepositoriesforbacterialproteomes