Cargando…

A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts

The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects....

Descripción completa

Detalles Bibliográficos
Autores principales: Masuch, Thorsten, Kusnezowa, Anna, Nilewski, Sebastian, Bautista, José T., Kourist, Robert, Leichert, Lars I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602143/
https://www.ncbi.nlm.nih.gov/pubmed/26528261
http://dx.doi.org/10.3389/fmicb.2015.01110
_version_ 1782394661047894016
author Masuch, Thorsten
Kusnezowa, Anna
Nilewski, Sebastian
Bautista, José T.
Kourist, Robert
Leichert, Lars I.
author_facet Masuch, Thorsten
Kusnezowa, Anna
Nilewski, Sebastian
Bautista, José T.
Kourist, Robert
Leichert, Lars I.
author_sort Masuch, Thorsten
collection PubMed
description The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects. Here we present a combination of bioinformatics, synthetic biology, and Escherichia coli genetics to discover biocatalysts in metagenomic datasets. We created a subset of the Global Ocean Sampling dataset, the largest metagenomic project published to date, by removing all proteins that matched Hidden Markov Models of known protein families from PFAM and TIGRFAM with high confidence (E-value > 10(-5)). This essentially left us with proteins with low or no homology to known protein families, still encompassing ~1.7 million different sequences. In this subset, we then identified protein families de novo with a Markov clustering algorithm. For each protein family, we defined a single representative based on its phylogenetic relationship to all other members in that family. This reduced the dataset to ~17,000 representatives of protein families with more than 10 members. Based on conserved regions typical for lipases and esterases, we selected a representative gene from a family of 27 members for synthesis. This protein, when expressed in E. coli, showed lipolytic activity toward para-nitrophenyl (pNP) esters. The K(m)-value of the enzyme was 66.68 μM for pNP-butyrate and 68.08 μM for pNP-palmitate with k(cat)/K(m) values at 3.4 × 10(6) and 6.6 × 10(5) M(-1)s(-1), respectively. Hydrolysis of model substrates showed enantiopreference for the R-form. Reactions yielded 43 and 61% enantiomeric excess of products with ibuprofen methyl ester and 2-phenylpropanoic acid ethyl ester, respectively. The enzyme retains 50% of its maximum activity at temperatures as low as 10°C, its activity is enhanced in artificial seawater and buffers with higher salt concentrations with an optimum osmolarity of 3,890 mosmol/l.
format Online
Article
Text
id pubmed-4602143
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-46021432015-11-02 A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts Masuch, Thorsten Kusnezowa, Anna Nilewski, Sebastian Bautista, José T. Kourist, Robert Leichert, Lars I. Front Microbiol Microbiology The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects. Here we present a combination of bioinformatics, synthetic biology, and Escherichia coli genetics to discover biocatalysts in metagenomic datasets. We created a subset of the Global Ocean Sampling dataset, the largest metagenomic project published to date, by removing all proteins that matched Hidden Markov Models of known protein families from PFAM and TIGRFAM with high confidence (E-value > 10(-5)). This essentially left us with proteins with low or no homology to known protein families, still encompassing ~1.7 million different sequences. In this subset, we then identified protein families de novo with a Markov clustering algorithm. For each protein family, we defined a single representative based on its phylogenetic relationship to all other members in that family. This reduced the dataset to ~17,000 representatives of protein families with more than 10 members. Based on conserved regions typical for lipases and esterases, we selected a representative gene from a family of 27 members for synthesis. This protein, when expressed in E. coli, showed lipolytic activity toward para-nitrophenyl (pNP) esters. The K(m)-value of the enzyme was 66.68 μM for pNP-butyrate and 68.08 μM for pNP-palmitate with k(cat)/K(m) values at 3.4 × 10(6) and 6.6 × 10(5) M(-1)s(-1), respectively. Hydrolysis of model substrates showed enantiopreference for the R-form. Reactions yielded 43 and 61% enantiomeric excess of products with ibuprofen methyl ester and 2-phenylpropanoic acid ethyl ester, respectively. The enzyme retains 50% of its maximum activity at temperatures as low as 10°C, its activity is enhanced in artificial seawater and buffers with higher salt concentrations with an optimum osmolarity of 3,890 mosmol/l. Frontiers Media S.A. 2015-10-13 /pmc/articles/PMC4602143/ /pubmed/26528261 http://dx.doi.org/10.3389/fmicb.2015.01110 Text en Copyright © 2015 Masuch, Kusnezowa, Nilewski, Bautista, Kourist and Leichert. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Masuch, Thorsten
Kusnezowa, Anna
Nilewski, Sebastian
Bautista, José T.
Kourist, Robert
Leichert, Lars I.
A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title_full A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title_fullStr A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title_full_unstemmed A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title_short A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
title_sort combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602143/
https://www.ncbi.nlm.nih.gov/pubmed/26528261
http://dx.doi.org/10.3389/fmicb.2015.01110
work_keys_str_mv AT masuchthorsten acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT kusnezowaanna acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT nilewskisebastian acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT bautistajoset acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT kouristrobert acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT leichertlarsi acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT masuchthorsten combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT kusnezowaanna combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT nilewskisebastian combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT bautistajoset combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT kouristrobert combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts
AT leichertlarsi combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts