Cargando…
A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts
The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects....
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602143/ https://www.ncbi.nlm.nih.gov/pubmed/26528261 http://dx.doi.org/10.3389/fmicb.2015.01110 |
_version_ | 1782394661047894016 |
---|---|
author | Masuch, Thorsten Kusnezowa, Anna Nilewski, Sebastian Bautista, José T. Kourist, Robert Leichert, Lars I. |
author_facet | Masuch, Thorsten Kusnezowa, Anna Nilewski, Sebastian Bautista, José T. Kourist, Robert Leichert, Lars I. |
author_sort | Masuch, Thorsten |
collection | PubMed |
description | The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects. Here we present a combination of bioinformatics, synthetic biology, and Escherichia coli genetics to discover biocatalysts in metagenomic datasets. We created a subset of the Global Ocean Sampling dataset, the largest metagenomic project published to date, by removing all proteins that matched Hidden Markov Models of known protein families from PFAM and TIGRFAM with high confidence (E-value > 10(-5)). This essentially left us with proteins with low or no homology to known protein families, still encompassing ~1.7 million different sequences. In this subset, we then identified protein families de novo with a Markov clustering algorithm. For each protein family, we defined a single representative based on its phylogenetic relationship to all other members in that family. This reduced the dataset to ~17,000 representatives of protein families with more than 10 members. Based on conserved regions typical for lipases and esterases, we selected a representative gene from a family of 27 members for synthesis. This protein, when expressed in E. coli, showed lipolytic activity toward para-nitrophenyl (pNP) esters. The K(m)-value of the enzyme was 66.68 μM for pNP-butyrate and 68.08 μM for pNP-palmitate with k(cat)/K(m) values at 3.4 × 10(6) and 6.6 × 10(5) M(-1)s(-1), respectively. Hydrolysis of model substrates showed enantiopreference for the R-form. Reactions yielded 43 and 61% enantiomeric excess of products with ibuprofen methyl ester and 2-phenylpropanoic acid ethyl ester, respectively. The enzyme retains 50% of its maximum activity at temperatures as low as 10°C, its activity is enhanced in artificial seawater and buffers with higher salt concentrations with an optimum osmolarity of 3,890 mosmol/l. |
format | Online Article Text |
id | pubmed-4602143 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-46021432015-11-02 A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts Masuch, Thorsten Kusnezowa, Anna Nilewski, Sebastian Bautista, José T. Kourist, Robert Leichert, Lars I. Front Microbiol Microbiology The majority of protein sequence data published today is of metagenomic origin. However, our ability to assign functions to these sequences is often hampered by our general inability to cultivate the larger part of microbial species and the sheer amount of sequence data generated in these projects. Here we present a combination of bioinformatics, synthetic biology, and Escherichia coli genetics to discover biocatalysts in metagenomic datasets. We created a subset of the Global Ocean Sampling dataset, the largest metagenomic project published to date, by removing all proteins that matched Hidden Markov Models of known protein families from PFAM and TIGRFAM with high confidence (E-value > 10(-5)). This essentially left us with proteins with low or no homology to known protein families, still encompassing ~1.7 million different sequences. In this subset, we then identified protein families de novo with a Markov clustering algorithm. For each protein family, we defined a single representative based on its phylogenetic relationship to all other members in that family. This reduced the dataset to ~17,000 representatives of protein families with more than 10 members. Based on conserved regions typical for lipases and esterases, we selected a representative gene from a family of 27 members for synthesis. This protein, when expressed in E. coli, showed lipolytic activity toward para-nitrophenyl (pNP) esters. The K(m)-value of the enzyme was 66.68 μM for pNP-butyrate and 68.08 μM for pNP-palmitate with k(cat)/K(m) values at 3.4 × 10(6) and 6.6 × 10(5) M(-1)s(-1), respectively. Hydrolysis of model substrates showed enantiopreference for the R-form. Reactions yielded 43 and 61% enantiomeric excess of products with ibuprofen methyl ester and 2-phenylpropanoic acid ethyl ester, respectively. The enzyme retains 50% of its maximum activity at temperatures as low as 10°C, its activity is enhanced in artificial seawater and buffers with higher salt concentrations with an optimum osmolarity of 3,890 mosmol/l. Frontiers Media S.A. 2015-10-13 /pmc/articles/PMC4602143/ /pubmed/26528261 http://dx.doi.org/10.3389/fmicb.2015.01110 Text en Copyright © 2015 Masuch, Kusnezowa, Nilewski, Bautista, Kourist and Leichert. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Masuch, Thorsten Kusnezowa, Anna Nilewski, Sebastian Bautista, José T. Kourist, Robert Leichert, Lars I. A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title | A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title_full | A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title_fullStr | A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title_full_unstemmed | A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title_short | A combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
title_sort | combined bioinformatics and functional metagenomics approach to discovering lipolytic biocatalysts |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602143/ https://www.ncbi.nlm.nih.gov/pubmed/26528261 http://dx.doi.org/10.3389/fmicb.2015.01110 |
work_keys_str_mv | AT masuchthorsten acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT kusnezowaanna acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT nilewskisebastian acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT bautistajoset acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT kouristrobert acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT leichertlarsi acombinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT masuchthorsten combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT kusnezowaanna combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT nilewskisebastian combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT bautistajoset combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT kouristrobert combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts AT leichertlarsi combinedbioinformaticsandfunctionalmetagenomicsapproachtodiscoveringlipolyticbiocatalysts |