Cargando…

A roadmap for metagenomic enzyme discovery

Covering: up to 2021 Metagenomics has yielded massive amounts of sequencing data offering a glimpse into the biosynthetic potential of the uncultivated microbial majority. While genome-resolved information about microbial communities from nearly every environment on earth is now available, the abili...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, Serina L., Piel, Jörn, Sunagawa, Shinichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8597712/
https://www.ncbi.nlm.nih.gov/pubmed/34821235
http://dx.doi.org/10.1039/d1np00006c
_version_ 1784600658947604480
author Robinson, Serina L.
Piel, Jörn
Sunagawa, Shinichi
author_facet Robinson, Serina L.
Piel, Jörn
Sunagawa, Shinichi
author_sort Robinson, Serina L.
collection PubMed
description Covering: up to 2021 Metagenomics has yielded massive amounts of sequencing data offering a glimpse into the biosynthetic potential of the uncultivated microbial majority. While genome-resolved information about microbial communities from nearly every environment on earth is now available, the ability to accurately predict biocatalytic functions directly from sequencing data remains challenging. Compared to primary metabolic pathways, enzymes involved in secondary metabolism often catalyze specialized reactions with diverse substrates, making these pathways rich resources for the discovery of new enzymology. To date, functional insights gained from studies on environmental DNA (eDNA) have largely relied on PCR- or activity-based screening of eDNA fragments cloned in fosmid or cosmid libraries. As an alternative, shotgun metagenomics holds underexplored potential for the discovery of new enzymes directly from eDNA by avoiding common biases introduced through PCR- or activity-guided functional metagenomics workflows. However, inferring new enzyme functions directly from eDNA is similar to searching for a ‘needle in a haystack’ without direct links between genotype and phenotype. The goal of this review is to provide a roadmap to navigate shotgun metagenomic sequencing data and identify new candidate biosynthetic enzymes. We cover both computational and experimental strategies to mine metagenomes and explore protein sequence space with a spotlight on natural product biosynthesis. Specifically, we compare in silico methods for enzyme discovery including phylogenetics, sequence similarity networks, genomic context, 3D structure-based approaches, and machine learning techniques. We also discuss various experimental strategies to test computational predictions including heterologous expression and screening. Finally, we provide an outlook for future directions in the field with an emphasis on meta-omics, single-cell genomics, cell-free expression systems, and sequence-independent methods.
format Online
Article
Text
id pubmed-8597712
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-85977122021-11-23 A roadmap for metagenomic enzyme discovery Robinson, Serina L. Piel, Jörn Sunagawa, Shinichi Nat Prod Rep Chemistry Covering: up to 2021 Metagenomics has yielded massive amounts of sequencing data offering a glimpse into the biosynthetic potential of the uncultivated microbial majority. While genome-resolved information about microbial communities from nearly every environment on earth is now available, the ability to accurately predict biocatalytic functions directly from sequencing data remains challenging. Compared to primary metabolic pathways, enzymes involved in secondary metabolism often catalyze specialized reactions with diverse substrates, making these pathways rich resources for the discovery of new enzymology. To date, functional insights gained from studies on environmental DNA (eDNA) have largely relied on PCR- or activity-based screening of eDNA fragments cloned in fosmid or cosmid libraries. As an alternative, shotgun metagenomics holds underexplored potential for the discovery of new enzymes directly from eDNA by avoiding common biases introduced through PCR- or activity-guided functional metagenomics workflows. However, inferring new enzyme functions directly from eDNA is similar to searching for a ‘needle in a haystack’ without direct links between genotype and phenotype. The goal of this review is to provide a roadmap to navigate shotgun metagenomic sequencing data and identify new candidate biosynthetic enzymes. We cover both computational and experimental strategies to mine metagenomes and explore protein sequence space with a spotlight on natural product biosynthesis. Specifically, we compare in silico methods for enzyme discovery including phylogenetics, sequence similarity networks, genomic context, 3D structure-based approaches, and machine learning techniques. We also discuss various experimental strategies to test computational predictions including heterologous expression and screening. Finally, we provide an outlook for future directions in the field with an emphasis on meta-omics, single-cell genomics, cell-free expression systems, and sequence-independent methods. The Royal Society of Chemistry 2021-04-12 /pmc/articles/PMC8597712/ /pubmed/34821235 http://dx.doi.org/10.1039/d1np00006c Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Robinson, Serina L.
Piel, Jörn
Sunagawa, Shinichi
A roadmap for metagenomic enzyme discovery
title A roadmap for metagenomic enzyme discovery
title_full A roadmap for metagenomic enzyme discovery
title_fullStr A roadmap for metagenomic enzyme discovery
title_full_unstemmed A roadmap for metagenomic enzyme discovery
title_short A roadmap for metagenomic enzyme discovery
title_sort roadmap for metagenomic enzyme discovery
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8597712/
https://www.ncbi.nlm.nih.gov/pubmed/34821235
http://dx.doi.org/10.1039/d1np00006c
work_keys_str_mv AT robinsonserinal aroadmapformetagenomicenzymediscovery
AT pieljorn aroadmapformetagenomicenzymediscovery
AT sunagawashinichi aroadmapformetagenomicenzymediscovery
AT robinsonserinal roadmapformetagenomicenzymediscovery
AT pieljorn roadmapformetagenomicenzymediscovery
AT sunagawashinichi roadmapformetagenomicenzymediscovery