Cargando…

MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics

BACKGROUND: Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects a...

Descripción completa

Detalles Bibliográficos
Autores principales: Kushwaha, Sandeep K, Manoharan, Lokeshwaran, Meerupati, Tejashwari, Hedlund, Katarina, Ahrén, Dag
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355349/
https://www.ncbi.nlm.nih.gov/pubmed/25880302
http://dx.doi.org/10.1186/s12859-015-0501-8
_version_ 1782360840295415808
author Kushwaha, Sandeep K
Manoharan, Lokeshwaran
Meerupati, Tejashwari
Hedlund, Katarina
Ahrén, Dag
author_facet Kushwaha, Sandeep K
Manoharan, Lokeshwaran
Meerupati, Tejashwari
Hedlund, Katarina
Ahrén, Dag
author_sort Kushwaha, Sandeep K
collection PubMed
description BACKGROUND: Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects achievable to a wider group of researchers, though complete metagenome sequencing is still a daunting task in terms of sequencing as well as the downstream bioinformatics analyses. Alternative approaches such as targeted amplicon sequencing requires custom PCR primer generation, and is not scalable to thousands of genes or gene families. RESULTS: In this study, we are presenting a web-based tool called MetCap that circumvents the limitations of amplicon sequencing of multiple genes by designing probes that are suitable for large-scale targeted metagenomics sequencing studies. MetCap provides a novel approach to target thousands of genes and genomic regions that could be used in targeted metagenomics studies. Automatic analysis of user-defined sequences is performed, and probes specifically designed for metagenome studies are generated. To illustrate the advantage of a targeted metagenome approach, we have generated more than 300,000 probes that match more than 400,000 publicly available sequences related to carbon degradation, and used these probes for target sequencing in a soil metagenome study. The results show high enrichment of target genes and a successful capturing of the majority of gene families. MetCap is freely available to users from: http://soilecology.biol.lu.se/metcap/. CONCLUSION: MetCap is facilitating probe-based target enrichment as an easy and efficient alternative tool compared to complex primer-based enrichment for large-scale investigations of metagenomes. Our results have shown efficient large-scale target enrichment through MetCap-designed probes for a soil metagenome. The web service is suitable for any targeted metagenomics project that aims to study several genes simultaneously. The novel bioinformatics approach taken by the web service will enable researchers in microbial ecology to tap into the vast diversity of microbial communities using targeted metagenomics as a cost-effective alternative to whole metagenome sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0501-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4355349
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43553492015-03-12 MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics Kushwaha, Sandeep K Manoharan, Lokeshwaran Meerupati, Tejashwari Hedlund, Katarina Ahrén, Dag BMC Bioinformatics Software BACKGROUND: Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects achievable to a wider group of researchers, though complete metagenome sequencing is still a daunting task in terms of sequencing as well as the downstream bioinformatics analyses. Alternative approaches such as targeted amplicon sequencing requires custom PCR primer generation, and is not scalable to thousands of genes or gene families. RESULTS: In this study, we are presenting a web-based tool called MetCap that circumvents the limitations of amplicon sequencing of multiple genes by designing probes that are suitable for large-scale targeted metagenomics sequencing studies. MetCap provides a novel approach to target thousands of genes and genomic regions that could be used in targeted metagenomics studies. Automatic analysis of user-defined sequences is performed, and probes specifically designed for metagenome studies are generated. To illustrate the advantage of a targeted metagenome approach, we have generated more than 300,000 probes that match more than 400,000 publicly available sequences related to carbon degradation, and used these probes for target sequencing in a soil metagenome study. The results show high enrichment of target genes and a successful capturing of the majority of gene families. MetCap is freely available to users from: http://soilecology.biol.lu.se/metcap/. CONCLUSION: MetCap is facilitating probe-based target enrichment as an easy and efficient alternative tool compared to complex primer-based enrichment for large-scale investigations of metagenomes. Our results have shown efficient large-scale target enrichment through MetCap-designed probes for a soil metagenome. The web service is suitable for any targeted metagenomics project that aims to study several genes simultaneously. The novel bioinformatics approach taken by the web service will enable researchers in microbial ecology to tap into the vast diversity of microbial communities using targeted metagenomics as a cost-effective alternative to whole metagenome sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0501-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-28 /pmc/articles/PMC4355349/ /pubmed/25880302 http://dx.doi.org/10.1186/s12859-015-0501-8 Text en © Kushwaha et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Kushwaha, Sandeep K
Manoharan, Lokeshwaran
Meerupati, Tejashwari
Hedlund, Katarina
Ahrén, Dag
MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title_full MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title_fullStr MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title_full_unstemmed MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title_short MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
title_sort metcap: a bioinformatics probe design pipeline for large-scale targeted metagenomics
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355349/
https://www.ncbi.nlm.nih.gov/pubmed/25880302
http://dx.doi.org/10.1186/s12859-015-0501-8
work_keys_str_mv AT kushwahasandeepk metcapabioinformaticsprobedesignpipelineforlargescaletargetedmetagenomics
AT manoharanlokeshwaran metcapabioinformaticsprobedesignpipelineforlargescaletargetedmetagenomics
AT meerupatitejashwari metcapabioinformaticsprobedesignpipelineforlargescaletargetedmetagenomics
AT hedlundkatarina metcapabioinformaticsprobedesignpipelineforlargescaletargetedmetagenomics
AT ahrendag metcapabioinformaticsprobedesignpipelineforlargescaletargetedmetagenomics