Cargando…

MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit

MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of seve...

Descripción completa

Detalles Bibliográficos
Autores principales: Kultima, Jens Roat, Sunagawa, Shinichi, Li, Junhua, Chen, Weineng, Chen, Hua, Mende, Daniel R., Arumugam, Manimozhiyan, Pan, Qi, Liu, Binghang, Qin, Junjie, Wang, Jun, Bork, Peer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474746/
https://www.ncbi.nlm.nih.gov/pubmed/23082188
http://dx.doi.org/10.1371/journal.pone.0047656
_version_ 1782246827269029888
author Kultima, Jens Roat
Sunagawa, Shinichi
Li, Junhua
Chen, Weineng
Chen, Hua
Mende, Daniel R.
Arumugam, Manimozhiyan
Pan, Qi
Liu, Binghang
Qin, Junjie
Wang, Jun
Bork, Peer
author_facet Kultima, Jens Roat
Sunagawa, Shinichi
Li, Junhua
Chen, Weineng
Chen, Hua
Mende, Daniel R.
Arumugam, Manimozhiyan
Pan, Qi
Liu, Binghang
Qin, Junjie
Wang, Jun
Bork, Peer
author_sort Kultima, Jens Roat
collection PubMed
description MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/.
format Online
Article
Text
id pubmed-3474746
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34747462012-10-18 MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit Kultima, Jens Roat Sunagawa, Shinichi Li, Junhua Chen, Weineng Chen, Hua Mende, Daniel R. Arumugam, Manimozhiyan Pan, Qi Liu, Binghang Qin, Junjie Wang, Jun Bork, Peer PLoS One Research Article MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/. Public Library of Science 2012-10-17 /pmc/articles/PMC3474746/ /pubmed/23082188 http://dx.doi.org/10.1371/journal.pone.0047656 Text en © 2012 Kultima et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kultima, Jens Roat
Sunagawa, Shinichi
Li, Junhua
Chen, Weineng
Chen, Hua
Mende, Daniel R.
Arumugam, Manimozhiyan
Pan, Qi
Liu, Binghang
Qin, Junjie
Wang, Jun
Bork, Peer
MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title_full MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title_fullStr MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title_full_unstemmed MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title_short MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
title_sort mocat: a metagenomics assembly and gene prediction toolkit
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474746/
https://www.ncbi.nlm.nih.gov/pubmed/23082188
http://dx.doi.org/10.1371/journal.pone.0047656
work_keys_str_mv AT kultimajensroat mocatametagenomicsassemblyandgenepredictiontoolkit
AT sunagawashinichi mocatametagenomicsassemblyandgenepredictiontoolkit
AT lijunhua mocatametagenomicsassemblyandgenepredictiontoolkit
AT chenweineng mocatametagenomicsassemblyandgenepredictiontoolkit
AT chenhua mocatametagenomicsassemblyandgenepredictiontoolkit
AT mendedanielr mocatametagenomicsassemblyandgenepredictiontoolkit
AT arumugammanimozhiyan mocatametagenomicsassemblyandgenepredictiontoolkit
AT panqi mocatametagenomicsassemblyandgenepredictiontoolkit
AT liubinghang mocatametagenomicsassemblyandgenepredictiontoolkit
AT qinjunjie mocatametagenomicsassemblyandgenepredictiontoolkit
AT wangjun mocatametagenomicsassemblyandgenepredictiontoolkit
AT borkpeer mocatametagenomicsassemblyandgenepredictiontoolkit