Cargando…
MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit
MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of seve...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474746/ https://www.ncbi.nlm.nih.gov/pubmed/23082188 http://dx.doi.org/10.1371/journal.pone.0047656 |
_version_ | 1782246827269029888 |
---|---|
author | Kultima, Jens Roat Sunagawa, Shinichi Li, Junhua Chen, Weineng Chen, Hua Mende, Daniel R. Arumugam, Manimozhiyan Pan, Qi Liu, Binghang Qin, Junjie Wang, Jun Bork, Peer |
author_facet | Kultima, Jens Roat Sunagawa, Shinichi Li, Junhua Chen, Weineng Chen, Hua Mende, Daniel R. Arumugam, Manimozhiyan Pan, Qi Liu, Binghang Qin, Junjie Wang, Jun Bork, Peer |
author_sort | Kultima, Jens Roat |
collection | PubMed |
description | MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/. |
format | Online Article Text |
id | pubmed-3474746 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34747462012-10-18 MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit Kultima, Jens Roat Sunagawa, Shinichi Li, Junhua Chen, Weineng Chen, Hua Mende, Daniel R. Arumugam, Manimozhiyan Pan, Qi Liu, Binghang Qin, Junjie Wang, Jun Bork, Peer PLoS One Research Article MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/. Public Library of Science 2012-10-17 /pmc/articles/PMC3474746/ /pubmed/23082188 http://dx.doi.org/10.1371/journal.pone.0047656 Text en © 2012 Kultima et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Kultima, Jens Roat Sunagawa, Shinichi Li, Junhua Chen, Weineng Chen, Hua Mende, Daniel R. Arumugam, Manimozhiyan Pan, Qi Liu, Binghang Qin, Junjie Wang, Jun Bork, Peer MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title | MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title_full | MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title_fullStr | MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title_full_unstemmed | MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title_short | MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit |
title_sort | mocat: a metagenomics assembly and gene prediction toolkit |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474746/ https://www.ncbi.nlm.nih.gov/pubmed/23082188 http://dx.doi.org/10.1371/journal.pone.0047656 |
work_keys_str_mv | AT kultimajensroat mocatametagenomicsassemblyandgenepredictiontoolkit AT sunagawashinichi mocatametagenomicsassemblyandgenepredictiontoolkit AT lijunhua mocatametagenomicsassemblyandgenepredictiontoolkit AT chenweineng mocatametagenomicsassemblyandgenepredictiontoolkit AT chenhua mocatametagenomicsassemblyandgenepredictiontoolkit AT mendedanielr mocatametagenomicsassemblyandgenepredictiontoolkit AT arumugammanimozhiyan mocatametagenomicsassemblyandgenepredictiontoolkit AT panqi mocatametagenomicsassemblyandgenepredictiontoolkit AT liubinghang mocatametagenomicsassemblyandgenepredictiontoolkit AT qinjunjie mocatametagenomicsassemblyandgenepredictiontoolkit AT wangjun mocatametagenomicsassemblyandgenepredictiontoolkit AT borkpeer mocatametagenomicsassemblyandgenepredictiontoolkit |