Cargando…

NeSSM: A Next-Generation Sequencing Simulator for Metagenomics

BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development a...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Ben, Xuan, Liming, Cai, Kaiye, Hu, Zhiqiang, Ma, Liangxiao, Wei, Chaochun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790878/
https://www.ncbi.nlm.nih.gov/pubmed/24124490
http://dx.doi.org/10.1371/journal.pone.0075448
_version_ 1782286665695363072
author Jia, Ben
Xuan, Liming
Cai, Kaiye
Hu, Zhiqiang
Ma, Liangxiao
Wei, Chaochun
author_facet Jia, Ben
Xuan, Liming
Cai, Kaiye
Hu, Zhiqiang
Ma, Liangxiao
Wei, Chaochun
author_sort Jia, Ben
collection PubMed
description BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.
format Online
Article
Text
id pubmed-3790878
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37908782013-10-11 NeSSM: A Next-Generation Sequencing Simulator for Metagenomics Jia, Ben Xuan, Liming Cai, Kaiye Hu, Zhiqiang Ma, Liangxiao Wei, Chaochun PLoS One Research Article BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php. Public Library of Science 2013-10-04 /pmc/articles/PMC3790878/ /pubmed/24124490 http://dx.doi.org/10.1371/journal.pone.0075448 Text en © 2013 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jia, Ben
Xuan, Liming
Cai, Kaiye
Hu, Zhiqiang
Ma, Liangxiao
Wei, Chaochun
NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title_full NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title_fullStr NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title_full_unstemmed NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title_short NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
title_sort nessm: a next-generation sequencing simulator for metagenomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790878/
https://www.ncbi.nlm.nih.gov/pubmed/24124490
http://dx.doi.org/10.1371/journal.pone.0075448
work_keys_str_mv AT jiaben nessmanextgenerationsequencingsimulatorformetagenomics
AT xuanliming nessmanextgenerationsequencingsimulatorformetagenomics
AT caikaiye nessmanextgenerationsequencingsimulatorformetagenomics
AT huzhiqiang nessmanextgenerationsequencingsimulatorformetagenomics
AT maliangxiao nessmanextgenerationsequencingsimulatorformetagenomics
AT weichaochun nessmanextgenerationsequencingsimulatorformetagenomics