Cargando…
NeSSM: A Next-Generation Sequencing Simulator for Metagenomics
BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790878/ https://www.ncbi.nlm.nih.gov/pubmed/24124490 http://dx.doi.org/10.1371/journal.pone.0075448 |
_version_ | 1782286665695363072 |
---|---|
author | Jia, Ben Xuan, Liming Cai, Kaiye Hu, Zhiqiang Ma, Liangxiao Wei, Chaochun |
author_facet | Jia, Ben Xuan, Liming Cai, Kaiye Hu, Zhiqiang Ma, Liangxiao Wei, Chaochun |
author_sort | Jia, Ben |
collection | PubMed |
description | BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php. |
format | Online Article Text |
id | pubmed-3790878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37908782013-10-11 NeSSM: A Next-Generation Sequencing Simulator for Metagenomics Jia, Ben Xuan, Liming Cai, Kaiye Hu, Zhiqiang Ma, Liangxiao Wei, Chaochun PLoS One Research Article BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php. Public Library of Science 2013-10-04 /pmc/articles/PMC3790878/ /pubmed/24124490 http://dx.doi.org/10.1371/journal.pone.0075448 Text en © 2013 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Jia, Ben Xuan, Liming Cai, Kaiye Hu, Zhiqiang Ma, Liangxiao Wei, Chaochun NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title | NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title_full | NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title_fullStr | NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title_full_unstemmed | NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title_short | NeSSM: A Next-Generation Sequencing Simulator for Metagenomics |
title_sort | nessm: a next-generation sequencing simulator for metagenomics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3790878/ https://www.ncbi.nlm.nih.gov/pubmed/24124490 http://dx.doi.org/10.1371/journal.pone.0075448 |
work_keys_str_mv | AT jiaben nessmanextgenerationsequencingsimulatorformetagenomics AT xuanliming nessmanextgenerationsequencingsimulatorformetagenomics AT caikaiye nessmanextgenerationsequencingsimulatorformetagenomics AT huzhiqiang nessmanextgenerationsequencingsimulatorformetagenomics AT maliangxiao nessmanextgenerationsequencingsimulatorformetagenomics AT weichaochun nessmanextgenerationsequencingsimulatorformetagenomics |