Cargando…
Simulating Illumina metagenomic data with InSilicoSeq
MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of com...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361232/ https://www.ncbi.nlm.nih.gov/pubmed/30016412 http://dx.doi.org/10.1093/bioinformatics/bty630 |
_version_ | 1783392653633650688 |
---|---|
author | Gourlé, Hadrien Karlsson-Lindsjö, Oskar Hayer, Juliette Bongcam-Rudloff, Erik |
author_facet | Gourlé, Hadrien Karlsson-Lindsjö, Oskar Hayer, Juliette Bongcam-Rudloff, Erik |
author_sort | Gourlé, Hadrien |
collection | PubMed |
description | MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of computational needs within a project. Unfortunately, most current read simulators are either not suited for metagenomics, out of date or relatively poorly documented. In this article, we describe InSilicoSeq, a software package to simulate metagenomic Illumina sequencing data. InsilicoSeq has a simple command-line interface and extensive documentation. RESULTS: InSilicoSeq is implemented in Python and capable of simulating realistic Illumina (meta) genomic data in a parallel fashion with sensible default parameters. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available under the MIT license at https://github.com/HadrienG/InSilicoSeq and https://insilicoseq.readthedocs.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6361232 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63612322019-02-08 Simulating Illumina metagenomic data with InSilicoSeq Gourlé, Hadrien Karlsson-Lindsjö, Oskar Hayer, Juliette Bongcam-Rudloff, Erik Bioinformatics Applications Notes MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of computational needs within a project. Unfortunately, most current read simulators are either not suited for metagenomics, out of date or relatively poorly documented. In this article, we describe InSilicoSeq, a software package to simulate metagenomic Illumina sequencing data. InsilicoSeq has a simple command-line interface and extensive documentation. RESULTS: InSilicoSeq is implemented in Python and capable of simulating realistic Illumina (meta) genomic data in a parallel fashion with sensible default parameters. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available under the MIT license at https://github.com/HadrienG/InSilicoSeq and https://insilicoseq.readthedocs.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-02-01 2018-07-19 /pmc/articles/PMC6361232/ /pubmed/30016412 http://dx.doi.org/10.1093/bioinformatics/bty630 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Gourlé, Hadrien Karlsson-Lindsjö, Oskar Hayer, Juliette Bongcam-Rudloff, Erik Simulating Illumina metagenomic data with InSilicoSeq |
title | Simulating Illumina metagenomic data with InSilicoSeq |
title_full | Simulating Illumina metagenomic data with InSilicoSeq |
title_fullStr | Simulating Illumina metagenomic data with InSilicoSeq |
title_full_unstemmed | Simulating Illumina metagenomic data with InSilicoSeq |
title_short | Simulating Illumina metagenomic data with InSilicoSeq |
title_sort | simulating illumina metagenomic data with insilicoseq |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361232/ https://www.ncbi.nlm.nih.gov/pubmed/30016412 http://dx.doi.org/10.1093/bioinformatics/bty630 |
work_keys_str_mv | AT gourlehadrien simulatingilluminametagenomicdatawithinsilicoseq AT karlssonlindsjooskar simulatingilluminametagenomicdatawithinsilicoseq AT hayerjuliette simulatingilluminametagenomicdatawithinsilicoseq AT bongcamrudlofferik simulatingilluminametagenomicdatawithinsilicoseq |