Cargando…

Simulating Illumina metagenomic data with InSilicoSeq

MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of com...

Descripción completa

Detalles Bibliográficos
Autores principales: Gourlé, Hadrien, Karlsson-Lindsjö, Oskar, Hayer, Juliette, Bongcam-Rudloff, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361232/
https://www.ncbi.nlm.nih.gov/pubmed/30016412
http://dx.doi.org/10.1093/bioinformatics/bty630
_version_ 1783392653633650688
author Gourlé, Hadrien
Karlsson-Lindsjö, Oskar
Hayer, Juliette
Bongcam-Rudloff, Erik
author_facet Gourlé, Hadrien
Karlsson-Lindsjö, Oskar
Hayer, Juliette
Bongcam-Rudloff, Erik
author_sort Gourlé, Hadrien
collection PubMed
description MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of computational needs within a project. Unfortunately, most current read simulators are either not suited for metagenomics, out of date or relatively poorly documented. In this article, we describe InSilicoSeq, a software package to simulate metagenomic Illumina sequencing data. InsilicoSeq has a simple command-line interface and extensive documentation. RESULTS: InSilicoSeq is implemented in Python and capable of simulating realistic Illumina (meta) genomic data in a parallel fashion with sensible default parameters. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available under the MIT license at https://github.com/HadrienG/InSilicoSeq and https://insilicoseq.readthedocs.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6361232
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63612322019-02-08 Simulating Illumina metagenomic data with InSilicoSeq Gourlé, Hadrien Karlsson-Lindsjö, Oskar Hayer, Juliette Bongcam-Rudloff, Erik Bioinformatics Applications Notes MOTIVATION: The accurate in silico simulation of metagenomic datasets is of great importance for benchmarking bioinformatics tools as well as for experimental design. Users are dependant on large-scale simulation to not only design experiments and new projects but also for accurate estimation of computational needs within a project. Unfortunately, most current read simulators are either not suited for metagenomics, out of date or relatively poorly documented. In this article, we describe InSilicoSeq, a software package to simulate metagenomic Illumina sequencing data. InsilicoSeq has a simple command-line interface and extensive documentation. RESULTS: InSilicoSeq is implemented in Python and capable of simulating realistic Illumina (meta) genomic data in a parallel fashion with sensible default parameters. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available under the MIT license at https://github.com/HadrienG/InSilicoSeq and https://insilicoseq.readthedocs.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-02-01 2018-07-19 /pmc/articles/PMC6361232/ /pubmed/30016412 http://dx.doi.org/10.1093/bioinformatics/bty630 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Gourlé, Hadrien
Karlsson-Lindsjö, Oskar
Hayer, Juliette
Bongcam-Rudloff, Erik
Simulating Illumina metagenomic data with InSilicoSeq
title Simulating Illumina metagenomic data with InSilicoSeq
title_full Simulating Illumina metagenomic data with InSilicoSeq
title_fullStr Simulating Illumina metagenomic data with InSilicoSeq
title_full_unstemmed Simulating Illumina metagenomic data with InSilicoSeq
title_short Simulating Illumina metagenomic data with InSilicoSeq
title_sort simulating illumina metagenomic data with insilicoseq
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361232/
https://www.ncbi.nlm.nih.gov/pubmed/30016412
http://dx.doi.org/10.1093/bioinformatics/bty630
work_keys_str_mv AT gourlehadrien simulatingilluminametagenomicdatawithinsilicoseq
AT karlssonlindsjooskar simulatingilluminametagenomicdatawithinsilicoseq
AT hayerjuliette simulatingilluminametagenomicdatawithinsilicoseq
AT bongcamrudlofferik simulatingilluminametagenomicdatawithinsilicoseq