Cargando…

scrm: efficiently simulating long sequences using the approximated coalescent with recombination

Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude appro...

Descripción completa

Detalles Bibliográficos
Autores principales: Staab, Paul R., Zhu, Sha, Metzler, Dirk, Lunter, Gerton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426833/
https://www.ncbi.nlm.nih.gov/pubmed/25596205
http://dx.doi.org/10.1093/bioinformatics/btu861
_version_ 1782370642679562240
author Staab, Paul R.
Zhu, Sha
Metzler, Dirk
Lunter, Gerton
author_facet Staab, Paul R.
Zhu, Sha
Metzler, Dirk
Lunter, Gerton
author_sort Staab, Paul R.
collection PubMed
description Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude approximations. Results: We present the sequential coalescent with recombination model (SCRM), a new method that efficiently and accurately approximates the coalescent with recombination, closing the gap between current approximations and the exact model. We present an efficient implementation and show that it can simulate genomic-scale datasets with an essentially correct linkage structure. Availability and implementation: The open source implementation scrm is freely available at https://scrm.github.io under the conditions of the GPLv3 license. Contact: staab@bio.lmu.de or gerton.lunter@well.ox.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4426833
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44268332015-05-15 scrm: efficiently simulating long sequences using the approximated coalescent with recombination Staab, Paul R. Zhu, Sha Metzler, Dirk Lunter, Gerton Bioinformatics Applications Notes Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude approximations. Results: We present the sequential coalescent with recombination model (SCRM), a new method that efficiently and accurately approximates the coalescent with recombination, closing the gap between current approximations and the exact model. We present an efficient implementation and show that it can simulate genomic-scale datasets with an essentially correct linkage structure. Availability and implementation: The open source implementation scrm is freely available at https://scrm.github.io under the conditions of the GPLv3 license. Contact: staab@bio.lmu.de or gerton.lunter@well.ox.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-05-15 2015-01-08 /pmc/articles/PMC4426833/ /pubmed/25596205 http://dx.doi.org/10.1093/bioinformatics/btu861 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Staab, Paul R.
Zhu, Sha
Metzler, Dirk
Lunter, Gerton
scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title_full scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title_fullStr scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title_full_unstemmed scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title_short scrm: efficiently simulating long sequences using the approximated coalescent with recombination
title_sort scrm: efficiently simulating long sequences using the approximated coalescent with recombination
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426833/
https://www.ncbi.nlm.nih.gov/pubmed/25596205
http://dx.doi.org/10.1093/bioinformatics/btu861
work_keys_str_mv AT staabpaulr scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination
AT zhusha scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination
AT metzlerdirk scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination
AT luntergerton scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination