Cargando…
scrm: efficiently simulating long sequences using the approximated coalescent with recombination
Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude appro...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426833/ https://www.ncbi.nlm.nih.gov/pubmed/25596205 http://dx.doi.org/10.1093/bioinformatics/btu861 |
_version_ | 1782370642679562240 |
---|---|
author | Staab, Paul R. Zhu, Sha Metzler, Dirk Lunter, Gerton |
author_facet | Staab, Paul R. Zhu, Sha Metzler, Dirk Lunter, Gerton |
author_sort | Staab, Paul R. |
collection | PubMed |
description | Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude approximations. Results: We present the sequential coalescent with recombination model (SCRM), a new method that efficiently and accurately approximates the coalescent with recombination, closing the gap between current approximations and the exact model. We present an efficient implementation and show that it can simulate genomic-scale datasets with an essentially correct linkage structure. Availability and implementation: The open source implementation scrm is freely available at https://scrm.github.io under the conditions of the GPLv3 license. Contact: staab@bio.lmu.de or gerton.lunter@well.ox.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4426833 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-44268332015-05-15 scrm: efficiently simulating long sequences using the approximated coalescent with recombination Staab, Paul R. Zhu, Sha Metzler, Dirk Lunter, Gerton Bioinformatics Applications Notes Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude approximations. Results: We present the sequential coalescent with recombination model (SCRM), a new method that efficiently and accurately approximates the coalescent with recombination, closing the gap between current approximations and the exact model. We present an efficient implementation and show that it can simulate genomic-scale datasets with an essentially correct linkage structure. Availability and implementation: The open source implementation scrm is freely available at https://scrm.github.io under the conditions of the GPLv3 license. Contact: staab@bio.lmu.de or gerton.lunter@well.ox.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-05-15 2015-01-08 /pmc/articles/PMC4426833/ /pubmed/25596205 http://dx.doi.org/10.1093/bioinformatics/btu861 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Staab, Paul R. Zhu, Sha Metzler, Dirk Lunter, Gerton scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title_full | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title_fullStr | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title_full_unstemmed | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title_short | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
title_sort | scrm: efficiently simulating long sequences using the approximated coalescent with recombination |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426833/ https://www.ncbi.nlm.nih.gov/pubmed/25596205 http://dx.doi.org/10.1093/bioinformatics/btu861 |
work_keys_str_mv | AT staabpaulr scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination AT zhusha scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination AT metzlerdirk scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination AT luntergerton scrmefficientlysimulatinglongsequencesusingtheapproximatedcoalescentwithrecombination |