Cargando…

Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication

BACKGROUND: With the development of DNA sequencing technology, static omics profiling in microbial communities, such as taxonomic and functional gene composition determination, has become possible. Additionally, the recently proposed in situ growth rate estimation method allows the applicable range...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Shinya, Yamada, Takuji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7104724/
https://www.ncbi.nlm.nih.gov/pubmed/32257635
http://dx.doi.org/10.7717/peerj.8722
_version_ 1783512292598480896
author Suzuki, Shinya
Yamada, Takuji
author_facet Suzuki, Shinya
Yamada, Takuji
author_sort Suzuki, Shinya
collection PubMed
description BACKGROUND: With the development of DNA sequencing technology, static omics profiling in microbial communities, such as taxonomic and functional gene composition determination, has become possible. Additionally, the recently proposed in situ growth rate estimation method allows the applicable range of current comparative metagenomics to be extended to dynamic profiling. However, with this method, the applicable target range is presently limited. Furthermore, the characteristics of coverage depth during replication have not been sufficiently investigated. RESULTS: We developed a probabilistic model that mimics coverage depth dynamics. This statistical model explains the bias that occurs in the coverage depth due to DNA replication and errors that arise from coverage depth observation. Although our method requires a complete genome sequence, it involves a stable to low coverage depth (>0.01×). We also evaluated the estimation using real whole-genome sequence datasets and reproduced the growth dynamics observed in previous studies. By utilizing a circular distribution in the model, our method facilitates the quantification of unmeasured coverage depth features, including peakedness, skewness, and degree of density, around the replication origin. When we applied the model to time-series culture samples, the skewness parameter, which indicates the asymmetry, was stable over time; however, the peakedness and degree of density parameters, which indicate the concentration level at the replication origin, changed dynamically. Furthermore, we demonstrated the activity measurement of multiple replication origins in a single chromosome. CONCLUSIONS: We devised a novel framework for quantifying coverage depth dynamics. Our study is expected to serve as a basis for replication activity estimation from a broader perspective using the statistical model.
format Online
Article
Text
id pubmed-7104724
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-71047242020-04-02 Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication Suzuki, Shinya Yamada, Takuji PeerJ Bioinformatics BACKGROUND: With the development of DNA sequencing technology, static omics profiling in microbial communities, such as taxonomic and functional gene composition determination, has become possible. Additionally, the recently proposed in situ growth rate estimation method allows the applicable range of current comparative metagenomics to be extended to dynamic profiling. However, with this method, the applicable target range is presently limited. Furthermore, the characteristics of coverage depth during replication have not been sufficiently investigated. RESULTS: We developed a probabilistic model that mimics coverage depth dynamics. This statistical model explains the bias that occurs in the coverage depth due to DNA replication and errors that arise from coverage depth observation. Although our method requires a complete genome sequence, it involves a stable to low coverage depth (>0.01×). We also evaluated the estimation using real whole-genome sequence datasets and reproduced the growth dynamics observed in previous studies. By utilizing a circular distribution in the model, our method facilitates the quantification of unmeasured coverage depth features, including peakedness, skewness, and degree of density, around the replication origin. When we applied the model to time-series culture samples, the skewness parameter, which indicates the asymmetry, was stable over time; however, the peakedness and degree of density parameters, which indicate the concentration level at the replication origin, changed dynamically. Furthermore, we demonstrated the activity measurement of multiple replication origins in a single chromosome. CONCLUSIONS: We devised a novel framework for quantifying coverage depth dynamics. Our study is expected to serve as a basis for replication activity estimation from a broader perspective using the statistical model. PeerJ Inc. 2020-03-27 /pmc/articles/PMC7104724/ /pubmed/32257635 http://dx.doi.org/10.7717/peerj.8722 Text en ©2020 Suzuki and Yamada https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Suzuki, Shinya
Yamada, Takuji
Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title_full Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title_fullStr Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title_full_unstemmed Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title_short Probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from DNA replication
title_sort probabilistic model based on circular statistics for quantifying coverage depth dynamics originating from dna replication
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7104724/
https://www.ncbi.nlm.nih.gov/pubmed/32257635
http://dx.doi.org/10.7717/peerj.8722
work_keys_str_mv AT suzukishinya probabilisticmodelbasedoncircularstatisticsforquantifyingcoveragedepthdynamicsoriginatingfromdnareplication
AT yamadatakuji probabilisticmodelbasedoncircularstatisticsforquantifyingcoveragedepthdynamicsoriginatingfromdnareplication