Cargando…
A stitch in time: Efficient computation of genomic DNA melting bubbles
BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553405/ https://www.ncbi.nlm.nih.gov/pubmed/18637171 http://dx.doi.org/10.1186/1748-7188-3-10 |
_version_ | 1782159500641304576 |
---|---|
author | Tøstesen, Eivind |
author_facet | Tøstesen, Eivind |
author_sort | Tøstesen, Eivind |
collection | PubMed |
description | BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. CONCLUSION: Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions. |
format | Text |
id | pubmed-2553405 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-25534052008-09-26 A stitch in time: Efficient computation of genomic DNA melting bubbles Tøstesen, Eivind Algorithms Mol Biol Research BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. CONCLUSION: Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions. BioMed Central 2008-07-17 /pmc/articles/PMC2553405/ /pubmed/18637171 http://dx.doi.org/10.1186/1748-7188-3-10 Text en Copyright © 2008 Tøstesen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Tøstesen, Eivind A stitch in time: Efficient computation of genomic DNA melting bubbles |
title | A stitch in time: Efficient computation of genomic DNA melting bubbles |
title_full | A stitch in time: Efficient computation of genomic DNA melting bubbles |
title_fullStr | A stitch in time: Efficient computation of genomic DNA melting bubbles |
title_full_unstemmed | A stitch in time: Efficient computation of genomic DNA melting bubbles |
title_short | A stitch in time: Efficient computation of genomic DNA melting bubbles |
title_sort | stitch in time: efficient computation of genomic dna melting bubbles |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553405/ https://www.ncbi.nlm.nih.gov/pubmed/18637171 http://dx.doi.org/10.1186/1748-7188-3-10 |
work_keys_str_mv | AT tøsteseneivind astitchintimeefficientcomputationofgenomicdnameltingbubbles AT tøsteseneivind stitchintimeefficientcomputationofgenomicdnameltingbubbles |