Cargando…

A stitch in time: Efficient computation of genomic DNA melting bubbles

BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient...

Descripción completa

Detalles Bibliográficos
Autor principal: Tøstesen, Eivind
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553405/
https://www.ncbi.nlm.nih.gov/pubmed/18637171
http://dx.doi.org/10.1186/1748-7188-3-10
_version_ 1782159500641304576
author Tøstesen, Eivind
author_facet Tøstesen, Eivind
author_sort Tøstesen, Eivind
collection PubMed
description BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. CONCLUSION: Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions.
format Text
id pubmed-2553405
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25534052008-09-26 A stitch in time: Efficient computation of genomic DNA melting bubbles Tøstesen, Eivind Algorithms Mol Biol Research BACKGROUND: It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS: An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. CONCLUSION: Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions. BioMed Central 2008-07-17 /pmc/articles/PMC2553405/ /pubmed/18637171 http://dx.doi.org/10.1186/1748-7188-3-10 Text en Copyright © 2008 Tøstesen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Tøstesen, Eivind
A stitch in time: Efficient computation of genomic DNA melting bubbles
title A stitch in time: Efficient computation of genomic DNA melting bubbles
title_full A stitch in time: Efficient computation of genomic DNA melting bubbles
title_fullStr A stitch in time: Efficient computation of genomic DNA melting bubbles
title_full_unstemmed A stitch in time: Efficient computation of genomic DNA melting bubbles
title_short A stitch in time: Efficient computation of genomic DNA melting bubbles
title_sort stitch in time: efficient computation of genomic dna melting bubbles
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553405/
https://www.ncbi.nlm.nih.gov/pubmed/18637171
http://dx.doi.org/10.1186/1748-7188-3-10
work_keys_str_mv AT tøsteseneivind astitchintimeefficientcomputationofgenomicdnameltingbubbles
AT tøsteseneivind stitchintimeefficientcomputationofgenomicdnameltingbubbles