Cargando…

Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data

BACKGROUND: Pausing of DNA polymerase can indicate the presence of a DNA structure that differs from the canonical double-helix. Here we detail a method to investigate how polymerase pausing in the Pacific Biosciences sequencer reads can be related to DNA sequences. The Pacific Biosciences sequencer...

Descripción completa

Detalles Bibliográficos
Autores principales: Sawaya, Sterling, Boocock, James, Black, Michael A, Gemmell, Neil J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384361/
https://www.ncbi.nlm.nih.gov/pubmed/25626999
http://dx.doi.org/10.1186/s12859-014-0449-0
_version_ 1782364891642855424
author Sawaya, Sterling
Boocock, James
Black, Michael A
Gemmell, Neil J
author_facet Sawaya, Sterling
Boocock, James
Black, Michael A
Gemmell, Neil J
author_sort Sawaya, Sterling
collection PubMed
description BACKGROUND: Pausing of DNA polymerase can indicate the presence of a DNA structure that differs from the canonical double-helix. Here we detail a method to investigate how polymerase pausing in the Pacific Biosciences sequencer reads can be related to DNA sequences. The Pacific Biosciences sequencer uses optics to view a polymerase and its interaction with a single DNA molecule in real-time, offering a unique way to detect potential alternative DNA structures. RESULTS: We have developed a new way to examine polymerase kinetics data and relate it to the DNA sequence by using a wavelet transform of read information from the sequencer. We use this method to examine how polymerase kinetics are related to nucleotide base composition. We then examine tandem repeat sequences known for their ability to form different DNA structures: (CGG)n and (CG)n repeats which can, respectively, form G-quadruplex DNA and Z-DNA. We find pausing around the (CGG)n repeat that may indicate the presence of G-quadruplexes in some of the sequencer reads. The (CG)n repeat does not appear to cause polymerase pausing, but its kinetics signature nevertheless suggests the possibility that alternative nucleotide conformations may sometimes be present. CONCLUSION: We discuss the implications of using our method to discover DNA sequences capable of forming alternative structures. The analyses presented here can be reproduced on any Pacific Biosciences kinetics data for any DNA pattern of interest using an R package that we have made publicly available.
format Online
Article
Text
id pubmed-4384361
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43843612015-04-04 Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data Sawaya, Sterling Boocock, James Black, Michael A Gemmell, Neil J BMC Bioinformatics Methodology Article BACKGROUND: Pausing of DNA polymerase can indicate the presence of a DNA structure that differs from the canonical double-helix. Here we detail a method to investigate how polymerase pausing in the Pacific Biosciences sequencer reads can be related to DNA sequences. The Pacific Biosciences sequencer uses optics to view a polymerase and its interaction with a single DNA molecule in real-time, offering a unique way to detect potential alternative DNA structures. RESULTS: We have developed a new way to examine polymerase kinetics data and relate it to the DNA sequence by using a wavelet transform of read information from the sequencer. We use this method to examine how polymerase kinetics are related to nucleotide base composition. We then examine tandem repeat sequences known for their ability to form different DNA structures: (CGG)n and (CG)n repeats which can, respectively, form G-quadruplex DNA and Z-DNA. We find pausing around the (CGG)n repeat that may indicate the presence of G-quadruplexes in some of the sequencer reads. The (CG)n repeat does not appear to cause polymerase pausing, but its kinetics signature nevertheless suggests the possibility that alternative nucleotide conformations may sometimes be present. CONCLUSION: We discuss the implications of using our method to discover DNA sequences capable of forming alternative structures. The analyses presented here can be reproduced on any Pacific Biosciences kinetics data for any DNA pattern of interest using an R package that we have made publicly available. BioMed Central 2015-01-28 /pmc/articles/PMC4384361/ /pubmed/25626999 http://dx.doi.org/10.1186/s12859-014-0449-0 Text en © Sawaya et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sawaya, Sterling
Boocock, James
Black, Michael A
Gemmell, Neil J
Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title_full Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title_fullStr Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title_full_unstemmed Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title_short Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data
title_sort exploring possible dna structures in real-time polymerase kinetics using pacific biosciences sequencer data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384361/
https://www.ncbi.nlm.nih.gov/pubmed/25626999
http://dx.doi.org/10.1186/s12859-014-0449-0
work_keys_str_mv AT sawayasterling exploringpossiblednastructuresinrealtimepolymerasekineticsusingpacificbiosciencessequencerdata
AT boocockjames exploringpossiblednastructuresinrealtimepolymerasekineticsusingpacificbiosciencessequencerdata
AT blackmichaela exploringpossiblednastructuresinrealtimepolymerasekineticsusingpacificbiosciencessequencerdata
AT gemmellneilj exploringpossiblednastructuresinrealtimepolymerasekineticsusingpacificbiosciencessequencerdata