Cargando…

Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach

The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequences (sizes 1-100 kilobasepairs). Piecing the contigs together is an especially difficult task for previously unsequenced DNA, and may not be feasible due to factors such as the lack of sufficient covera...

Descripción completa

Detalles Bibliográficos
Autores principales: Dvirnas, Albertas, Pichler, Christoffer, Stewart, Callum L., Quaderi, Saair, Nyberg, Lena K., Müller, Vilhelm, Kumar Bikkarolla, Santosh, Kristiansson, Erik, Sandegren, Linus, Westerlund, Fredrik, Ambjörnsson, Tobias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5844556/
https://www.ncbi.nlm.nih.gov/pubmed/29522539
http://dx.doi.org/10.1371/journal.pone.0193900
_version_ 1783305257207463936
author Dvirnas, Albertas
Pichler, Christoffer
Stewart, Callum L.
Quaderi, Saair
Nyberg, Lena K.
Müller, Vilhelm
Kumar Bikkarolla, Santosh
Kristiansson, Erik
Sandegren, Linus
Westerlund, Fredrik
Ambjörnsson, Tobias
author_facet Dvirnas, Albertas
Pichler, Christoffer
Stewart, Callum L.
Quaderi, Saair
Nyberg, Lena K.
Müller, Vilhelm
Kumar Bikkarolla, Santosh
Kristiansson, Erik
Sandegren, Linus
Westerlund, Fredrik
Ambjörnsson, Tobias
author_sort Dvirnas, Albertas
collection PubMed
description The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequences (sizes 1-100 kilobasepairs). Piecing the contigs together is an especially difficult task for previously unsequenced DNA, and may not be feasible due to factors such as the lack of sufficient coverage or larger repetitive regions which generate gaps in the final sequence. Here we propose a new method for scaffolding such contigs. The proposed method uses densely labeled optical DNA barcodes from competitive binding experiments as scaffolds. On these scaffolds we position theoretical barcodes which are calculated from the contig sequences. This allows us to construct longer DNA sequences from the contig sequences. This proof-of-principle study extends previous studies which use sparsely labeled DNA barcodes for scaffolding purposes. Our method applies a probabilistic approach that allows us to discard “foreign” contigs from mixed samples with contigs from different types of DNA. We satisfy the contig non-overlap constraint by formulating the contig placement challenge as a combinatorial auction problem. Our exact algorithm for solving this problem reduces computational costs compared to previous methods in the combinatorial auction field. We demonstrate the usefulness of the proposed scaffolding method both for synthetic contigs and for contigs obtained using Illumina sequencing for a mixed sample with plasmid and chromosomal DNA.
format Online
Article
Text
id pubmed-5844556
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58445562018-03-23 Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach Dvirnas, Albertas Pichler, Christoffer Stewart, Callum L. Quaderi, Saair Nyberg, Lena K. Müller, Vilhelm Kumar Bikkarolla, Santosh Kristiansson, Erik Sandegren, Linus Westerlund, Fredrik Ambjörnsson, Tobias PLoS One Research Article The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequences (sizes 1-100 kilobasepairs). Piecing the contigs together is an especially difficult task for previously unsequenced DNA, and may not be feasible due to factors such as the lack of sufficient coverage or larger repetitive regions which generate gaps in the final sequence. Here we propose a new method for scaffolding such contigs. The proposed method uses densely labeled optical DNA barcodes from competitive binding experiments as scaffolds. On these scaffolds we position theoretical barcodes which are calculated from the contig sequences. This allows us to construct longer DNA sequences from the contig sequences. This proof-of-principle study extends previous studies which use sparsely labeled DNA barcodes for scaffolding purposes. Our method applies a probabilistic approach that allows us to discard “foreign” contigs from mixed samples with contigs from different types of DNA. We satisfy the contig non-overlap constraint by formulating the contig placement challenge as a combinatorial auction problem. Our exact algorithm for solving this problem reduces computational costs compared to previous methods in the combinatorial auction field. We demonstrate the usefulness of the proposed scaffolding method both for synthetic contigs and for contigs obtained using Illumina sequencing for a mixed sample with plasmid and chromosomal DNA. Public Library of Science 2018-03-09 /pmc/articles/PMC5844556/ /pubmed/29522539 http://dx.doi.org/10.1371/journal.pone.0193900 Text en © 2018 Dvirnas et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dvirnas, Albertas
Pichler, Christoffer
Stewart, Callum L.
Quaderi, Saair
Nyberg, Lena K.
Müller, Vilhelm
Kumar Bikkarolla, Santosh
Kristiansson, Erik
Sandegren, Linus
Westerlund, Fredrik
Ambjörnsson, Tobias
Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title_full Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title_fullStr Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title_full_unstemmed Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title_short Facilitated sequence assembly using densely labeled optical DNA barcodes: A combinatorial auction approach
title_sort facilitated sequence assembly using densely labeled optical dna barcodes: a combinatorial auction approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5844556/
https://www.ncbi.nlm.nih.gov/pubmed/29522539
http://dx.doi.org/10.1371/journal.pone.0193900
work_keys_str_mv AT dvirnasalbertas facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT pichlerchristoffer facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT stewartcalluml facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT quaderisaair facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT nyberglenak facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT mullervilhelm facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT kumarbikkarollasantosh facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT kristianssonerik facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT sandegrenlinus facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT westerlundfredrik facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach
AT ambjornssontobias facilitatedsequenceassemblyusingdenselylabeledopticaldnabarcodesacombinatorialauctionapproach