Cargando…

Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies

New generation sequencing technologies offer unique opportunities and challenges for re-sequencing studies. In this article, we focus on re-sequencing experiments using the Solexa technology, based on bacterial artificial chromosome (BAC) clones, and address an experimental design problem. In these...

Descripción completa

Detalles Bibliográficos
Autores principales: Hajirasouliha, Iman, Hormozdiari, Fereydoun, Sahinalp, S. Cenk, Birol, Inanc
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718651/
https://www.ncbi.nlm.nih.gov/pubmed/18586730
http://dx.doi.org/10.1093/bioinformatics/btn173
_version_ 1782170008372117504
author Hajirasouliha, Iman
Hormozdiari, Fereydoun
Sahinalp, S. Cenk
Birol, Inanc
author_facet Hajirasouliha, Iman
Hormozdiari, Fereydoun
Sahinalp, S. Cenk
Birol, Inanc
author_sort Hajirasouliha, Iman
collection PubMed
description New generation sequencing technologies offer unique opportunities and challenges for re-sequencing studies. In this article, we focus on re-sequencing experiments using the Solexa technology, based on bacterial artificial chromosome (BAC) clones, and address an experimental design problem. In these specific experiments, approximate coordinates of the BACs on a reference genome are known, and fine-scale differences between the BAC sequences and the reference are of interest. The high-throughput characteristics of the sequencing technology makes it possible to multiplex BAC sequencing experiments by pooling BACs for a cost-effective operation. However, the way BACs are pooled in such re-sequencing experiments has an effect on the downstream analysis of the generated data, mostly due to subsequences common to multiple BACs. The experimental design strategy we develop in this article offers combinatorial solutions based on approximation algorithms for the well-known max n-cut problem and the related max n-section problem on hypergraphs. Our algorithms, when applied to a number of sample cases give more than a 2-fold performance improvement over random partitioning. Contact:cenk@cs.sfu.ca
format Text
id pubmed-2718651
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186512009-07-31 Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies Hajirasouliha, Iman Hormozdiari, Fereydoun Sahinalp, S. Cenk Birol, Inanc Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto New generation sequencing technologies offer unique opportunities and challenges for re-sequencing studies. In this article, we focus on re-sequencing experiments using the Solexa technology, based on bacterial artificial chromosome (BAC) clones, and address an experimental design problem. In these specific experiments, approximate coordinates of the BACs on a reference genome are known, and fine-scale differences between the BAC sequences and the reference are of interest. The high-throughput characteristics of the sequencing technology makes it possible to multiplex BAC sequencing experiments by pooling BACs for a cost-effective operation. However, the way BACs are pooled in such re-sequencing experiments has an effect on the downstream analysis of the generated data, mostly due to subsequences common to multiple BACs. The experimental design strategy we develop in this article offers combinatorial solutions based on approximation algorithms for the well-known max n-cut problem and the related max n-section problem on hypergraphs. Our algorithms, when applied to a number of sample cases give more than a 2-fold performance improvement over random partitioning. Contact:cenk@cs.sfu.ca Oxford University Press 2008-07-01 /pmc/articles/PMC2718651/ /pubmed/18586730 http://dx.doi.org/10.1093/bioinformatics/btn173 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Hajirasouliha, Iman
Hormozdiari, Fereydoun
Sahinalp, S. Cenk
Birol, Inanc
Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title_full Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title_fullStr Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title_full_unstemmed Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title_short Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
title_sort optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718651/
https://www.ncbi.nlm.nih.gov/pubmed/18586730
http://dx.doi.org/10.1093/bioinformatics/btn173
work_keys_str_mv AT hajirasoulihaiman optimalpoolingforgenomeresequencingwithultrahighthroughputshortreadtechnologies
AT hormozdiarifereydoun optimalpoolingforgenomeresequencingwithultrahighthroughputshortreadtechnologies
AT sahinalpscenk optimalpoolingforgenomeresequencingwithultrahighthroughputshortreadtechnologies
AT birolinanc optimalpoolingforgenomeresequencingwithultrahighthroughputshortreadtechnologies