Cargando…

An efficient approach to BAC based assembly of complex genomes

BACKGROUND: There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sang...

Descripción completa

Detalles Bibliográficos
Autores principales: Visendi, Paul, Berkman, Paul J., Hayashi, Satomi, Golicz, Agnieszka A., Bayer, Philipp E., Ruperao, Pradeep, Hurgobin, Bhavna, Montenegro, Juan, Chan, Chon-Kit Kenneth, Staňková, Helena, Batley, Jacqueline, Šimková, Hana, Doležel, Jaroslav, Edwards, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4719536/
https://www.ncbi.nlm.nih.gov/pubmed/26793268
http://dx.doi.org/10.1186/s13007-016-0107-9
_version_ 1782410949258379264
author Visendi, Paul
Berkman, Paul J.
Hayashi, Satomi
Golicz, Agnieszka A.
Bayer, Philipp E.
Ruperao, Pradeep
Hurgobin, Bhavna
Montenegro, Juan
Chan, Chon-Kit Kenneth
Staňková, Helena
Batley, Jacqueline
Šimková, Hana
Doležel, Jaroslav
Edwards, David
author_facet Visendi, Paul
Berkman, Paul J.
Hayashi, Satomi
Golicz, Agnieszka A.
Bayer, Philipp E.
Ruperao, Pradeep
Hurgobin, Bhavna
Montenegro, Juan
Chan, Chon-Kit Kenneth
Staňková, Helena
Batley, Jacqueline
Šimková, Hana
Doležel, Jaroslav
Edwards, David
author_sort Visendi, Paul
collection PubMed
description BACKGROUND: There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate ‘gold’ reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. RESULTS: We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. CONCLUSIONS: We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13007-016-0107-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4719536
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47195362016-01-21 An efficient approach to BAC based assembly of complex genomes Visendi, Paul Berkman, Paul J. Hayashi, Satomi Golicz, Agnieszka A. Bayer, Philipp E. Ruperao, Pradeep Hurgobin, Bhavna Montenegro, Juan Chan, Chon-Kit Kenneth Staňková, Helena Batley, Jacqueline Šimková, Hana Doležel, Jaroslav Edwards, David Plant Methods Methodology BACKGROUND: There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate ‘gold’ reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. RESULTS: We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. CONCLUSIONS: We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13007-016-0107-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-20 /pmc/articles/PMC4719536/ /pubmed/26793268 http://dx.doi.org/10.1186/s13007-016-0107-9 Text en © Visendi et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Visendi, Paul
Berkman, Paul J.
Hayashi, Satomi
Golicz, Agnieszka A.
Bayer, Philipp E.
Ruperao, Pradeep
Hurgobin, Bhavna
Montenegro, Juan
Chan, Chon-Kit Kenneth
Staňková, Helena
Batley, Jacqueline
Šimková, Hana
Doležel, Jaroslav
Edwards, David
An efficient approach to BAC based assembly of complex genomes
title An efficient approach to BAC based assembly of complex genomes
title_full An efficient approach to BAC based assembly of complex genomes
title_fullStr An efficient approach to BAC based assembly of complex genomes
title_full_unstemmed An efficient approach to BAC based assembly of complex genomes
title_short An efficient approach to BAC based assembly of complex genomes
title_sort efficient approach to bac based assembly of complex genomes
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4719536/
https://www.ncbi.nlm.nih.gov/pubmed/26793268
http://dx.doi.org/10.1186/s13007-016-0107-9
work_keys_str_mv AT visendipaul anefficientapproachtobacbasedassemblyofcomplexgenomes
AT berkmanpaulj anefficientapproachtobacbasedassemblyofcomplexgenomes
AT hayashisatomi anefficientapproachtobacbasedassemblyofcomplexgenomes
AT goliczagnieszkaa anefficientapproachtobacbasedassemblyofcomplexgenomes
AT bayerphilippe anefficientapproachtobacbasedassemblyofcomplexgenomes
AT ruperaopradeep anefficientapproachtobacbasedassemblyofcomplexgenomes
AT hurgobinbhavna anefficientapproachtobacbasedassemblyofcomplexgenomes
AT montenegrojuan anefficientapproachtobacbasedassemblyofcomplexgenomes
AT chanchonkitkenneth anefficientapproachtobacbasedassemblyofcomplexgenomes
AT stankovahelena anefficientapproachtobacbasedassemblyofcomplexgenomes
AT batleyjacqueline anefficientapproachtobacbasedassemblyofcomplexgenomes
AT simkovahana anefficientapproachtobacbasedassemblyofcomplexgenomes
AT dolezeljaroslav anefficientapproachtobacbasedassemblyofcomplexgenomes
AT edwardsdavid anefficientapproachtobacbasedassemblyofcomplexgenomes
AT visendipaul efficientapproachtobacbasedassemblyofcomplexgenomes
AT berkmanpaulj efficientapproachtobacbasedassemblyofcomplexgenomes
AT hayashisatomi efficientapproachtobacbasedassemblyofcomplexgenomes
AT goliczagnieszkaa efficientapproachtobacbasedassemblyofcomplexgenomes
AT bayerphilippe efficientapproachtobacbasedassemblyofcomplexgenomes
AT ruperaopradeep efficientapproachtobacbasedassemblyofcomplexgenomes
AT hurgobinbhavna efficientapproachtobacbasedassemblyofcomplexgenomes
AT montenegrojuan efficientapproachtobacbasedassemblyofcomplexgenomes
AT chanchonkitkenneth efficientapproachtobacbasedassemblyofcomplexgenomes
AT stankovahelena efficientapproachtobacbasedassemblyofcomplexgenomes
AT batleyjacqueline efficientapproachtobacbasedassemblyofcomplexgenomes
AT simkovahana efficientapproachtobacbasedassemblyofcomplexgenomes
AT dolezeljaroslav efficientapproachtobacbasedassemblyofcomplexgenomes
AT edwardsdavid efficientapproachtobacbasedassemblyofcomplexgenomes