Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula

BACKGROUND: Third generation sequencing technologies, with sequencing reads in the tens of kilobases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family...

Bibliographic Details
Main Authors: Moll, Karen M., Zhou, Peng, Ramaraj, Thiruvarangan, Fajardo, Diego, Devitt, Nicholas P., Sadowsky, Michael J., Stupar, Robert M., Tiffin, Peter, Miller, Jason R., Young, Nevin D., Silverstein, Kevin A. T., Mudge, Joann
Format: Online Article Text
Language: English
Published: BioMed Central 2017
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5545040/
https://www.ncbi.nlm.nih.gov/pubmed/28778149
http://dx.doi.org/10.1186/s12864-017-3971-4
_version_ 1783255355985231872
author Moll, Karen M.
Zhou, Peng
Ramaraj, Thiruvarangan
Fajardo, Diego
Devitt, Nicholas P.
Sadowsky, Michael J.
Stupar, Robert M.
Tiffin, Peter
Miller, Jason R.
Young, Nevin D.
Silverstein, Kevin A. T.
Mudge, Joann
author_facet Moll, Karen M.
Zhou, Peng
Ramaraj, Thiruvarangan
Fajardo, Diego
Devitt, Nicholas P.
Sadowsky, Michael J.
Stupar, Robert M.
Tiffin, Peter
Miller, Jason R.
Young, Nevin D.
Silverstein, Kevin A. T.
Mudge, Joann
author_sort Moll, Karen M.
collection PubMed
description BACKGROUND: Third generation sequencing technologies, with sequencing reads in the tens of kilobases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. RESULTS: Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. CONCLUSIONS: Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. 
This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3971-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5545040
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5545040 2017-08-07 Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula Moll, Karen M. Zhou, Peng Ramaraj, Thiruvarangan Fajardo, Diego Devitt, Nicholas P. Sadowsky, Michael J. Stupar, Robert M. Tiffin, Peter Miller, Jason R. Young, Nevin D. Silverstein, Kevin A. T. Mudge, Joann BMC Genomics Research Article BACKGROUND: Third generation sequencing technologies, with sequencing reads in the tens of kilobases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. RESULTS: Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. 
As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. CONCLUSIONS: Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3971-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-04 /pmc/articles/PMC5545040/ /pubmed/28778149 http://dx.doi.org/10.1186/s12864-017-3971-4 Text en © The Author(s). 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Moll, Karen M.
Zhou, Peng
Ramaraj, Thiruvarangan
Fajardo, Diego
Devitt, Nicholas P.
Sadowsky, Michael J.
Stupar, Robert M.
Tiffin, Peter
Miller, Jason R.
Young, Nevin D.
Silverstein, Kevin A. T.
Mudge, Joann
Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title_full Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title_fullStr Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title_full_unstemmed Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title_short Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
title_sort strategies for optimizing bionano and dovetail explored through a second reference quality assembly for the legume model, medicago truncatula
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5545040/
https://www.ncbi.nlm.nih.gov/pubmed/28778149
http://dx.doi.org/10.1186/s12864-017-3971-4