Cargando…

Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome

BACKGROUND: Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragments sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the...

Descripción completa

Detalles Bibliográficos
Autores principales: Philippe, Romain, Choulet, Frédéric, Paux, Etienne, van Oeveren, Jan, Tang, Jifeng, Wittenberg, Alexander HJ, Janssen, Antoine, van Eijk, Michiel JT, Stormo, Keith, Alberti, Adriana, Wincker, Patrick, Akhunov, Eduard, van der Vossen, Edwin, Feuillet, Catherine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3311077/
https://www.ncbi.nlm.nih.gov/pubmed/22289472
http://dx.doi.org/10.1186/1471-2164-13-47
_version_ 1782227741369696256
author Philippe, Romain
Choulet, Frédéric
Paux, Etienne
van Oeveren, Jan
Tang, Jifeng
Wittenberg, Alexander HJ
Janssen, Antoine
van Eijk, Michiel JT
Stormo, Keith
Alberti, Adriana
Wincker, Patrick
Akhunov, Eduard
van der Vossen, Edwin
Feuillet, Catherine
author_facet Philippe, Romain
Choulet, Frédéric
Paux, Etienne
van Oeveren, Jan
Tang, Jifeng
Wittenberg, Alexander HJ
Janssen, Antoine
van Eijk, Michiel JT
Stormo, Keith
Alberti, Adriana
Wincker, Patrick
Akhunov, Eduard
van der Vossen, Edwin
Feuillet, Catherine
author_sort Philippe, Romain
collection PubMed
description BACKGROUND: Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragments sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the maize physical map. However, restriction fragments sizes shared randomly between two non-overlapping BACs often lead to chimerical contigs and mis-assembled BACs in such large and repetitive genomes. Whole Genome Profiling (WGP™) was developed recently as a new sequence-based physical mapping technology and has the potential to limit this problem. RESULTS: A subset of the wheat 3B chromosome BAC library covering 230 Mb was used to establish a WGP physical map and to compare it to a map obtained with the SNaPshot technology. We first adapted the WGP-based assembly methodology to cope with the complexity of the wheat genome. Then, the results showed that the WGP map covers the same length than the SNaPshot map but with 30% less contigs and, more importantly with 3.5 times less mis-assembled BACs. Finally, we evaluated the benefit of integrating WGP tags in different sequence assemblies obtained after Roche/454 sequencing of BAC pools. We showed that while WGP tag integration improves assemblies performed with unpaired reads and with paired-end reads at low coverage, it does not significantly improve sequence assemblies performed at high coverage (25x) with paired-end reads. CONCLUSIONS: Our results demonstrate that, with a suitable assembly methodology, WGP builds more robust physical maps than the SNaPshot technology in wheat and that WGP can be adapted to any genome. Moreover, WGP tag integration in sequence assemblies improves low quality assembly. However, to achieve a high quality draft sequence assembly, a sequencing depth of 25x paired-end reads is required, at which point WGP tag integration does not provide additional scaffolding value. Finally, we suggest that WGP tags can support the efficient sequencing of BAC pools by enabling reliable assignment of sequence scaffolds to their BAC of origin, a feature that is of great interest when using BAC pooling strategies to reduce the cost of sequencing large genomes.
format Online
Article
Text
id pubmed-3311077
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33110772012-03-24 Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome Philippe, Romain Choulet, Frédéric Paux, Etienne van Oeveren, Jan Tang, Jifeng Wittenberg, Alexander HJ Janssen, Antoine van Eijk, Michiel JT Stormo, Keith Alberti, Adriana Wincker, Patrick Akhunov, Eduard van der Vossen, Edwin Feuillet, Catherine BMC Genomics Methodology Article BACKGROUND: Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragments sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the maize physical map. However, restriction fragments sizes shared randomly between two non-overlapping BACs often lead to chimerical contigs and mis-assembled BACs in such large and repetitive genomes. Whole Genome Profiling (WGP™) was developed recently as a new sequence-based physical mapping technology and has the potential to limit this problem. RESULTS: A subset of the wheat 3B chromosome BAC library covering 230 Mb was used to establish a WGP physical map and to compare it to a map obtained with the SNaPshot technology. We first adapted the WGP-based assembly methodology to cope with the complexity of the wheat genome. Then, the results showed that the WGP map covers the same length than the SNaPshot map but with 30% less contigs and, more importantly with 3.5 times less mis-assembled BACs. Finally, we evaluated the benefit of integrating WGP tags in different sequence assemblies obtained after Roche/454 sequencing of BAC pools. We showed that while WGP tag integration improves assemblies performed with unpaired reads and with paired-end reads at low coverage, it does not significantly improve sequence assemblies performed at high coverage (25x) with paired-end reads. CONCLUSIONS: Our results demonstrate that, with a suitable assembly methodology, WGP builds more robust physical maps than the SNaPshot technology in wheat and that WGP can be adapted to any genome. Moreover, WGP tag integration in sequence assemblies improves low quality assembly. However, to achieve a high quality draft sequence assembly, a sequencing depth of 25x paired-end reads is required, at which point WGP tag integration does not provide additional scaffolding value. Finally, we suggest that WGP tags can support the efficient sequencing of BAC pools by enabling reliable assignment of sequence scaffolds to their BAC of origin, a feature that is of great interest when using BAC pooling strategies to reduce the cost of sequencing large genomes. BioMed Central 2012-01-30 /pmc/articles/PMC3311077/ /pubmed/22289472 http://dx.doi.org/10.1186/1471-2164-13-47 Text en Copyright ©2012 Philippe et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Philippe, Romain
Choulet, Frédéric
Paux, Etienne
van Oeveren, Jan
Tang, Jifeng
Wittenberg, Alexander HJ
Janssen, Antoine
van Eijk, Michiel JT
Stormo, Keith
Alberti, Adriana
Wincker, Patrick
Akhunov, Eduard
van der Vossen, Edwin
Feuillet, Catherine
Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title_full Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title_fullStr Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title_full_unstemmed Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title_short Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
title_sort whole genome profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3311077/
https://www.ncbi.nlm.nih.gov/pubmed/22289472
http://dx.doi.org/10.1186/1471-2164-13-47
work_keys_str_mv AT philipperomain wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT chouletfrederic wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT pauxetienne wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT vanoeverenjan wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT tangjifeng wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT wittenbergalexanderhj wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT janssenantoine wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT vaneijkmichieljt wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT stormokeith wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT albertiadriana wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT winckerpatrick wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT akhunoveduard wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT vandervossenedwin wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome
AT feuilletcatherine wholegenomeprofilingprovidesarobustframeworkforphysicalmappingandsequencinginthehighlycomplexandrepetitivewheatgenome