Cargando…

Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly

Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Tingting, Wang, Le, Rimbert, Hélène, Rodriguez, Juan C., Deal, Karin R., De Oliveira, Romain, Choulet, Frédéric, Keeble‐Gagnère, Gabriel, Tibbits, Josquin, Rogers, Jane, Eversole, Kellye, Appels, Rudi, Gu, Yong Q., Mascher, Martin, Dvorak, Jan, Luo, Ming‐Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360199/
https://www.ncbi.nlm.nih.gov/pubmed/33893684
http://dx.doi.org/10.1111/tpj.15289
_version_ 1783737699653386240
author Zhu, Tingting
Wang, Le
Rimbert, Hélène
Rodriguez, Juan C.
Deal, Karin R.
De Oliveira, Romain
Choulet, Frédéric
Keeble‐Gagnère, Gabriel
Tibbits, Josquin
Rogers, Jane
Eversole, Kellye
Appels, Rudi
Gu, Yong Q.
Mascher, Martin
Dvorak, Jan
Luo, Ming‐Cheng
author_facet Zhu, Tingting
Wang, Le
Rimbert, Hélène
Rodriguez, Juan C.
Deal, Karin R.
De Oliveira, Romain
Choulet, Frédéric
Keeble‐Gagnère, Gabriel
Tibbits, Josquin
Rogers, Jane
Eversole, Kellye
Appels, Rudi
Gu, Yong Q.
Mascher, Martin
Dvorak, Jan
Luo, Ming‐Cheng
author_sort Zhu, Tingting
collection PubMed
description Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high‐confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
format Online
Article
Text
id pubmed-8360199
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83601992021-08-17 Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly Zhu, Tingting Wang, Le Rimbert, Hélène Rodriguez, Juan C. Deal, Karin R. De Oliveira, Romain Choulet, Frédéric Keeble‐Gagnère, Gabriel Tibbits, Josquin Rogers, Jane Eversole, Kellye Appels, Rudi Gu, Yong Q. Mascher, Martin Dvorak, Jan Luo, Ming‐Cheng Plant J Resource Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high‐confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat. John Wiley and Sons Inc. 2021-05-16 2021-07 /pmc/articles/PMC8360199/ /pubmed/33893684 http://dx.doi.org/10.1111/tpj.15289 Text en © 2021 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Resource
Zhu, Tingting
Wang, Le
Rimbert, Hélène
Rodriguez, Juan C.
Deal, Karin R.
De Oliveira, Romain
Choulet, Frédéric
Keeble‐Gagnère, Gabriel
Tibbits, Josquin
Rogers, Jane
Eversole, Kellye
Appels, Rudi
Gu, Yong Q.
Mascher, Martin
Dvorak, Jan
Luo, Ming‐Cheng
Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title_full Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title_fullStr Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title_full_unstemmed Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title_short Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly
title_sort optical maps refine the bread wheat triticum aestivum cv. chinese spring genome assembly
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360199/
https://www.ncbi.nlm.nih.gov/pubmed/33893684
http://dx.doi.org/10.1111/tpj.15289
work_keys_str_mv AT zhutingting opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT wangle opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT rimberthelene opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT rodriguezjuanc opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT dealkarinr opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT deoliveiraromain opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT chouletfrederic opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT keeblegagneregabriel opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT tibbitsjosquin opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT rogersjane opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT eversolekellye opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT appelsrudi opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT guyongq opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT maschermartin opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT dvorakjan opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly
AT luomingcheng opticalmapsrefinethebreadwheattriticumaestivumcvchinesespringgenomeassembly