Cargando…

Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome

The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Navrátilová, Pavla, Toegelová, Helena, Tulpová, Zuzana, Kuo, Yi‐Tzu, Stein, Nils, Doležel, Jaroslav, Houben, Andreas, Šimková, Hana, Mascher, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241371/
https://www.ncbi.nlm.nih.gov/pubmed/35338551
http://dx.doi.org/10.1111/pbi.13816
_version_ 1784737791478857728
author Navrátilová, Pavla
Toegelová, Helena
Tulpová, Zuzana
Kuo, Yi‐Tzu
Stein, Nils
Doležel, Jaroslav
Houben, Andreas
Šimková, Hana
Mascher, Martin
author_facet Navrátilová, Pavla
Toegelová, Helena
Tulpová, Zuzana
Kuo, Yi‐Tzu
Stein, Nils
Doležel, Jaroslav
Houben, Andreas
Šimková, Hana
Mascher, Martin
author_sort Navrátilová, Pavla
collection PubMed
description The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence assemblies with a few tens of contigs per chromosome, that is a number small enough to allow for a systematic inquiry into the causes of the remaining sequence gaps and the approaches and resources needed to close them. Here, we analyse sequence gaps in the current reference genome sequence of barley cv. Morex (MorexV3). Optical map and sequence raw data, complemented by ChIP‐seq data for centromeric histone variant CENH3, were used to estimate the abundance of centromeric, ribosomal DNA, and subtelomeric repeats in the barley genome. These estimates were compared with copy numbers in the MorexV3 pseudomolecule sequence. We found that almost all centromeric sequences and 45S ribosomal DNA repeat arrays were absent from the MorexV3 pseudomolecules and that the majority of sequence gaps can be attributed to assembly breakdown in long stretches of satellite repeats. However, missing sequences cannot fully account for the difference between assembly size and flow cytometric genome size estimates. We discuss the prospects of gap closure with ultra‐long sequence reads.
format Online
Article
Text
id pubmed-9241371
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92413712022-07-01 Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome Navrátilová, Pavla Toegelová, Helena Tulpová, Zuzana Kuo, Yi‐Tzu Stein, Nils Doležel, Jaroslav Houben, Andreas Šimková, Hana Mascher, Martin Plant Biotechnol J Research Articles The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence assemblies with a few tens of contigs per chromosome, that is a number small enough to allow for a systematic inquiry into the causes of the remaining sequence gaps and the approaches and resources needed to close them. Here, we analyse sequence gaps in the current reference genome sequence of barley cv. Morex (MorexV3). Optical map and sequence raw data, complemented by ChIP‐seq data for centromeric histone variant CENH3, were used to estimate the abundance of centromeric, ribosomal DNA, and subtelomeric repeats in the barley genome. These estimates were compared with copy numbers in the MorexV3 pseudomolecule sequence. We found that almost all centromeric sequences and 45S ribosomal DNA repeat arrays were absent from the MorexV3 pseudomolecules and that the majority of sequence gaps can be attributed to assembly breakdown in long stretches of satellite repeats. However, missing sequences cannot fully account for the difference between assembly size and flow cytometric genome size estimates. We discuss the prospects of gap closure with ultra‐long sequence reads. John Wiley and Sons Inc. 2022-04-07 2022-07 /pmc/articles/PMC9241371/ /pubmed/35338551 http://dx.doi.org/10.1111/pbi.13816 Text en © 2022 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Navrátilová, Pavla
Toegelová, Helena
Tulpová, Zuzana
Kuo, Yi‐Tzu
Stein, Nils
Doležel, Jaroslav
Houben, Andreas
Šimková, Hana
Mascher, Martin
Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title_full Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title_fullStr Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title_full_unstemmed Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title_short Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
title_sort prospects of telomere‐to‐telomere assembly in barley: analysis of sequence gaps in the morexv3 reference genome
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241371/
https://www.ncbi.nlm.nih.gov/pubmed/35338551
http://dx.doi.org/10.1111/pbi.13816
work_keys_str_mv AT navratilovapavla prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT toegelovahelena prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT tulpovazuzana prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT kuoyitzu prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT steinnils prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT dolezeljaroslav prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT houbenandreas prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT simkovahana prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome
AT maschermartin prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome