Cargando…
Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome
The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241371/ https://www.ncbi.nlm.nih.gov/pubmed/35338551 http://dx.doi.org/10.1111/pbi.13816 |
_version_ | 1784737791478857728 |
---|---|
author | Navrátilová, Pavla Toegelová, Helena Tulpová, Zuzana Kuo, Yi‐Tzu Stein, Nils Doležel, Jaroslav Houben, Andreas Šimková, Hana Mascher, Martin |
author_facet | Navrátilová, Pavla Toegelová, Helena Tulpová, Zuzana Kuo, Yi‐Tzu Stein, Nils Doležel, Jaroslav Houben, Andreas Šimková, Hana Mascher, Martin |
author_sort | Navrátilová, Pavla |
collection | PubMed |
description | The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence assemblies with a few tens of contigs per chromosome, that is a number small enough to allow for a systematic inquiry into the causes of the remaining sequence gaps and the approaches and resources needed to close them. Here, we analyse sequence gaps in the current reference genome sequence of barley cv. Morex (MorexV3). Optical map and sequence raw data, complemented by ChIP‐seq data for centromeric histone variant CENH3, were used to estimate the abundance of centromeric, ribosomal DNA, and subtelomeric repeats in the barley genome. These estimates were compared with copy numbers in the MorexV3 pseudomolecule sequence. We found that almost all centromeric sequences and 45S ribosomal DNA repeat arrays were absent from the MorexV3 pseudomolecules and that the majority of sequence gaps can be attributed to assembly breakdown in long stretches of satellite repeats. However, missing sequences cannot fully account for the difference between assembly size and flow cytometric genome size estimates. We discuss the prospects of gap closure with ultra‐long sequence reads. |
format | Online Article Text |
id | pubmed-9241371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92413712022-07-01 Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome Navrátilová, Pavla Toegelová, Helena Tulpová, Zuzana Kuo, Yi‐Tzu Stein, Nils Doležel, Jaroslav Houben, Andreas Šimková, Hana Mascher, Martin Plant Biotechnol J Research Articles The first gapless, telomere‐to‐telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long‐read sequencing have made it possible to achieve highly contiguous sequence assemblies with a few tens of contigs per chromosome, that is a number small enough to allow for a systematic inquiry into the causes of the remaining sequence gaps and the approaches and resources needed to close them. Here, we analyse sequence gaps in the current reference genome sequence of barley cv. Morex (MorexV3). Optical map and sequence raw data, complemented by ChIP‐seq data for centromeric histone variant CENH3, were used to estimate the abundance of centromeric, ribosomal DNA, and subtelomeric repeats in the barley genome. These estimates were compared with copy numbers in the MorexV3 pseudomolecule sequence. We found that almost all centromeric sequences and 45S ribosomal DNA repeat arrays were absent from the MorexV3 pseudomolecules and that the majority of sequence gaps can be attributed to assembly breakdown in long stretches of satellite repeats. However, missing sequences cannot fully account for the difference between assembly size and flow cytometric genome size estimates. We discuss the prospects of gap closure with ultra‐long sequence reads. John Wiley and Sons Inc. 2022-04-07 2022-07 /pmc/articles/PMC9241371/ /pubmed/35338551 http://dx.doi.org/10.1111/pbi.13816 Text en © 2022 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Navrátilová, Pavla Toegelová, Helena Tulpová, Zuzana Kuo, Yi‐Tzu Stein, Nils Doležel, Jaroslav Houben, Andreas Šimková, Hana Mascher, Martin Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title | Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title_full | Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title_fullStr | Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title_full_unstemmed | Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title_short | Prospects of telomere‐to‐telomere assembly in barley: Analysis of sequence gaps in the MorexV3 reference genome |
title_sort | prospects of telomere‐to‐telomere assembly in barley: analysis of sequence gaps in the morexv3 reference genome |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241371/ https://www.ncbi.nlm.nih.gov/pubmed/35338551 http://dx.doi.org/10.1111/pbi.13816 |
work_keys_str_mv | AT navratilovapavla prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT toegelovahelena prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT tulpovazuzana prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT kuoyitzu prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT steinnils prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT dolezeljaroslav prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT houbenandreas prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT simkovahana prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome AT maschermartin prospectsoftelomeretotelomereassemblyinbarleyanalysisofsequencegapsinthemorexv3referencegenome |