Cargando…

Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph

Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing t...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Haoyu, Asri, Mobin, Lucas, Julian, Koren, Sergey, Li, Heng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274930/
https://www.ncbi.nlm.nih.gov/pubmed/37332563
_version_ 1785059812083499008
author Cheng, Haoyu
Asri, Mobin
Lucas, Julian
Koren, Sergey
Li, Heng
author_facet Cheng, Haoyu
Asri, Mobin
Lucas, Julian
Koren, Sergey
Li, Heng
author_sort Cheng, Haoyu
collection PubMed
description Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude cheaper than existing methods, while producing better diploid and haploid assemblies. Notably, our algorithm is the only feasible solution to the haplotype-resolved assembly of polyploid genomes.
format Online
Article
Text
id pubmed-10274930
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-102749302023-06-17 Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph Cheng, Haoyu Asri, Mobin Lucas, Julian Koren, Sergey Li, Heng ArXiv Article Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude cheaper than existing methods, while producing better diploid and haploid assemblies. Notably, our algorithm is the only feasible solution to the haplotype-resolved assembly of polyploid genomes. Cornell University 2023-06-06 /pmc/articles/PMC10274930/ /pubmed/37332563 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Cheng, Haoyu
Asri, Mobin
Lucas, Julian
Koren, Sergey
Li, Heng
Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title_full Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title_fullStr Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title_full_unstemmed Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title_short Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
title_sort scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274930/
https://www.ncbi.nlm.nih.gov/pubmed/37332563
work_keys_str_mv AT chenghaoyu scalabletelomeretotelomereassemblyfordiploidandpolyploidgenomeswithdoublegraph
AT asrimobin scalabletelomeretotelomereassemblyfordiploidandpolyploidgenomeswithdoublegraph
AT lucasjulian scalabletelomeretotelomereassemblyfordiploidandpolyploidgenomeswithdoublegraph
AT korensergey scalabletelomeretotelomereassemblyfordiploidandpolyploidgenomeswithdoublegraph
AT liheng scalabletelomeretotelomereassemblyfordiploidandpolyploidgenomeswithdoublegraph