quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification

A high-quality genome is the basis for studies on functional, evolutionary, and comparative genomics. The majority of attention has been paid to the solution of complex chromosome structures and highly repetitive sequences, along with the emergence of a new ‘telomere-to-telomere (T2T) assembly’ era....

Bibliographic Details
Main Authors: Lin, Yunzhi; Ye, Chen; Li, Xingzhu; Chen, Qinyao; Wu, Ying; Zhang, Feng; Pan, Rui; Zhang, Sijia; Chen, Shuxia; Wang, Xu; Cao, Shuo; Wang, Yingzhen; Yue, Yi; Liu, Yongsheng; Yue, Junyang
Format: Online Article Text
Language: English
Published: Oxford University Press 2023
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10407605/
https://www.ncbi.nlm.nih.gov/pubmed/37560017
http://dx.doi.org/10.1093/hr/uhad127
_version_ 1785086002742689792
author Lin, Yunzhi
Ye, Chen
Li, Xingzhu
Chen, Qinyao
Wu, Ying
Zhang, Feng
Pan, Rui
Zhang, Sijia
Chen, Shuxia
Wang, Xu
Cao, Shuo
Wang, Yingzhen
Yue, Yi
Liu, Yongsheng
Yue, Junyang
author_sort Lin, Yunzhi
collection PubMed
description A high-quality genome is the basis for studies in functional, evolutionary, and comparative genomics. With the emergence of the ‘telomere-to-telomere (T2T) assembly’ era, most attention has been paid to resolving complex chromosome structures and highly repetitive sequences. However, bioinformatic tools for the automatic construction and/or characterization of T2T genomes are limited. Here, we developed a user-friendly web toolkit, quarTeT, which currently includes four modules: AssemblyMapper, GapFiller, TeloExplorer, and CentroMiner. First, AssemblyMapper assembles phased contigs into a chromosome-level genome by referring to a closely related genome. Then, GapFiller attempts to fill all unclosed gaps in a given genome with the aid of additional ultra-long sequences. Finally, TeloExplorer and CentroMiner identify candidate telomeres and centromeres and their locations on each chromosome. These four modules can be used alone or in combination for T2T genome assembly and characterization. As a case study, using all of the quarTeT modules, we produced an Actinidia chinensis genome assembly of a quality comparable to the reported genome Hongyang v4.0, which was assembled with additional manual handling. Further evaluation of CentroMiner on the Arabidopsis thaliana and Oryza sativa genomes showed that quarTeT can identify all the centromeric regions previously detected by experimental methods. Collectively, quarTeT is an efficient toolkit for studies of large-scale T2T genomes and can be accessed at http://www.atcgn.com:8080/quarTeT/home.html without registration.
format Online
Article
Text
id pubmed-10407605
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-10407605 2023-08-09 quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification Hortic Res Article Oxford University Press 2023-06-13 /pmc/articles/PMC10407605/ /pubmed/37560017 http://dx.doi.org/10.1093/hr/uhad127 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nanjing Agricultural University. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification
topic Article