Cargando…

Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions

BACKGROUND: Currently, methylotrophic yeasts (e.g., Pichia pastoris, Ogataea polymorpha, and Candida boindii) are subjects of intense genomics studies in basic research and industrial applications. In the genus Ogataea, most research is focused on three basic O. polymorpha strains-CBS4732, NCYC495,...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Jia, Bei, Jinlong, Shao, Qi, Wang, Hemu, Fan, Huan, Yau, Tung On, Bu, Wenjun, Ruan, Jishou, Wei, Dongsheng, Gao, Shan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9019687/
https://www.ncbi.nlm.nih.gov/pubmed/35464988
http://dx.doi.org/10.3389/fmicb.2022.855666
_version_ 1784689346660532224
author Chang, Jia
Bei, Jinlong
Shao, Qi
Wang, Hemu
Fan, Huan
Yau, Tung On
Bu, Wenjun
Ruan, Jishou
Wei, Dongsheng
Gao, Shan
author_facet Chang, Jia
Bei, Jinlong
Shao, Qi
Wang, Hemu
Fan, Huan
Yau, Tung On
Bu, Wenjun
Ruan, Jishou
Wei, Dongsheng
Gao, Shan
author_sort Chang, Jia
collection PubMed
description BACKGROUND: Currently, methylotrophic yeasts (e.g., Pichia pastoris, Ogataea polymorpha, and Candida boindii) are subjects of intense genomics studies in basic research and industrial applications. In the genus Ogataea, most research is focused on three basic O. polymorpha strains-CBS4732, NCYC495, and DL-1. However, the relationship between CBS4732, NCYC495, and DL-1 remains unclear, as the genomic differences between them have not be exactly determined without their high-quality complete genomes. As a nutritionally deficient mutant derived from CBS4732, the O. polymorpha strain CBS4732 ura3Δ (named HU-11) is being used for high-yield production of several important proteins or peptides. HU-11 has the same reference genome as CBS4732 (noted as HU-11/CBS4732), because the only genomic difference between them is a 5-bp insertion. RESULTS: In the present study, we have assembled the full-length genome of O. polymorpha HU-11/CBS4732 using high-depth PacBio and Illumina data. Long terminal repeat retrotransposons (LTR-rts), rDNA, 5′ and 3′ telomeric, subtelomeric, low complexity and other repeat regions were exactly determined to improve the genome quality. In brief, the main findings include complete rDNAs, complete LTR-rts, three large duplicated segments in subtelomeric regions and three structural variations between the HU-11/CBS4732 and NCYC495 genomes. These findings are very important for the assembly of full-length genomes of yeast and the correction of assembly errors in the published genomes of Ogataea spp. HU-11/CBS4732 is so phylogenetically close to NCYC495 that the syntenic regions cover nearly 100% of their genomes. Moreover, HU-11/CBS4732 and NCYC495 share a nucleotide identity of 99.5% through their whole genomes. CBS4732 and NCYC495 can be regarded as the same strain in basic research and industrial applications. CONCLUSION: The present study preliminarily revealed the relationship between CBS4732, NCYC495, and DL-1. Our findings provide new opportunities for in-depth understanding of genome evolution in methylotrophic yeasts and lay the foundations for the industrial applications of O. polymorpha CBS4732, NCYC495, DL-1, and their derivative strains. The full-length genome of O. polymorpha HU-11/CBS4732 should be included into the NCBI RefSeq database for future studies of Ogataea spp.
format Online
Article
Text
id pubmed-9019687
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90196872022-04-21 Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions Chang, Jia Bei, Jinlong Shao, Qi Wang, Hemu Fan, Huan Yau, Tung On Bu, Wenjun Ruan, Jishou Wei, Dongsheng Gao, Shan Front Microbiol Microbiology BACKGROUND: Currently, methylotrophic yeasts (e.g., Pichia pastoris, Ogataea polymorpha, and Candida boindii) are subjects of intense genomics studies in basic research and industrial applications. In the genus Ogataea, most research is focused on three basic O. polymorpha strains-CBS4732, NCYC495, and DL-1. However, the relationship between CBS4732, NCYC495, and DL-1 remains unclear, as the genomic differences between them have not be exactly determined without their high-quality complete genomes. As a nutritionally deficient mutant derived from CBS4732, the O. polymorpha strain CBS4732 ura3Δ (named HU-11) is being used for high-yield production of several important proteins or peptides. HU-11 has the same reference genome as CBS4732 (noted as HU-11/CBS4732), because the only genomic difference between them is a 5-bp insertion. RESULTS: In the present study, we have assembled the full-length genome of O. polymorpha HU-11/CBS4732 using high-depth PacBio and Illumina data. Long terminal repeat retrotransposons (LTR-rts), rDNA, 5′ and 3′ telomeric, subtelomeric, low complexity and other repeat regions were exactly determined to improve the genome quality. In brief, the main findings include complete rDNAs, complete LTR-rts, three large duplicated segments in subtelomeric regions and three structural variations between the HU-11/CBS4732 and NCYC495 genomes. These findings are very important for the assembly of full-length genomes of yeast and the correction of assembly errors in the published genomes of Ogataea spp. HU-11/CBS4732 is so phylogenetically close to NCYC495 that the syntenic regions cover nearly 100% of their genomes. Moreover, HU-11/CBS4732 and NCYC495 share a nucleotide identity of 99.5% through their whole genomes. CBS4732 and NCYC495 can be regarded as the same strain in basic research and industrial applications. CONCLUSION: The present study preliminarily revealed the relationship between CBS4732, NCYC495, and DL-1. Our findings provide new opportunities for in-depth understanding of genome evolution in methylotrophic yeasts and lay the foundations for the industrial applications of O. polymorpha CBS4732, NCYC495, DL-1, and their derivative strains. The full-length genome of O. polymorpha HU-11/CBS4732 should be included into the NCBI RefSeq database for future studies of Ogataea spp. Frontiers Media S.A. 2022-04-04 /pmc/articles/PMC9019687/ /pubmed/35464988 http://dx.doi.org/10.3389/fmicb.2022.855666 Text en Copyright © 2022 Chang, Bei, Shao, Wang, Fan, Yau, Bu, Ruan, Wei and Gao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Chang, Jia
Bei, Jinlong
Shao, Qi
Wang, Hemu
Fan, Huan
Yau, Tung On
Bu, Wenjun
Ruan, Jishou
Wei, Dongsheng
Gao, Shan
Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title_full Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title_fullStr Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title_full_unstemmed Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title_short Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions
title_sort full-length genome of an ogataea polymorpha strain cbs4732 ura3δ reveals large duplicated segments in subtelomeric regions
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9019687/
https://www.ncbi.nlm.nih.gov/pubmed/35464988
http://dx.doi.org/10.3389/fmicb.2022.855666
work_keys_str_mv AT changjia fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT beijinlong fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT shaoqi fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT wanghemu fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT fanhuan fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT yautungon fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT buwenjun fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT ruanjishou fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT weidongsheng fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions
AT gaoshan fulllengthgenomeofanogataeapolymorphastraincbs4732ura3drevealslargeduplicatedsegmentsinsubtelomericregions