Cargando…

Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters

BACKGROUND: Trichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis, and plant–fungal int...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wan-Chen, Huang, Chien-Hao, Chen, Chia-Ling, Chuang, Yu-Chien, Tung, Shu-Yun, Wang, Ting-Fang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5496416/
https://www.ncbi.nlm.nih.gov/pubmed/28690679
http://dx.doi.org/10.1186/s13068-017-0825-x
_version_ 1783247976449179648
author Li, Wan-Chen
Huang, Chien-Hao
Chen, Chia-Ling
Chuang, Yu-Chien
Tung, Shu-Yun
Wang, Ting-Fang
author_facet Li, Wan-Chen
Huang, Chien-Hao
Chen, Chia-Ling
Chuang, Yu-Chien
Tung, Shu-Yun
Wang, Ting-Fang
author_sort Li, Wan-Chen
collection PubMed
description BACKGROUND: Trichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis, and plant–fungal interactions. The genomes of QM6a and its high enzyme-producing mutants have been sequenced by second-generation-sequencing methods and are publicly available from the Joint Genome Institute. While these genome sequences have offered useful information for genomic and transcriptomic studies, their limitations and especially their short read lengths make them poorly suited for some particular biological problems, including assembly, genome-wide determination of chromosome architecture, and genetic modification or engineering. RESULTS: We integrated Pacific Biosciences and Illumina sequencing platforms for the highest-quality genome assembly yet achieved, revealing seven telomere-to-telomere chromosomes (34,922,528 bp; 10877 genes) with 1630 newly predicted genes and >1.5 Mb of new sequences. Most new sequences are located on AT-rich blocks, including 7 centromeres, 14 subtelomeres, and 2329 interspersed AT-rich blocks. The seven QM6a centromeres separately consist of 24 conserved repeats and 37 putative centromere-encoded genes. These findings open up a new perspective for future centromere and chromosome architecture studies. Next, we demonstrate that sexual crossing readily induced cytosine-to-thymine point mutations on both tandem and unlinked duplicated sequences. We also show by bioinformatic analysis that T. reesei has evolved a robust repeat-induced point mutation (RIP) system to accumulate AT-rich sequences, with longer AT-rich blocks having more RIP mutations. The widespread distribution of AT-rich blocks correlates genome-wide partitions with gene clusters, explaining why clustering of genes has been reported to not influence gene expression in T. reesei. CONCLUSION: Compartmentation of ancestral gene clusters by AT-rich blocks might promote flexibilities that are evolutionarily advantageous in this fungus’ soil habitats and other natural environments. Our analyses, together with the complete genome sequence, provide a better blueprint for biotechnological and industrial applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13068-017-0825-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5496416
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54964162017-07-07 Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters Li, Wan-Chen Huang, Chien-Hao Chen, Chia-Ling Chuang, Yu-Chien Tung, Shu-Yun Wang, Ting-Fang Biotechnol Biofuels Research BACKGROUND: Trichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis, and plant–fungal interactions. The genomes of QM6a and its high enzyme-producing mutants have been sequenced by second-generation-sequencing methods and are publicly available from the Joint Genome Institute. While these genome sequences have offered useful information for genomic and transcriptomic studies, their limitations and especially their short read lengths make them poorly suited for some particular biological problems, including assembly, genome-wide determination of chromosome architecture, and genetic modification or engineering. RESULTS: We integrated Pacific Biosciences and Illumina sequencing platforms for the highest-quality genome assembly yet achieved, revealing seven telomere-to-telomere chromosomes (34,922,528 bp; 10877 genes) with 1630 newly predicted genes and >1.5 Mb of new sequences. Most new sequences are located on AT-rich blocks, including 7 centromeres, 14 subtelomeres, and 2329 interspersed AT-rich blocks. The seven QM6a centromeres separately consist of 24 conserved repeats and 37 putative centromere-encoded genes. These findings open up a new perspective for future centromere and chromosome architecture studies. Next, we demonstrate that sexual crossing readily induced cytosine-to-thymine point mutations on both tandem and unlinked duplicated sequences. We also show by bioinformatic analysis that T. reesei has evolved a robust repeat-induced point mutation (RIP) system to accumulate AT-rich sequences, with longer AT-rich blocks having more RIP mutations. The widespread distribution of AT-rich blocks correlates genome-wide partitions with gene clusters, explaining why clustering of genes has been reported to not influence gene expression in T. reesei. CONCLUSION: Compartmentation of ancestral gene clusters by AT-rich blocks might promote flexibilities that are evolutionarily advantageous in this fungus’ soil habitats and other natural environments. Our analyses, together with the complete genome sequence, provide a better blueprint for biotechnological and industrial applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13068-017-0825-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-03 /pmc/articles/PMC5496416/ /pubmed/28690679 http://dx.doi.org/10.1186/s13068-017-0825-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Li, Wan-Chen
Huang, Chien-Hao
Chen, Chia-Ling
Chuang, Yu-Chien
Tung, Shu-Yun
Wang, Ting-Fang
Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title_full Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title_fullStr Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title_full_unstemmed Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title_short Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters
title_sort trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of cazyme gene clusters
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5496416/
https://www.ncbi.nlm.nih.gov/pubmed/28690679
http://dx.doi.org/10.1186/s13068-017-0825-x
work_keys_str_mv AT liwanchen trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters
AT huangchienhao trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters
AT chenchialing trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters
AT chuangyuchien trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters
AT tungshuyun trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters
AT wangtingfang trichodermareeseicompletegenomesequencerepeatinducedpointmutationandpartitioningofcazymegeneclusters