Cargando…

Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum

A correct genome annotation is fundamental for research in the field of molecular and structural biology. The annotation of the reference genome of Chaetomium thermophilum has been reported previously, but it is essentially limited to open reading frames (ORFs) of protein coding genes and contains o...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Amit, Schermann, Géza, Reislöhner, Sven, Kellner, Nikola, Hurt, Ed, Brunner, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535861/
https://www.ncbi.nlm.nih.gov/pubmed/34680944
http://dx.doi.org/10.3390/genes12101549
_version_ 1784587885815529472
author Singh, Amit
Schermann, Géza
Reislöhner, Sven
Kellner, Nikola
Hurt, Ed
Brunner, Michael
author_facet Singh, Amit
Schermann, Géza
Reislöhner, Sven
Kellner, Nikola
Hurt, Ed
Brunner, Michael
author_sort Singh, Amit
collection PubMed
description A correct genome annotation is fundamental for research in the field of molecular and structural biology. The annotation of the reference genome of Chaetomium thermophilum has been reported previously, but it is essentially limited to open reading frames (ORFs) of protein coding genes and contains only a few noncoding transcripts. In this study, we identified and annotated full-length transcripts of C. thermophilum by deep RNA sequencing. We annotated 7044 coding genes and 4567 noncoding genes. Astonishingly, 23% of the coding genes are alternatively spliced. We identified 679 novel coding genes as well as 2878 novel noncoding genes and corrected the structural organization of more than 50% of the previously annotated genes. Furthermore, we substantially extended the Gene Ontology (GO) and Enzyme Commission (EC) lists, which provide comprehensive search tools for potential industrial applications and basic research. The identified novel transcripts and improved annotation will help to understand the gene regulatory landscape in C. thermophilum. The analysis pipeline developed here can be used to build transcriptome assemblies and identify coding and noncoding RNAs of other species.
format Online
Article
Text
id pubmed-8535861
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85358612021-10-23 Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum Singh, Amit Schermann, Géza Reislöhner, Sven Kellner, Nikola Hurt, Ed Brunner, Michael Genes (Basel) Article A correct genome annotation is fundamental for research in the field of molecular and structural biology. The annotation of the reference genome of Chaetomium thermophilum has been reported previously, but it is essentially limited to open reading frames (ORFs) of protein coding genes and contains only a few noncoding transcripts. In this study, we identified and annotated full-length transcripts of C. thermophilum by deep RNA sequencing. We annotated 7044 coding genes and 4567 noncoding genes. Astonishingly, 23% of the coding genes are alternatively spliced. We identified 679 novel coding genes as well as 2878 novel noncoding genes and corrected the structural organization of more than 50% of the previously annotated genes. Furthermore, we substantially extended the Gene Ontology (GO) and Enzyme Commission (EC) lists, which provide comprehensive search tools for potential industrial applications and basic research. The identified novel transcripts and improved annotation will help to understand the gene regulatory landscape in C. thermophilum. The analysis pipeline developed here can be used to build transcriptome assemblies and identify coding and noncoding RNAs of other species. MDPI 2021-09-29 /pmc/articles/PMC8535861/ /pubmed/34680944 http://dx.doi.org/10.3390/genes12101549 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Singh, Amit
Schermann, Géza
Reislöhner, Sven
Kellner, Nikola
Hurt, Ed
Brunner, Michael
Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title_full Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title_fullStr Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title_full_unstemmed Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title_short Global Transcriptome Characterization and Assembly of the Thermophilic Ascomycete Chaetomium thermophilum
title_sort global transcriptome characterization and assembly of the thermophilic ascomycete chaetomium thermophilum
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535861/
https://www.ncbi.nlm.nih.gov/pubmed/34680944
http://dx.doi.org/10.3390/genes12101549
work_keys_str_mv AT singhamit globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum
AT schermanngeza globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum
AT reislohnersven globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum
AT kellnernikola globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum
AT hurted globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum
AT brunnermichael globaltranscriptomecharacterizationandassemblyofthethermophilicascomycetechaetomiumthermophilum