Cargando…

Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss

Evaluating the quality of a de novo annotation of a complex fungal genome based on RNA-seq data remains a challenge. In this study, we sequentially optimized a Cufflinks-CodingQuary-based bioinformatics pipeline fed with RNA-seq data using the manually annotated model pathogenic yeasts Cryptococcus...

Descripción completa

Detalles Bibliográficos
Autores principales: Gröhs Ferrareze, Patrícia Aline, Maufrais, Corinne, Silva Araujo Streit, Rodrigo, Priest, Shelby J, Cuomo, Christina A, Heitman, Joseph, Staats, Charley Christian, Janbon, Guilhem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022950/
https://www.ncbi.nlm.nih.gov/pubmed/33585873
http://dx.doi.org/10.1093/g3journal/jkaa070
_version_ 1783675033409814528
author Gröhs Ferrareze, Patrícia Aline
Maufrais, Corinne
Silva Araujo Streit, Rodrigo
Priest, Shelby J
Cuomo, Christina A
Heitman, Joseph
Staats, Charley Christian
Janbon, Guilhem
author_facet Gröhs Ferrareze, Patrícia Aline
Maufrais, Corinne
Silva Araujo Streit, Rodrigo
Priest, Shelby J
Cuomo, Christina A
Heitman, Joseph
Staats, Charley Christian
Janbon, Guilhem
author_sort Gröhs Ferrareze, Patrícia Aline
collection PubMed
description Evaluating the quality of a de novo annotation of a complex fungal genome based on RNA-seq data remains a challenge. In this study, we sequentially optimized a Cufflinks-CodingQuary-based bioinformatics pipeline fed with RNA-seq data using the manually annotated model pathogenic yeasts Cryptococcus neoformans and Cryptococcus deneoformans as test cases. Our results show that the quality of the annotation is sensitive to the quantity of RNA-seq data used and that the best quality is obtained with 5–10 million reads per RNA-seq replicate. We also showed that the number of introns predicted is an excellent a priori indicator of the quality of the final de novo annotation. We then used this pipeline to annotate the genome of the RNAi-deficient species Cryptococcus deuterogattii strain R265 using RNA-seq data. Dynamic transcriptome analysis revealed that intron retention is more prominent in C. deuterogattii than in the other RNAi-proficient species C. neoformans and C. deneoformans. In contrast, we observed that antisense transcription was not higher in C. deuterogattii than in the two other Cryptococcus species. Comparative gene content analysis identified 21 clusters enriched in transcription factors and transporters that have been lost. Interestingly, analysis of the subtelomeric regions in these three annotated species identified a similar gene enrichment, reminiscent of the structure of primary metabolic clusters. Our data suggest that there is active exchange between subtelomeric regions, and that other chromosomal regions might participate in adaptive diversification of Cryptococcus metabolite assimilation potential.
format Online
Article
Text
id pubmed-8022950
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80229502021-04-09 Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss Gröhs Ferrareze, Patrícia Aline Maufrais, Corinne Silva Araujo Streit, Rodrigo Priest, Shelby J Cuomo, Christina A Heitman, Joseph Staats, Charley Christian Janbon, Guilhem G3 (Bethesda) Fungal Genetics and Genomics Evaluating the quality of a de novo annotation of a complex fungal genome based on RNA-seq data remains a challenge. In this study, we sequentially optimized a Cufflinks-CodingQuary-based bioinformatics pipeline fed with RNA-seq data using the manually annotated model pathogenic yeasts Cryptococcus neoformans and Cryptococcus deneoformans as test cases. Our results show that the quality of the annotation is sensitive to the quantity of RNA-seq data used and that the best quality is obtained with 5–10 million reads per RNA-seq replicate. We also showed that the number of introns predicted is an excellent a priori indicator of the quality of the final de novo annotation. We then used this pipeline to annotate the genome of the RNAi-deficient species Cryptococcus deuterogattii strain R265 using RNA-seq data. Dynamic transcriptome analysis revealed that intron retention is more prominent in C. deuterogattii than in the other RNAi-proficient species C. neoformans and C. deneoformans. In contrast, we observed that antisense transcription was not higher in C. deuterogattii than in the two other Cryptococcus species. Comparative gene content analysis identified 21 clusters enriched in transcription factors and transporters that have been lost. Interestingly, analysis of the subtelomeric regions in these three annotated species identified a similar gene enrichment, reminiscent of the structure of primary metabolic clusters. Our data suggest that there is active exchange between subtelomeric regions, and that other chromosomal regions might participate in adaptive diversification of Cryptococcus metabolite assimilation potential. Oxford University Press 2021-01-11 /pmc/articles/PMC8022950/ /pubmed/33585873 http://dx.doi.org/10.1093/g3journal/jkaa070 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Fungal Genetics and Genomics
Gröhs Ferrareze, Patrícia Aline
Maufrais, Corinne
Silva Araujo Streit, Rodrigo
Priest, Shelby J
Cuomo, Christina A
Heitman, Joseph
Staats, Charley Christian
Janbon, Guilhem
Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title_full Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title_fullStr Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title_full_unstemmed Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title_short Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss
title_sort application of an optimized annotation pipeline to the cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of rnai loss
topic Fungal Genetics and Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022950/
https://www.ncbi.nlm.nih.gov/pubmed/33585873
http://dx.doi.org/10.1093/g3journal/jkaa070
work_keys_str_mv AT grohsferrarezepatriciaaline applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT maufraiscorinne applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT silvaaraujostreitrodrigo applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT priestshelbyj applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT cuomochristinaa applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT heitmanjoseph applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT staatscharleychristian applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss
AT janbonguilhem applicationofanoptimizedannotationpipelinetothecryptococcusdeuterogattiigenomerevealsdynamicprimarymetabolicgeneclustersandgenomicimpactofrnailoss