DeORFanizing Candida albicans Genes using Coexpression

Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive. To meet this challenge, we built a comprehensive and unbiased coexpression network for C. albicans, which we call CalCEN, from data col...

Bibliographic Details
Main Authors: O’Meara, Teresa R., O’Meara, Matthew J.
Format: Online Article Text
Language: English
Published: American Society for Microbiology 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845621/
https://www.ncbi.nlm.nih.gov/pubmed/33472984
http://dx.doi.org/10.1128/mSphere.01245-20
author O’Meara, Teresa R.
O’Meara, Matthew J.
collection PubMed
description Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive. To meet this challenge, we built a comprehensive and unbiased coexpression network for C. albicans, which we call CalCEN, from data collected from 853 RNA sequencing runs from 18 large-scale studies deposited in the NCBI Sequence Read Archive. Retrospectively, CalCEN is highly predictive of known gene function annotations and can be synergistically combined with sequence similarity and interaction networks in Saccharomyces cerevisiae through orthology for additional accuracy in gene function prediction. To prospectively demonstrate the utility of the coexpression network in C. albicans, we predicted the function of underannotated open reading frames (ORFs) and identified CCJ1 as a novel cell cycle regulator in C. albicans. This study provides a tool for future systems biology analyses of gene function in C. albicans. We provide a computational pipeline for building and analyzing the coexpression network and CalCEN itself at http://github.com/momeara/CalCEN. IMPORTANCE Candida albicans is a common and deadly fungal pathogen of humans, yet the genome of this organism contains many genes of unknown function. By determining gene function, we can help identify essential genes, new virulence factors, or new regulators of drug resistance, and thereby give new targets for antifungal development. Here, we use information from large-scale RNA sequencing (RNAseq) studies and generate a C. albicans coexpression network (CalCEN) that is robust and able to predict gene function. We demonstrate the utility of this network in both retrospective and prospective testing and use CalCEN to predict a role for C4_06590W/CCJ1 in cell cycle. This tool will allow for a better characterization of underannotated genes in pathogenic yeasts.
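The coexpression idea summarized in the description can be illustrated with a small sketch. This is not the CalCEN pipeline: it assumes Pearson correlation as the coexpression score (the record does not state which measure the authors used), and the gene names and expression values below are invented toy data. It only shows the general "guilt by association" step, in which an unannotated ORF is compared against annotated genes by how closely their expression profiles track each other across sequencing runs.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no coexpression signal
    return cov / sqrt(var_x * var_y)


def coexpression_network(expression):
    """Map each unordered gene pair to the correlation of their profiles."""
    return {
        frozenset((g1, g2)): pearson(expression[g1], expression[g2])
        for g1, g2 in combinations(sorted(expression), 2)
    }


def rank_neighbors(network, gene):
    """List the other genes by decreasing coexpression with `gene`."""
    scores = []
    for pair, r in network.items():
        if gene in pair:
            (other,) = pair - {gene}
            scores.append((other, r))
    return sorted(scores, key=lambda item: item[1], reverse=True)


# Hypothetical toy data: names and values are assumptions for illustration,
# one value per (imaginary) RNA sequencing run.
toy_expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 6.2, 7.9],  # rises with geneA
    "geneC": [4.0, 3.0, 2.0, 1.0],  # falls as geneA rises
    "orfX": [1.1, 2.1, 2.9, 4.2],   # unannotated ORF that tracks geneA
}
network = coexpression_network(toy_expression)
ranked = rank_neighbors(network, "orfX")
top_neighbor = ranked[0][0]
```

In a setting like the one described, the annotations of the highest-ranked neighbors would suggest candidate functions for the unannotated ORF; a real analysis would use many more runs and would need normalization and evaluation steps that this sketch omits.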
format Online
Article
Text
id pubmed-7845621
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-7845621 2021-01-29 DeORFanizing Candida albicans Genes using Coexpression O’Meara, Teresa R. O’Meara, Matthew J. mSphere Research Article American Society for Microbiology 2021-01-20 /pmc/articles/PMC7845621/ /pubmed/33472984 http://dx.doi.org/10.1128/mSphere.01245-20 Text en Copyright © 2021 O’Meara and O’Meara. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/).
title DeORFanizing Candida albicans Genes using Coexpression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845621/
https://www.ncbi.nlm.nih.gov/pubmed/33472984
http://dx.doi.org/10.1128/mSphere.01245-20