Cargando…

From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon

Numerous genomes are sequenced and made available to the community through the NCBI portal. However, and, unlike what happens for gene function annotation, annotation of promoter sequences and the underlying prediction of regulatory associations is mostly unavailable, severely limiting the ability t...

Descripción completa

Detalles Bibliográficos
Autores principales: Oliveira, Jorge, Antunes, Miguel, Godinho, Claudia P., Teixeira, Miguel C., Sá-Correia, Isabel, Monteiro, Pedro T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8353774/
https://www.ncbi.nlm.nih.gov/pubmed/34376148
http://dx.doi.org/10.1186/s12859-021-04312-3
_version_ 1783736472285741056
author Oliveira, Jorge
Antunes, Miguel
Godinho, Claudia P.
Teixeira, Miguel C.
Sá-Correia, Isabel
Monteiro, Pedro T.
author_facet Oliveira, Jorge
Antunes, Miguel
Godinho, Claudia P.
Teixeira, Miguel C.
Sá-Correia, Isabel
Monteiro, Pedro T.
author_sort Oliveira, Jorge
collection PubMed
description Numerous genomes are sequenced and made available to the community through the NCBI portal. However, and, unlike what happens for gene function annotation, annotation of promoter sequences and the underlying prediction of regulatory associations is mostly unavailable, severely limiting the ability to interpret genome sequences in a functional genomics perspective. Here we present an approach where one can download a genome of interest from NCBI in the GenBank Flat File (.gbff) format and, with a minimum set of commands, have all the information parsed, organized and made available through the platform web interface. Also, the new genomes are compared with a given genome of reference in search of homologous genes, shared regulatory elements and predicted transcription associations. We present this approach within the context of Community YEASTRACT of the YEASTRACT + portal, thus benefiting from immediate access to all the comparative genomics queries offered in the YEASTRACT + portal. Besides the yeast community, other communities can install the platform independently, without any constraints. In this work, we exemplify the usefulness of the presented tool, within Community YEASTRACT, in constructing a dedicated database and analysing the genome of the highly promising oleaginous red yeast species Rhodotorula toruloides currently poorly studied at the genome and transcriptome levels and with limited genome editing tools. Regulatory prediction is based on the conservation of promoter sequences and available regulatory networks. The case-study examined is focused on the Haa1 transcription factor—a key regulator of yeast resistance to acetic acid, an important inhibitor of industrial bioconversion of lignocellulosic hydrolysates. The new tool described here led to the prediction of a RtHaa1 regulon with expected impact in the optimization of R. toruloides robustness for lignocellulosic and pectin-rich residue biorefinery processes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04312-3.
format Online
Article
Text
id pubmed-8353774
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-83537742021-08-10 From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon Oliveira, Jorge Antunes, Miguel Godinho, Claudia P. Teixeira, Miguel C. Sá-Correia, Isabel Monteiro, Pedro T. BMC Bioinformatics Methodology Article Numerous genomes are sequenced and made available to the community through the NCBI portal. However, and, unlike what happens for gene function annotation, annotation of promoter sequences and the underlying prediction of regulatory associations is mostly unavailable, severely limiting the ability to interpret genome sequences in a functional genomics perspective. Here we present an approach where one can download a genome of interest from NCBI in the GenBank Flat File (.gbff) format and, with a minimum set of commands, have all the information parsed, organized and made available through the platform web interface. Also, the new genomes are compared with a given genome of reference in search of homologous genes, shared regulatory elements and predicted transcription associations. We present this approach within the context of Community YEASTRACT of the YEASTRACT + portal, thus benefiting from immediate access to all the comparative genomics queries offered in the YEASTRACT + portal. Besides the yeast community, other communities can install the platform independently, without any constraints. In this work, we exemplify the usefulness of the presented tool, within Community YEASTRACT, in constructing a dedicated database and analysing the genome of the highly promising oleaginous red yeast species Rhodotorula toruloides currently poorly studied at the genome and transcriptome levels and with limited genome editing tools. Regulatory prediction is based on the conservation of promoter sequences and available regulatory networks. The case-study examined is focused on the Haa1 transcription factor—a key regulator of yeast resistance to acetic acid, an important inhibitor of industrial bioconversion of lignocellulosic hydrolysates. The new tool described here led to the prediction of a RtHaa1 regulon with expected impact in the optimization of R. toruloides robustness for lignocellulosic and pectin-rich residue biorefinery processes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04312-3. BioMed Central 2021-08-10 /pmc/articles/PMC8353774/ /pubmed/34376148 http://dx.doi.org/10.1186/s12859-021-04312-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Oliveira, Jorge
Antunes, Miguel
Godinho, Claudia P.
Teixeira, Miguel C.
Sá-Correia, Isabel
Monteiro, Pedro T.
From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title_full From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title_fullStr From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title_full_unstemmed From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title_short From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon
title_sort from a genome assembly to full regulatory network prediction: the case study of rhodotorula toruloides putative haa1-regulon
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8353774/
https://www.ncbi.nlm.nih.gov/pubmed/34376148
http://dx.doi.org/10.1186/s12859-021-04312-3
work_keys_str_mv AT oliveirajorge fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon
AT antunesmiguel fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon
AT godinhoclaudiap fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon
AT teixeiramiguelc fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon
AT sacorreiaisabel fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon
AT monteiropedrot fromagenomeassemblytofullregulatorynetworkpredictionthecasestudyofrhodotorulatoruloidesputativehaa1regulon