ParaDB: A manually curated database containing genomic annotation for the human pathogenic fungi Paracoccidioides spp.

BACKGROUND: The genus Paracoccidioides consists of thermodimorphic fungi responsible for paracoccidioidomycosis (PCM), a systemic mycosis reported to affect ~10 million people in Latin America. Biogeographical data subdivided the genus Paracoccidioides into five divergent subgroups, wh...

Bibliographic Details
Main authors: Aciole Barbosa, David, Menegidio, Fabiano Bezerra, Alencar, Valquíria Campos, Gonçalves, Rafael S., Silva, Juliana de Fátima Santos, Vilas Boas, Renata Ozelami, Faustino de Maria, Yara Natércia Lima, Jabes, Daniela Leite, Costa de Oliveira, Regina, Nunes, Luiz R.
Format: Online Article Text
Language: English
Published: Public Library of Science 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658007/
https://www.ncbi.nlm.nih.gov/pubmed/31306428
http://dx.doi.org/10.1371/journal.pntd.0007576
author Aciole Barbosa, David
Menegidio, Fabiano Bezerra
Alencar, Valquíria Campos
Gonçalves, Rafael S.
Silva, Juliana de Fátima Santos
Vilas Boas, Renata Ozelami
Faustino de Maria, Yara Natércia Lima
Jabes, Daniela Leite
Costa de Oliveira, Regina
Nunes, Luiz R.
author_sort Aciole Barbosa, David
collection PubMed
description BACKGROUND: The genus Paracoccidioides consists of thermodimorphic fungi responsible for paracoccidioidomycosis (PCM), a systemic mycosis reported to affect ~10 million people in Latin America. Biogeographical data subdivided the genus Paracoccidioides into five divergent subgroups, which have recently been classified as different species. Genomic sequencing of five Paracoccidioides isolates, representing each of these subgroups/species, provided an important framework for the development of post-genomic studies with these fungi. However, functional annotations of these genomes have not undergone manual curation and, as a result, ~60–90% of the Paracoccidioides protein-coding genes (depending on isolate/annotation) are currently described as encoding hypothetical proteins, without any further functional/structural description. PRINCIPAL FINDINGS: The present work reviews the functional assignment of Paracoccidioides genes, reducing the proportion of hypothetical proteins to ~25–28%. These results were compiled in a relational database called ParaDB, dedicated to the main representatives of Paracoccidioides spp. ParaDB can be accessed through a user-friendly graphical interface, which offers search tools based on keywords or protein/DNA sequences. All data contained in ParaDB can be partially or completely downloaded as spreadsheet, multi-FASTA and GFF3-formatted files, which can subsequently be used in a variety of downstream functional analyses. Moreover, the entire ParaDB environment has been configured as a Docker service and deposited in a GitHub repository, ensuring long-term data availability to researchers. This service can be downloaded and used to perform fully functional local installations of the database in alternative computing ecosystems, allowing users to conduct their data mining and analyses in a personal and stable working environment.
CONCLUSIONS: These new annotations greatly reduce the number of genes identified solely as hypothetical proteins and are integrated into a dedicated database, providing resources to help researchers in this field conduct post-genomic studies with this group of human pathogenic fungi.
format Online
Article
Text
id pubmed-6658007
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6658007 2019-08-06 ParaDB: A manually curated database containing genomic annotation for the human pathogenic fungi Paracoccidioides spp. Aciole Barbosa, David Menegidio, Fabiano Bezerra Alencar, Valquíria Campos Gonçalves, Rafael S. Silva, Juliana de Fátima Santos Vilas Boas, Renata Ozelami Faustino de Maria, Yara Natércia Lima Jabes, Daniela Leite Costa de Oliveira, Regina Nunes, Luiz R. PLoS Negl Trop Dis Research Article BACKGROUND: The genus Paracoccidioides consists of thermodimorphic fungi responsible for paracoccidioidomycosis (PCM), a systemic mycosis reported to affect ~10 million people in Latin America. Biogeographical data subdivided the genus Paracoccidioides into five divergent subgroups, which have recently been classified as different species. Genomic sequencing of five Paracoccidioides isolates, representing each of these subgroups/species, provided an important framework for the development of post-genomic studies with these fungi. However, functional annotations of these genomes have not undergone manual curation and, as a result, ~60–90% of the Paracoccidioides protein-coding genes (depending on isolate/annotation) are currently described as encoding hypothetical proteins, without any further functional/structural description. PRINCIPAL FINDINGS: The present work reviews the functional assignment of Paracoccidioides genes, reducing the proportion of hypothetical proteins to ~25–28%. These results were compiled in a relational database called ParaDB, dedicated to the main representatives of Paracoccidioides spp. ParaDB can be accessed through a user-friendly graphical interface, which offers search tools based on keywords or protein/DNA sequences. All data contained in ParaDB can be partially or completely downloaded as spreadsheet, multi-FASTA and GFF3-formatted files, which can subsequently be used in a variety of downstream functional analyses.
Moreover, the entire ParaDB environment has been configured as a Docker service and deposited in a GitHub repository, ensuring long-term data availability to researchers. This service can be downloaded and used to perform fully functional local installations of the database in alternative computing ecosystems, allowing users to conduct their data mining and analyses in a personal and stable working environment. CONCLUSIONS: These new annotations greatly reduce the number of genes identified solely as hypothetical proteins and are integrated into a dedicated database, providing resources to help researchers in this field conduct post-genomic studies with this group of human pathogenic fungi. Public Library of Science 2019-07-15 /pmc/articles/PMC6658007/ /pubmed/31306428 http://dx.doi.org/10.1371/journal.pntd.0007576 Text en © 2019 Aciole Barbosa et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title ParaDB: A manually curated database containing genomic annotation for the human pathogenic fungi Paracoccidioides spp.
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658007/
https://www.ncbi.nlm.nih.gov/pubmed/31306428
http://dx.doi.org/10.1371/journal.pntd.0007576