
LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis

Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease of high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two public reference annotations still present at least 80% of the gen...


Bibliographic Details
Main authors: Torres, Felipe, Arias-Carrasco, Raúl, Caris-Maldonado, José C., Barral, Aldina, Maracaja-Coutinho, Vinicius, De Queiroz, Artur T. L.
Format: Online Article Text
Language: English
Published: Oxford University Press 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502370/
https://www.ncbi.nlm.nih.gov/pubmed/29220437
http://dx.doi.org/10.1093/database/bax047
_version_ 1783248943635759104
author Torres, Felipe
Arias-Carrasco, Raúl
Caris-Maldonado, José C.
Barral, Aldina
Maracaja-Coutinho, Vinicius
De Queiroz, Artur T. L.
author_facet Torres, Felipe
Arias-Carrasco, Raúl
Caris-Maldonado, José C.
Barral, Aldina
Maracaja-Coutinho, Vinicius
De Queiroz, Artur T. L.
author_sort Torres, Felipe
collection PubMed
description Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease of high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two public reference annotations still present at least 80% of the genes simply classified as hypothetical or putative proteins. Furthermore, the absence of non-coding RNA (ncRNA) sequences from Leishmania species in public databases is notable. These poorly annotated coding genes and ncRNAs could be important for understanding this protozoan's biology, the mechanisms behind host-parasite interactions and disease control. Herein, we performed a new prediction and annotation of L. braziliensis protein-coding genes and non-coding RNAs, using recently developed predictive algorithms and updated databases. In summary, we identified 11 491 ORFs, with 5263 (45.80%) of them associated with proteins available in public databases. Moreover, we identified for the first time a repertoire of 11 243 ncRNAs belonging to different classes distributed along the genome. The accuracy of our predictions was verified by transcriptional evidence from RNA-seq, confirming that the predicted loci generate real transcripts. These data were organized in a public repository named LeishDB (www.leishdb.com), which represents an improvement on the publicly available data related to genomic annotation for L. braziliensis. This updated information can be useful for future genomics, transcriptomics and metabolomics studies, serving as an additional tool for genome annotation pipelines and for novel studies aimed at understanding this protozoan's genome complexity, organization and biology, and at developing innovative methodologies for disease control and diagnostics. Database URL: www.leishdb.com
format Online
Article
Text
id pubmed-5502370
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5502370 2017-07-17 LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis Torres, Felipe Arias-Carrasco, Raúl Caris-Maldonado, José C. Barral, Aldina Maracaja-Coutinho, Vinicius De Queiroz, Artur T. L. Database (Oxford) Database Tool Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease of high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two public reference annotations still present at least 80% of the genes simply classified as hypothetical or putative proteins. Furthermore, the absence of non-coding RNA (ncRNA) sequences from Leishmania species in public databases is notable. These poorly annotated coding genes and ncRNAs could be important for understanding this protozoan's biology, the mechanisms behind host-parasite interactions and disease control. Herein, we performed a new prediction and annotation of L. braziliensis protein-coding genes and non-coding RNAs, using recently developed predictive algorithms and updated databases. In summary, we identified 11 491 ORFs, with 5263 (45.80%) of them associated with proteins available in public databases. Moreover, we identified for the first time a repertoire of 11 243 ncRNAs belonging to different classes distributed along the genome. The accuracy of our predictions was verified by transcriptional evidence from RNA-seq, confirming that the predicted loci generate real transcripts. These data were organized in a public repository named LeishDB (www.leishdb.com), which represents an improvement on the publicly available data related to genomic annotation for L. braziliensis.
This updated information can be useful for future genomics, transcriptomics and metabolomics studies, serving as an additional tool for genome annotation pipelines and for novel studies aimed at understanding this protozoan's genome complexity, organization and biology, and at developing innovative methodologies for disease control and diagnostics. Database URL: www.leishdb.com Oxford University Press 2017-06-13 /pmc/articles/PMC5502370/ /pubmed/29220437 http://dx.doi.org/10.1093/database/bax047 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Torres, Felipe
Arias-Carrasco, Raúl
Caris-Maldonado, José C.
Barral, Aldina
Maracaja-Coutinho, Vinicius
De Queiroz, Artur T. L.
LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title_full LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title_fullStr LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title_full_unstemmed LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title_short LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
title_sort leishdb: a database of coding gene annotation and non-coding rnas in leishmania braziliensis
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502370/
https://www.ncbi.nlm.nih.gov/pubmed/29220437
http://dx.doi.org/10.1093/database/bax047
work_keys_str_mv AT torresfelipe leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis
AT ariascarrascoraul leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis
AT carismaldonadojosec leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis
AT barralaldina leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis
AT maracajacoutinhovinicius leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis
AT dequeirozarturtl leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis