LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis
Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease with high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two reference public annotations still present at least 80% of the gen...
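Main authors: | Torres, Felipe; Arias-Carrasco, Raúl; Caris-Maldonado, José C.; Barral, Aldina; Maracaja-Coutinho, Vinicius; De Queiroz, Artur T. L. |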
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2017 |
Subjects: | Database Tool |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502370/ https://www.ncbi.nlm.nih.gov/pubmed/29220437 http://dx.doi.org/10.1093/database/bax047 |
_version_ | 1783248943635759104 |
---|---|
author | Torres, Felipe Arias-Carrasco, Raúl Caris-Maldonado, José C. Barral, Aldina Maracaja-Coutinho, Vinicius De Queiroz, Artur T. L. |
author_facet | Torres, Felipe Arias-Carrasco, Raúl Caris-Maldonado, José C. Barral, Aldina Maracaja-Coutinho, Vinicius De Queiroz, Artur T. L. |
author_sort | Torres, Felipe |
collection | PubMed |
description | Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease with high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two reference public annotations still present at least 80% of the genes simply classified as hypothetical or putative proteins. Furthermore, the absence of non-coding RNA (ncRNA) sequences from Leishmania species in public databases is notable. These poorly annotated coding genes and ncRNAs could be important players for understanding this protozoan's biology, the mechanisms behind host-parasite interactions and disease control. Herein, we performed a new prediction and annotation of L. braziliensis protein-coding genes and non-coding RNAs, using recently developed predictive algorithms and updated databases. In summary, we identified 11 491 ORFs, with 5263 (45.80%) of them associated with proteins available in public databases. Moreover, we identified for the first time the repertoire of 11 243 ncRNAs belonging to different classes distributed along the genome. The accuracy of our predictions was verified by transcriptional evidence using RNA-seq, confirming that they generate real transcripts. These data were organized in a public repository named LeishDB (www.leishdb.com), which represents an improvement on the publicly available data related to genomic annotation for L. braziliensis. This updated information can be useful for future genomics, transcriptomics and metabolomics studies, serving as an additional tool for genome annotation pipelines and novel studies associated with the understanding of this protozoan's genome complexity, organization and biology, and with the development of innovative methodologies for disease control and diagnostics. Database URL: www.leishdb.com |
format | Online Article Text |
id | pubmed-5502370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5502370 2017-07-17 LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis Torres, Felipe Arias-Carrasco, Raúl Caris-Maldonado, José C. Barral, Aldina Maracaja-Coutinho, Vinicius De Queiroz, Artur T. L. Database (Oxford) Database Tool Leishmania braziliensis is the etiological agent of cutaneous leishmaniasis, a disease with high public health importance, affecting 12 million people worldwide. Although its genome sequence was originally published in 2007, the two reference public annotations still present at least 80% of the genes simply classified as hypothetical or putative proteins. Furthermore, the absence of non-coding RNA (ncRNA) sequences from Leishmania species in public databases is notable. These poorly annotated coding genes and ncRNAs could be important players for understanding this protozoan's biology, the mechanisms behind host-parasite interactions and disease control. Herein, we performed a new prediction and annotation of L. braziliensis protein-coding genes and non-coding RNAs, using recently developed predictive algorithms and updated databases. In summary, we identified 11 491 ORFs, with 5263 (45.80%) of them associated with proteins available in public databases. Moreover, we identified for the first time the repertoire of 11 243 ncRNAs belonging to different classes distributed along the genome. The accuracy of our predictions was verified by transcriptional evidence using RNA-seq, confirming that they generate real transcripts. These data were organized in a public repository named LeishDB (www.leishdb.com), which represents an improvement on the publicly available data related to genomic annotation for L. braziliensis. This updated information can be useful for future genomics, transcriptomics and metabolomics studies, serving as an additional tool for genome annotation pipelines and novel studies associated with the understanding of this protozoan's genome complexity, organization and biology, and with the development of innovative methodologies for disease control and diagnostics. Database URL: www.leishdb.com Oxford University Press 2017-06-13 /pmc/articles/PMC5502370/ /pubmed/29220437 http://dx.doi.org/10.1093/database/bax047 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Tool Torres, Felipe Arias-Carrasco, Raúl Caris-Maldonado, José C. Barral, Aldina Maracaja-Coutinho, Vinicius De Queiroz, Artur T. L. LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title | LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title_full | LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title_fullStr | LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title_full_unstemmed | LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title_short | LeishDB: a database of coding gene annotation and non-coding RNAs in Leishmania braziliensis |
title_sort | leishdb: a database of coding gene annotation and non-coding rnas in leishmania braziliensis |
topic | Database Tool |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502370/ https://www.ncbi.nlm.nih.gov/pubmed/29220437 http://dx.doi.org/10.1093/database/bax047 |
work_keys_str_mv | AT torresfelipe leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis AT ariascarrascoraul leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis AT carismaldonadojosec leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis AT barralaldina leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis AT maracajacoutinhovinicius leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis AT dequeirozarturtl leishdbadatabaseofcodinggeneannotationandnoncodingrnasinleishmaniabraziliensis |