Cargando…

SinEx DB: a database for single exon coding sequences in mammalian genomes

Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as ‘single exon genes’ (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in...

Descripción completa

Detalles Bibliográficos
Autores principales: Jorquera, Roddy, Ortiz, Rodrigo, Ossandon, F., Cárdenas, Juan Pablo, Sepúlveda, Rene, González, Carolina, Holmes, David S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4897596/
https://www.ncbi.nlm.nih.gov/pubmed/27278816
http://dx.doi.org/10.1093/database/baw095
_version_ 1782436194476359680
author Jorquera, Roddy
Ortiz, Rodrigo
Ossandon, F.
Cárdenas, Juan Pablo
Sepúlveda, Rene
González, Carolina
Holmes, David S.
author_facet Jorquera, Roddy
Ortiz, Rodrigo
Ossandon, F.
Cárdenas, Juan Pablo
Sepúlveda, Rene
González, Carolina
Holmes, David S.
author_sort Jorquera, Roddy
collection PubMed
description Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as ‘single exon genes’ (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs. Database URL: www.sinex.cl
format Online
Article
Text
id pubmed-4897596
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48975962016-06-10 SinEx DB: a database for single exon coding sequences in mammalian genomes Jorquera, Roddy Ortiz, Rodrigo Ossandon, F. Cárdenas, Juan Pablo Sepúlveda, Rene González, Carolina Holmes, David S. Database (Oxford) Database Tool Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as ‘single exon genes’ (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs. Database URL: www.sinex.cl Oxford University Press 2016-06-07 /pmc/articles/PMC4897596/ /pubmed/27278816 http://dx.doi.org/10.1093/database/baw095 Text en © The Author(s) 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Jorquera, Roddy
Ortiz, Rodrigo
Ossandon, F.
Cárdenas, Juan Pablo
Sepúlveda, Rene
González, Carolina
Holmes, David S.
SinEx DB: a database for single exon coding sequences in mammalian genomes
title SinEx DB: a database for single exon coding sequences in mammalian genomes
title_full SinEx DB: a database for single exon coding sequences in mammalian genomes
title_fullStr SinEx DB: a database for single exon coding sequences in mammalian genomes
title_full_unstemmed SinEx DB: a database for single exon coding sequences in mammalian genomes
title_short SinEx DB: a database for single exon coding sequences in mammalian genomes
title_sort sinex db: a database for single exon coding sequences in mammalian genomes
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4897596/
https://www.ncbi.nlm.nih.gov/pubmed/27278816
http://dx.doi.org/10.1093/database/baw095
work_keys_str_mv AT jorqueraroddy sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT ortizrodrigo sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT ossandonf sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT cardenasjuanpablo sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT sepulvedarene sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT gonzalezcarolina sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes
AT holmesdavids sinexdbadatabaseforsingleexoncodingsequencesinmammaliangenomes