Cargando…

Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank

We have developed GeneBase, a full parser of the National Center for Biotechnology Information (NCBI) Gene database, which generates a fully structured local database with an intuitive user-friendly graphic interface for personal computers. Features of all the annotated eukaryotic genes are accessib...

Descripción completa

Detalles Bibliográficos
Autores principales: Piovesan, Allison, Caracausi, Maria, Ricci, Marco, Strippoli, Pierluigi, Vitale, Lorenza, Pelleri, Maria Chiara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4675715/
https://www.ncbi.nlm.nih.gov/pubmed/26581719
http://dx.doi.org/10.1093/dnares/dsv028
_version_ 1782405047125016576
author Piovesan, Allison
Caracausi, Maria
Ricci, Marco
Strippoli, Pierluigi
Vitale, Lorenza
Pelleri, Maria Chiara
author_facet Piovesan, Allison
Caracausi, Maria
Ricci, Marco
Strippoli, Pierluigi
Vitale, Lorenza
Pelleri, Maria Chiara
author_sort Piovesan, Allison
collection PubMed
description We have developed GeneBase, a full parser of the National Center for Biotechnology Information (NCBI) Gene database, which generates a fully structured local database with an intuitive user-friendly graphic interface for personal computers. Features of all the annotated eukaryotic genes are accessible through three main software tables, including for each entry details such as the gene summary, the gene exon/intron structure and the specific Gene Ontology attributions. The structuring of the data, the creation of additional calculation fields and the integration with nucleotide sequences allow users to make many types of comparisons and calculations that are useful for data retrieval and analysis. We provide an original example analysis of the existing introns across all the available species, through which the classic biological problem of the ‘minimal intron’ may find a solution using available data. Based on all currently available data, we can define the shortest known eukaryotic GT-AG intron length, setting the physical limit at the 30 base pair intron belonging to the human MST1L gene. This ‘model intron’ will shed light on the minimal requirement elements of recognition used for conventional splicing functioning. Remarkably, this size is indeed consistent with the sum of the splicing consensus sequence lengths.
format Online
Article
Text
id pubmed-4675715
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46757152015-12-11 Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank Piovesan, Allison Caracausi, Maria Ricci, Marco Strippoli, Pierluigi Vitale, Lorenza Pelleri, Maria Chiara DNA Res Full Papers We have developed GeneBase, a full parser of the National Center for Biotechnology Information (NCBI) Gene database, which generates a fully structured local database with an intuitive user-friendly graphic interface for personal computers. Features of all the annotated eukaryotic genes are accessible through three main software tables, including for each entry details such as the gene summary, the gene exon/intron structure and the specific Gene Ontology attributions. The structuring of the data, the creation of additional calculation fields and the integration with nucleotide sequences allow users to make many types of comparisons and calculations that are useful for data retrieval and analysis. We provide an original example analysis of the existing introns across all the available species, through which the classic biological problem of the ‘minimal intron’ may find a solution using available data. Based on all currently available data, we can define the shortest known eukaryotic GT-AG intron length, setting the physical limit at the 30 base pair intron belonging to the human MST1L gene. This ‘model intron’ will shed light on the minimal requirement elements of recognition used for conventional splicing functioning. Remarkably, this size is indeed consistent with the sum of the splicing consensus sequence lengths. Oxford University Press 2015-12 2015-11-17 /pmc/articles/PMC4675715/ /pubmed/26581719 http://dx.doi.org/10.1093/dnares/dsv028 Text en © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Full Papers
Piovesan, Allison
Caracausi, Maria
Ricci, Marco
Strippoli, Pierluigi
Vitale, Lorenza
Pelleri, Maria Chiara
Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title_full Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title_fullStr Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title_full_unstemmed Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title_short Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank
title_sort identification of minimal eukaryotic introns through genebase, a user-friendly tool for parsing the ncbi gene databank
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4675715/
https://www.ncbi.nlm.nih.gov/pubmed/26581719
http://dx.doi.org/10.1093/dnares/dsv028
work_keys_str_mv AT piovesanallison identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank
AT caracausimaria identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank
AT riccimarco identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank
AT strippolipierluigi identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank
AT vitalelorenza identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank
AT pellerimariachiara identificationofminimaleukaryoticintronsthroughgenebaseauserfriendlytoolforparsingthencbigenedatabank