Cargando…

WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions

Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Vilardell, Mireia, Parra, Genis, Civit, Sergi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181902/
https://www.ncbi.nlm.nih.gov/pubmed/25313355
http://dx.doi.org/10.1155/2014/282343
_version_ 1782337441642840064
author Vilardell, Mireia
Parra, Genis
Civit, Sergi
author_facet Vilardell, Mireia
Parra, Genis
Civit, Sergi
author_sort Vilardell, Mireia
collection PubMed
description Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance with additional information from related species or/and cDNA databases, further improvement at any step could help to obtain better predictions. Here, we present WISCOD, a web-enabled tool for the identification of significant protein coding regions, a novel software tool that tackles the exon prediction problem in eukaryotic genomes. WISCOD has the capacity to detect real exons from large lists of potential exons, and it provides an easy way to use global P value called expected probability of being a false exon (EPFE) that is useful for ranking potential exons in a probabilistic framework, without additional computational costs. The advantage of our approach is that it significantly increases the specificity and sensitivity (both between 80% and 90%) in comparison to other ab initio methods (where they are in the range of 70–75%). WISCOD is written in JAVA and R and is available to download and to run in a local mode on Linux and Windows platforms.
format Online
Article
Text
id pubmed-4181902
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41819022014-10-13 WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions Vilardell, Mireia Parra, Genis Civit, Sergi Biomed Res Int Research Article Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance with additional information from related species or/and cDNA databases, further improvement at any step could help to obtain better predictions. Here, we present WISCOD, a web-enabled tool for the identification of significant protein coding regions, a novel software tool that tackles the exon prediction problem in eukaryotic genomes. WISCOD has the capacity to detect real exons from large lists of potential exons, and it provides an easy way to use global P value called expected probability of being a false exon (EPFE) that is useful for ranking potential exons in a probabilistic framework, without additional computational costs. The advantage of our approach is that it significantly increases the specificity and sensitivity (both between 80% and 90%) in comparison to other ab initio methods (where they are in the range of 70–75%). WISCOD is written in JAVA and R and is available to download and to run in a local mode on Linux and Windows platforms. Hindawi Publishing Corporation 2014 2014-09-15 /pmc/articles/PMC4181902/ /pubmed/25313355 http://dx.doi.org/10.1155/2014/282343 Text en Copyright © 2014 Mireia Vilardell et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Vilardell, Mireia
Parra, Genis
Civit, Sergi
WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title_full WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title_fullStr WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title_full_unstemmed WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title_short WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
title_sort wiscod: a statistical web-enabled tool for the identification of significant protein coding regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181902/
https://www.ncbi.nlm.nih.gov/pubmed/25313355
http://dx.doi.org/10.1155/2014/282343
work_keys_str_mv AT vilardellmireia wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions
AT parragenis wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions
AT civitsergi wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions