Cargando…
WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions
Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance wi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181902/ https://www.ncbi.nlm.nih.gov/pubmed/25313355 http://dx.doi.org/10.1155/2014/282343 |
_version_ | 1782337441642840064 |
---|---|
author | Vilardell, Mireia Parra, Genis Civit, Sergi |
author_facet | Vilardell, Mireia Parra, Genis Civit, Sergi |
author_sort | Vilardell, Mireia |
collection | PubMed |
description | Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance with additional information from related species or/and cDNA databases, further improvement at any step could help to obtain better predictions. Here, we present WISCOD, a web-enabled tool for the identification of significant protein coding regions, a novel software tool that tackles the exon prediction problem in eukaryotic genomes. WISCOD has the capacity to detect real exons from large lists of potential exons, and it provides an easy way to use global P value called expected probability of being a false exon (EPFE) that is useful for ranking potential exons in a probabilistic framework, without additional computational costs. The advantage of our approach is that it significantly increases the specificity and sensitivity (both between 80% and 90%) in comparison to other ab initio methods (where they are in the range of 70–75%). WISCOD is written in JAVA and R and is available to download and to run in a local mode on Linux and Windows platforms. |
format | Online Article Text |
id | pubmed-4181902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-41819022014-10-13 WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions Vilardell, Mireia Parra, Genis Civit, Sergi Biomed Res Int Research Article Classically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance with additional information from related species or/and cDNA databases, further improvement at any step could help to obtain better predictions. Here, we present WISCOD, a web-enabled tool for the identification of significant protein coding regions, a novel software tool that tackles the exon prediction problem in eukaryotic genomes. WISCOD has the capacity to detect real exons from large lists of potential exons, and it provides an easy way to use global P value called expected probability of being a false exon (EPFE) that is useful for ranking potential exons in a probabilistic framework, without additional computational costs. The advantage of our approach is that it significantly increases the specificity and sensitivity (both between 80% and 90%) in comparison to other ab initio methods (where they are in the range of 70–75%). WISCOD is written in JAVA and R and is available to download and to run in a local mode on Linux and Windows platforms. Hindawi Publishing Corporation 2014 2014-09-15 /pmc/articles/PMC4181902/ /pubmed/25313355 http://dx.doi.org/10.1155/2014/282343 Text en Copyright © 2014 Mireia Vilardell et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Vilardell, Mireia Parra, Genis Civit, Sergi WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title | WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title_full | WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title_fullStr | WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title_full_unstemmed | WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title_short | WISCOD: A Statistical Web-Enabled Tool for the Identification of Significant Protein Coding Regions |
title_sort | wiscod: a statistical web-enabled tool for the identification of significant protein coding regions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181902/ https://www.ncbi.nlm.nih.gov/pubmed/25313355 http://dx.doi.org/10.1155/2014/282343 |
work_keys_str_mv | AT vilardellmireia wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions AT parragenis wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions AT civitsergi wiscodastatisticalwebenabledtoolfortheidentificationofsignificantproteincodingregions |