Cargando…

iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition

In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Wei, Feng, Peng-Mian, Lin, Hao, Chou, Kuo-Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055483/
https://www.ncbi.nlm.nih.gov/pubmed/24967386
http://dx.doi.org/10.1155/2014/623149
_version_ 1782320670100684800
author Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
author_facet Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
author_sort Chen, Wei
collection PubMed
description In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called “iSS-PseDNC” was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called “pseudo dinucleotide composition” (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
format Online
Article
Text
id pubmed-4055483
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40554832014-06-25 iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition Chen, Wei Feng, Peng-Mian Lin, Hao Chou, Kuo-Chen Biomed Res Int Research Article In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called “iSS-PseDNC” was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called “pseudo dinucleotide composition” (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing. Hindawi Publishing Corporation 2014 2014-05-21 /pmc/articles/PMC4055483/ /pubmed/24967386 http://dx.doi.org/10.1155/2014/623149 Text en Copyright © 2014 Wei Chen et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title_full iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title_fullStr iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title_full_unstemmed iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title_short iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
title_sort iss-psednc: identifying splicing sites using pseudo dinucleotide composition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055483/
https://www.ncbi.nlm.nih.gov/pubmed/24967386
http://dx.doi.org/10.1155/2014/623149
work_keys_str_mv AT chenwei isspsedncidentifyingsplicingsitesusingpseudodinucleotidecomposition
AT fengpengmian isspsedncidentifyingsplicingsitesusingpseudodinucleotidecomposition
AT linhao isspsedncidentifyingsplicingsitesusingpseudodinucleotidecomposition
AT choukuochen isspsedncidentifyingsplicingsitesusingpseudodinucleotidecomposition