Cargando…

A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions

Computer-aided protein-coding gene prediction in uncharacterized genomic DNA sequences is one of the most important issues of biological signal processing. A modified filter method based on a statistically optimal null filter (SONF) theory is proposed for recognizing protein-coding regions. The squa...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lei, Tian, Fengchun, Wang, Shiyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054498/
https://www.ncbi.nlm.nih.gov/pubmed/22917190
http://dx.doi.org/10.1016/j.gpb.2012.02.001
_version_ 1782458612617052160
author Zhang, Lei
Tian, Fengchun
Wang, Shiyuan
author_facet Zhang, Lei
Tian, Fengchun
Wang, Shiyuan
author_sort Zhang, Lei
collection PubMed
description Computer-aided protein-coding gene prediction in uncharacterized genomic DNA sequences is one of the most important issues of biological signal processing. A modified filter method based on a statistically optimal null filter (SONF) theory is proposed for recognizing protein-coding regions. The square deviation gain (SDG) between the input and output of the model is used to identify the coding regions. The effective SDG amplification model with Class I and Class II enhancement is designed to suppress the non-coding regions. Also, an evaluation algorithm has been used to compare the modified model with most gene prediction methods currently available in terms of sensitivity, specificity and precision. The performance for identification of protein-coding regions has been evaluated at the nucleotide level using benchmark datasets and 91.4%, 96%, 93.7% were obtained for sensitivity, specificity and precision, respectively. These results suggest that the proposed model is potentially useful in gene finding field, which can help recognize protein-coding regions with higher precision and speed than present algorithms.
format Online
Article
Text
id pubmed-5054498
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-50544982016-10-14 A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions Zhang, Lei Tian, Fengchun Wang, Shiyuan Genomics Proteomics Bioinformatics Original Research Computer-aided protein-coding gene prediction in uncharacterized genomic DNA sequences is one of the most important issues of biological signal processing. A modified filter method based on a statistically optimal null filter (SONF) theory is proposed for recognizing protein-coding regions. The square deviation gain (SDG) between the input and output of the model is used to identify the coding regions. The effective SDG amplification model with Class I and Class II enhancement is designed to suppress the non-coding regions. Also, an evaluation algorithm has been used to compare the modified model with most gene prediction methods currently available in terms of sensitivity, specificity and precision. The performance for identification of protein-coding regions has been evaluated at the nucleotide level using benchmark datasets and 91.4%, 96%, 93.7% were obtained for sensitivity, specificity and precision, respectively. These results suggest that the proposed model is potentially useful in gene finding field, which can help recognize protein-coding regions with higher precision and speed than present algorithms. Elsevier 2012-06 2012-06-18 /pmc/articles/PMC5054498/ /pubmed/22917190 http://dx.doi.org/10.1016/j.gpb.2012.02.001 Text en © 2012 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Published by Elsevier Ltd and Science Press. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Original Research
Zhang, Lei
Tian, Fengchun
Wang, Shiyuan
A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title_full A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title_fullStr A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title_full_unstemmed A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title_short A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions
title_sort modified statistically optimal null filter method for recognizing protein-coding regions
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054498/
https://www.ncbi.nlm.nih.gov/pubmed/22917190
http://dx.doi.org/10.1016/j.gpb.2012.02.001
work_keys_str_mv AT zhanglei amodifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions
AT tianfengchun amodifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions
AT wangshiyuan amodifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions
AT zhanglei modifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions
AT tianfengchun modifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions
AT wangshiyuan modifiedstatisticallyoptimalnullfiltermethodforrecognizingproteincodingregions