Cargando…

IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction

Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Pokkuluri, Kiran Sree, Inampudi, Ramesh Babu, Nedunuri, S. S. S. N. Usha Devi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4123571/
https://www.ncbi.nlm.nih.gov/pubmed/25132849
http://dx.doi.org/10.1155/2014/261362
_version_ 1782329510972096512
author Pokkuluri, Kiran Sree
Inampudi, Ramesh Babu
Nedunuri, S. S. S. N. Usha Devi
author_facet Pokkuluri, Kiran Sree
Inampudi, Ramesh Babu
Nedunuri, S. S. S. N. Usha Devi
author_sort Pokkuluri, Kiran Sree
collection PubMed
description Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006) dataset and nonpromoters from EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively.
format Online
Article
Text
id pubmed-4123571
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41235712014-08-17 IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction Pokkuluri, Kiran Sree Inampudi, Ramesh Babu Nedunuri, S. S. S. N. Usha Devi Adv Bioinformatics Research Article Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006) dataset and nonpromoters from EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively. Hindawi Publishing Corporation 2014 2014-07-15 /pmc/articles/PMC4123571/ /pubmed/25132849 http://dx.doi.org/10.1155/2014/261362 Text en Copyright © 2014 Kiran Sree Pokkuluri et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Pokkuluri, Kiran Sree
Inampudi, Ramesh Babu
Nedunuri, S. S. S. N. Usha Devi
IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title_full IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title_fullStr IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title_full_unstemmed IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title_short IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
title_sort in-maca-mcc: integrated multiple attractor cellular automata with modified clonal classifier for human protein coding and promoter prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4123571/
https://www.ncbi.nlm.nih.gov/pubmed/25132849
http://dx.doi.org/10.1155/2014/261362
work_keys_str_mv AT pokkulurikiransree inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction
AT inampudirameshbabu inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction
AT nedunurisssnushadevi inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction