Cargando…
IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction
Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4123571/ https://www.ncbi.nlm.nih.gov/pubmed/25132849 http://dx.doi.org/10.1155/2014/261362 |
_version_ | 1782329510972096512 |
---|---|
author | Pokkuluri, Kiran Sree Inampudi, Ramesh Babu Nedunuri, S. S. S. N. Usha Devi |
author_facet | Pokkuluri, Kiran Sree Inampudi, Ramesh Babu Nedunuri, S. S. S. N. Usha Devi |
author_sort | Pokkuluri, Kiran Sree |
collection | PubMed |
description | Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006) dataset and nonpromoters from EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively. |
format | Online Article Text |
id | pubmed-4123571 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-41235712014-08-17 IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction Pokkuluri, Kiran Sree Inampudi, Ramesh Babu Nedunuri, S. S. S. N. Usha Devi Adv Bioinformatics Research Article Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006) dataset and nonpromoters from EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively. Hindawi Publishing Corporation 2014 2014-07-15 /pmc/articles/PMC4123571/ /pubmed/25132849 http://dx.doi.org/10.1155/2014/261362 Text en Copyright © 2014 Kiran Sree Pokkuluri et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Pokkuluri, Kiran Sree Inampudi, Ramesh Babu Nedunuri, S. S. S. N. Usha Devi IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title | IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title_full | IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title_fullStr | IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title_full_unstemmed | IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title_short | IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction |
title_sort | in-maca-mcc: integrated multiple attractor cellular automata with modified clonal classifier for human protein coding and promoter prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4123571/ https://www.ncbi.nlm.nih.gov/pubmed/25132849 http://dx.doi.org/10.1155/2014/261362 |
work_keys_str_mv | AT pokkulurikiransree inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction AT inampudirameshbabu inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction AT nedunurisssnushadevi inmacamccintegratedmultipleattractorcellularautomatawithmodifiedclonalclassifierforhumanproteincodingandpromoterprediction |