Cargando…
An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Biomedical Informatics
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159145/ https://www.ncbi.nlm.nih.gov/pubmed/21887014 |
_version_ | 1782210420675706880 |
---|---|
author | Mishra, Hrishikesh Singh, Nitya Misra, Krishna Lahiri, Tapobrata |
author_facet | Mishra, Hrishikesh Singh, Nitya Misra, Krishna Lahiri, Tapobrata |
author_sort | Mishra, Hrishikesh |
collection | PubMed |
description | Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to predict the promoter regions in eukaryotic genomes. |
format | Online Article Text |
id | pubmed-3159145 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Biomedical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-31591452011-09-01 An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data Mishra, Hrishikesh Singh, Nitya Misra, Krishna Lahiri, Tapobrata Bioinformation Prediction Model Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to predict the promoter regions in eukaryotic genomes. Biomedical Informatics 2011-06-06 /pmc/articles/PMC3159145/ /pubmed/21887014 Text en © 2011 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
spellingShingle | Prediction Model Mishra, Hrishikesh Singh, Nitya Misra, Krishna Lahiri, Tapobrata An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title | An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title_full | An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title_fullStr | An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title_full_unstemmed | An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title_short | An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data |
title_sort | ann-ga model based promoter prediction in arabidopsis thaliana using tilling microarray data |
topic | Prediction Model |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159145/ https://www.ncbi.nlm.nih.gov/pubmed/21887014 |
work_keys_str_mv | AT mishrahrishikesh ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT singhnitya ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT misrakrishna ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT lahiritapobrata ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT mishrahrishikesh anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT singhnitya anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT misrakrishna anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata AT lahiritapobrata anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata |