Cargando…

An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data

Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for...

Descripción completa

Detalles Bibliográficos
Autores principales: Mishra, Hrishikesh, Singh, Nitya, Misra, Krishna, Lahiri, Tapobrata
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159145/
https://www.ncbi.nlm.nih.gov/pubmed/21887014
_version_ 1782210420675706880
author Mishra, Hrishikesh
Singh, Nitya
Misra, Krishna
Lahiri, Tapobrata
author_facet Mishra, Hrishikesh
Singh, Nitya
Misra, Krishna
Lahiri, Tapobrata
author_sort Mishra, Hrishikesh
collection PubMed
description Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to predict the promoter regions in eukaryotic genomes.
format Online
Article
Text
id pubmed-3159145
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-31591452011-09-01 An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data Mishra, Hrishikesh Singh, Nitya Misra, Krishna Lahiri, Tapobrata Bioinformation Prediction Model Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to predict the promoter regions in eukaryotic genomes. Biomedical Informatics 2011-06-06 /pmc/articles/PMC3159145/ /pubmed/21887014 Text en © 2011 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Prediction Model
Mishra, Hrishikesh
Singh, Nitya
Misra, Krishna
Lahiri, Tapobrata
An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title_full An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title_fullStr An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title_full_unstemmed An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title_short An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data
title_sort ann-ga model based promoter prediction in arabidopsis thaliana using tilling microarray data
topic Prediction Model
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159145/
https://www.ncbi.nlm.nih.gov/pubmed/21887014
work_keys_str_mv AT mishrahrishikesh ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT singhnitya ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT misrakrishna ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT lahiritapobrata ananngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT mishrahrishikesh anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT singhnitya anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT misrakrishna anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata
AT lahiritapobrata anngamodelbasedpromoterpredictioninarabidopsisthalianausingtillingmicroarraydata