Cargando…

A novel sequence and context based method for promoter recognition

Identification of promoters in DNA sequence using computational techniques is a significant research area because of its direct association in transcription regulation. A wide range of algorithms are available for promoter prediction. Most of them are polymerase dependent and cannot handle eukaryote...

Descripción completa

Detalles Bibliográficos
Autores principales: P, Umesh, Dubey, Jitendra Kumar, RV, Karthika, Cherian, Betsy Sheena, Gopalakrishnan, Gopakumar, Nair, Achuthsankar Sukumaran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070045/
https://www.ncbi.nlm.nih.gov/pubmed/24966516
http://dx.doi.org/10.6026/97320630010175
_version_ 1782322632106967040
author P, Umesh
Dubey, Jitendra Kumar
RV, Karthika
Cherian, Betsy Sheena
Gopalakrishnan, Gopakumar
Nair, Achuthsankar Sukumaran
author_facet P, Umesh
Dubey, Jitendra Kumar
RV, Karthika
Cherian, Betsy Sheena
Gopalakrishnan, Gopakumar
Nair, Achuthsankar Sukumaran
author_sort P, Umesh
collection PubMed
description Identification of promoters in DNA sequence using computational techniques is a significant research area because of its direct association in transcription regulation. A wide range of algorithms are available for promoter prediction. Most of them are polymerase dependent and cannot handle eukaryotes and prokaryotes alike. This study proposes a polymerase independent algorithm, which can predict whether a given DNA fragment is a promoter or not, based on the sequence features and statistical elements. This algorithm considers all possible pentamers formed from the nucleotides A, C, G, and T along with CpG islands, TATA box, initiator elements, and downstream promoter elements. The highlight of the algorithm is that it is not polymerase specific and can predict for both eukaryotes and prokaryotes in the same computational manner even though the underlying biological mechanisms of promoter recognition differ greatly. The proposed Method, Promoter Prediction System - PPS-CBM achieved a sensitivity, specificity, and accuracy percentages of 75.08, 83.58 and 79.33 on E. coli data set and 86.67, 88.41 and 87.58 on human data set. We have developed a tool based on PPS-CBM, the proposed algorithm, with which multiple sequences of varying lengths can be tested simultaneously and the result is reported in a comprehensive tabular format. The tool also reports the strength of the prediction. AVAILABILITY: The tool and source code of PPS-CBM is available at http://keralabs.org
format Online
Article
Text
id pubmed-4070045
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-40700452014-06-25 A novel sequence and context based method for promoter recognition P, Umesh Dubey, Jitendra Kumar RV, Karthika Cherian, Betsy Sheena Gopalakrishnan, Gopakumar Nair, Achuthsankar Sukumaran Bioinformation Hypothesis Identification of promoters in DNA sequence using computational techniques is a significant research area because of its direct association in transcription regulation. A wide range of algorithms are available for promoter prediction. Most of them are polymerase dependent and cannot handle eukaryotes and prokaryotes alike. This study proposes a polymerase independent algorithm, which can predict whether a given DNA fragment is a promoter or not, based on the sequence features and statistical elements. This algorithm considers all possible pentamers formed from the nucleotides A, C, G, and T along with CpG islands, TATA box, initiator elements, and downstream promoter elements. The highlight of the algorithm is that it is not polymerase specific and can predict for both eukaryotes and prokaryotes in the same computational manner even though the underlying biological mechanisms of promoter recognition differ greatly. The proposed Method, Promoter Prediction System - PPS-CBM achieved a sensitivity, specificity, and accuracy percentages of 75.08, 83.58 and 79.33 on E. coli data set and 86.67, 88.41 and 87.58 on human data set. We have developed a tool based on PPS-CBM, the proposed algorithm, with which multiple sequences of varying lengths can be tested simultaneously and the result is reported in a comprehensive tabular format. The tool also reports the strength of the prediction. AVAILABILITY: The tool and source code of PPS-CBM is available at http://keralabs.org Biomedical Informatics 2014-04-23 /pmc/articles/PMC4070045/ /pubmed/24966516 http://dx.doi.org/10.6026/97320630010175 Text en © 2014 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
P, Umesh
Dubey, Jitendra Kumar
RV, Karthika
Cherian, Betsy Sheena
Gopalakrishnan, Gopakumar
Nair, Achuthsankar Sukumaran
A novel sequence and context based method for promoter recognition
title A novel sequence and context based method for promoter recognition
title_full A novel sequence and context based method for promoter recognition
title_fullStr A novel sequence and context based method for promoter recognition
title_full_unstemmed A novel sequence and context based method for promoter recognition
title_short A novel sequence and context based method for promoter recognition
title_sort novel sequence and context based method for promoter recognition
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070045/
https://www.ncbi.nlm.nih.gov/pubmed/24966516
http://dx.doi.org/10.6026/97320630010175
work_keys_str_mv AT pumesh anovelsequenceandcontextbasedmethodforpromoterrecognition
AT dubeyjitendrakumar anovelsequenceandcontextbasedmethodforpromoterrecognition
AT rvkarthika anovelsequenceandcontextbasedmethodforpromoterrecognition
AT cherianbetsysheena anovelsequenceandcontextbasedmethodforpromoterrecognition
AT gopalakrishnangopakumar anovelsequenceandcontextbasedmethodforpromoterrecognition
AT nairachuthsankarsukumaran anovelsequenceandcontextbasedmethodforpromoterrecognition
AT pumesh novelsequenceandcontextbasedmethodforpromoterrecognition
AT dubeyjitendrakumar novelsequenceandcontextbasedmethodforpromoterrecognition
AT rvkarthika novelsequenceandcontextbasedmethodforpromoterrecognition
AT cherianbetsysheena novelsequenceandcontextbasedmethodforpromoterrecognition
AT gopalakrishnangopakumar novelsequenceandcontextbasedmethodforpromoterrecognition
AT nairachuthsankarsukumaran novelsequenceandcontextbasedmethodforpromoterrecognition