Cargando…

RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies

Accurate identification of promoter regions and transcription start sites (TSS) in genomic DNA allows for a more complete understanding of the structure of genes and gene regulation within a given genome. Many recently published methods have achieved high identification accuracy of TSS. However, mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahdi, Rami N., Rouchka, Eric C.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654504/
https://www.ncbi.nlm.nih.gov/pubmed/19287502
http://dx.doi.org/10.1371/journal.pone.0004878
_version_ 1782165377178927104
author Mahdi, Rami N.
Rouchka, Eric C.
author_facet Mahdi, Rami N.
Rouchka, Eric C.
author_sort Mahdi, Rami N.
collection PubMed
description Accurate identification of promoter regions and transcription start sites (TSS) in genomic DNA allows for a more complete understanding of the structure of genes and gene regulation within a given genome. Many recently published methods have achieved high identification accuracy of TSS. However, models providing more accurate modeling of promoters and TSS are needed. A novel identification method for identifying transcription start sites that improves the accuracy of TSS recognition for recently published methods is proposed. This method incorporates a metric feature based on oligonucleotide positional frequencies, taking into account the nature of promoters. A radial basis function neural network for identifying transcription start sites (RBF-TSS) is proposed and employed as a classification algorithm. Using non-overlapping chunks (windows) of size 50 and 500 on the human genome, the proposed method achieves an area under the Receiver Operator Characteristic curve (auROC) of 94.75% and 95.08% respectively, providing increased performance over existing TSS prediction methods.
format Text
id pubmed-2654504
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26545042009-03-16 RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies Mahdi, Rami N. Rouchka, Eric C. PLoS One Research Article Accurate identification of promoter regions and transcription start sites (TSS) in genomic DNA allows for a more complete understanding of the structure of genes and gene regulation within a given genome. Many recently published methods have achieved high identification accuracy of TSS. However, models providing more accurate modeling of promoters and TSS are needed. A novel identification method for identifying transcription start sites that improves the accuracy of TSS recognition for recently published methods is proposed. This method incorporates a metric feature based on oligonucleotide positional frequencies, taking into account the nature of promoters. A radial basis function neural network for identifying transcription start sites (RBF-TSS) is proposed and employed as a classification algorithm. Using non-overlapping chunks (windows) of size 50 and 500 on the human genome, the proposed method achieves an area under the Receiver Operator Characteristic curve (auROC) of 94.75% and 95.08% respectively, providing increased performance over existing TSS prediction methods. Public Library of Science 2009-03-16 /pmc/articles/PMC2654504/ /pubmed/19287502 http://dx.doi.org/10.1371/journal.pone.0004878 Text en Mahdi et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mahdi, Rami N.
Rouchka, Eric C.
RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title_full RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title_fullStr RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title_full_unstemmed RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title_short RBF-TSS: Identification of Transcription Start Site in Human Using Radial Basis Functions Network and Oligonucleotide Positional Frequencies
title_sort rbf-tss: identification of transcription start site in human using radial basis functions network and oligonucleotide positional frequencies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654504/
https://www.ncbi.nlm.nih.gov/pubmed/19287502
http://dx.doi.org/10.1371/journal.pone.0004878
work_keys_str_mv AT mahdiramin rbftssidentificationoftranscriptionstartsiteinhumanusingradialbasisfunctionsnetworkandoligonucleotidepositionalfrequencies
AT rouchkaericc rbftssidentificationoftranscriptionstartsiteinhumanusingradialbasisfunctionsnetworkandoligonucleotidepositionalfrequencies