Cargando…

Scoring functions for transcription factor binding site prediction

BACKGROUND: Transcription factor binding site (TFBS) prediction is a difficult problem, which requires a good scoring function to discriminate between real binding sites and background noise. Many scoring functions have been proposed in the literature, but it is difficult to assess their relative pe...

Descripción completa

Detalles Bibliográficos
Autores principales: Friberg, Markus, von Rohr, Peter, Gonnet, Gaston
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1140076/
https://www.ncbi.nlm.nih.gov/pubmed/15807889
http://dx.doi.org/10.1186/1471-2105-6-84
Descripción
Sumario:BACKGROUND: Transcription factor binding site (TFBS) prediction is a difficult problem, which requires a good scoring function to discriminate between real binding sites and background noise. Many scoring functions have been proposed in the literature, but it is difficult to assess their relative performance, because they are implemented in different software tools using different search methods and different TFBS representations. RESULTS: Here we compare how several scoring functions perform on both real and semi-simulated data sets in a common test environment. We have also developed two new scoring functions and included them in the comparison. The data sets are from the yeast (S. cerevisiae) genome. Our new scoring function LLBG (least likely under the background model) performs best in this study. It achieves the best average rank for the correct motifs. Scoring functions based on positional bias performed quite poorly in this study. CONCLUSION: LLBG may provide an interesting alternative to current scoring functions for TFBS prediction.