Cargando…

Detection and identification of cis-regulatory elements using change-point and classification algorithms

BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of...

Descripción completa

Detalles Bibliográficos
Autores principales: Maderazo, Dominic, Flegg, Jennifer A., Algama, Manjula, Ramialison, Mirana, Keith, Jonathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790847/
https://www.ncbi.nlm.nih.gov/pubmed/35078412
http://dx.doi.org/10.1186/s12864-021-08190-0
Descripción
Sumario:BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS: We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS: The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-08190-0).