Cargando…

Detection and identification of cis-regulatory elements using change-point and classification algorithms

BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of...

Descripción completa

Detalles Bibliográficos
Autores principales: Maderazo, Dominic, Flegg, Jennifer A., Algama, Manjula, Ramialison, Mirana, Keith, Jonathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790847/
https://www.ncbi.nlm.nih.gov/pubmed/35078412
http://dx.doi.org/10.1186/s12864-021-08190-0
_version_ 1784640105208610816
author Maderazo, Dominic
Flegg, Jennifer A.
Algama, Manjula
Ramialison, Mirana
Keith, Jonathan
author_facet Maderazo, Dominic
Flegg, Jennifer A.
Algama, Manjula
Ramialison, Mirana
Keith, Jonathan
author_sort Maderazo, Dominic
collection PubMed
description BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS: We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS: The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-08190-0).
format Online
Article
Text
id pubmed-8790847
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-87908472022-01-26 Detection and identification of cis-regulatory elements using change-point and classification algorithms Maderazo, Dominic Flegg, Jennifer A. Algama, Manjula Ramialison, Mirana Keith, Jonathan BMC Genomics Methodology Article BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS: We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS: The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-08190-0). BioMed Central 2022-01-25 /pmc/articles/PMC8790847/ /pubmed/35078412 http://dx.doi.org/10.1186/s12864-021-08190-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Maderazo, Dominic
Flegg, Jennifer A.
Algama, Manjula
Ramialison, Mirana
Keith, Jonathan
Detection and identification of cis-regulatory elements using change-point and classification algorithms
title Detection and identification of cis-regulatory elements using change-point and classification algorithms
title_full Detection and identification of cis-regulatory elements using change-point and classification algorithms
title_fullStr Detection and identification of cis-regulatory elements using change-point and classification algorithms
title_full_unstemmed Detection and identification of cis-regulatory elements using change-point and classification algorithms
title_short Detection and identification of cis-regulatory elements using change-point and classification algorithms
title_sort detection and identification of cis-regulatory elements using change-point and classification algorithms
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790847/
https://www.ncbi.nlm.nih.gov/pubmed/35078412
http://dx.doi.org/10.1186/s12864-021-08190-0
work_keys_str_mv AT maderazodominic detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms
AT fleggjennifera detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms
AT algamamanjula detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms
AT ramialisonmirana detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms
AT keithjonathan detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms