Cargando…
Detection and identification of cis-regulatory elements using change-point and classification algorithms
BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790847/ https://www.ncbi.nlm.nih.gov/pubmed/35078412 http://dx.doi.org/10.1186/s12864-021-08190-0 |
_version_ | 1784640105208610816 |
---|---|
author | Maderazo, Dominic Flegg, Jennifer A. Algama, Manjula Ramialison, Mirana Keith, Jonathan |
author_facet | Maderazo, Dominic Flegg, Jennifer A. Algama, Manjula Ramialison, Mirana Keith, Jonathan |
author_sort | Maderazo, Dominic |
collection | PubMed |
description | BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS: We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS: The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-08190-0). |
format | Online Article Text |
id | pubmed-8790847 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-87908472022-01-26 Detection and identification of cis-regulatory elements using change-point and classification algorithms Maderazo, Dominic Flegg, Jennifer A. Algama, Manjula Ramialison, Mirana Keith, Jonathan BMC Genomics Methodology Article BACKGROUND: Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS: We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS: The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-08190-0). BioMed Central 2022-01-25 /pmc/articles/PMC8790847/ /pubmed/35078412 http://dx.doi.org/10.1186/s12864-021-08190-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Maderazo, Dominic Flegg, Jennifer A. Algama, Manjula Ramialison, Mirana Keith, Jonathan Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title | Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title_full | Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title_fullStr | Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title_full_unstemmed | Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title_short | Detection and identification of cis-regulatory elements using change-point and classification algorithms |
title_sort | detection and identification of cis-regulatory elements using change-point and classification algorithms |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790847/ https://www.ncbi.nlm.nih.gov/pubmed/35078412 http://dx.doi.org/10.1186/s12864-021-08190-0 |
work_keys_str_mv | AT maderazodominic detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms AT fleggjennifera detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms AT algamamanjula detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms AT ramialisonmirana detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms AT keithjonathan detectionandidentificationofcisregulatoryelementsusingchangepointandclassificationalgorithms |