Cargando…

Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences

Deciphering the code of cis-regulatory element (CRE) is one of the core issues of current biology. As an important category of CRE, enhancers play crucial roles in gene transcriptional regulations in a distant manner. Further, the disruption of an enhancer can cause abnormal transcription and, thus,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chengchao, Chen, Jin, Liu, Yunxia, Hu, Xuehai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480087/
https://www.ncbi.nlm.nih.gov/pubmed/30959806
http://dx.doi.org/10.3390/ijms20071704
_version_ 1783413495369302016
author Wu, Chengchao
Chen, Jin
Liu, Yunxia
Hu, Xuehai
author_facet Wu, Chengchao
Chen, Jin
Liu, Yunxia
Hu, Xuehai
author_sort Wu, Chengchao
collection PubMed
description Deciphering the code of cis-regulatory element (CRE) is one of the core issues of current biology. As an important category of CRE, enhancers play crucial roles in gene transcriptional regulations in a distant manner. Further, the disruption of an enhancer can cause abnormal transcription and, thus, trigger human diseases, which means that its accurate identification is currently of broad interest. Here, we introduce an innovative concept, i.e., abelian complexity function (ACF), which is a more complex extension of the classic subword complexity function, for a new coding of DNA sequences. After feature selection by an upper bound estimation and integration with DNA composition features, we developed an enhancer prediction model with hybrid abelian complexity features (HACF). Compared with existing methods, HACF shows consistently superior performance on three sources of enhancer datasets. We tested the generalization ability of HACF by scanning human chromosome 22 to validate previously reported super-enhancers. Meanwhile, we identified novel candidate enhancers which have supports from enhancer-related ENCODE ChIP-seq signals. In summary, HACF improves current enhancer prediction and may be beneficial for further prioritization of functional noncoding variants.
format Online
Article
Text
id pubmed-6480087
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64800872019-04-29 Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences Wu, Chengchao Chen, Jin Liu, Yunxia Hu, Xuehai Int J Mol Sci Article Deciphering the code of cis-regulatory element (CRE) is one of the core issues of current biology. As an important category of CRE, enhancers play crucial roles in gene transcriptional regulations in a distant manner. Further, the disruption of an enhancer can cause abnormal transcription and, thus, trigger human diseases, which means that its accurate identification is currently of broad interest. Here, we introduce an innovative concept, i.e., abelian complexity function (ACF), which is a more complex extension of the classic subword complexity function, for a new coding of DNA sequences. After feature selection by an upper bound estimation and integration with DNA composition features, we developed an enhancer prediction model with hybrid abelian complexity features (HACF). Compared with existing methods, HACF shows consistently superior performance on three sources of enhancer datasets. We tested the generalization ability of HACF by scanning human chromosome 22 to validate previously reported super-enhancers. Meanwhile, we identified novel candidate enhancers which have supports from enhancer-related ENCODE ChIP-seq signals. In summary, HACF improves current enhancer prediction and may be beneficial for further prioritization of functional noncoding variants. MDPI 2019-04-05 /pmc/articles/PMC6480087/ /pubmed/30959806 http://dx.doi.org/10.3390/ijms20071704 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wu, Chengchao
Chen, Jin
Liu, Yunxia
Hu, Xuehai
Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title_full Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title_fullStr Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title_full_unstemmed Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title_short Improved Prediction of Regulatory Element Using Hybrid Abelian Complexity Features with DNA Sequences
title_sort improved prediction of regulatory element using hybrid abelian complexity features with dna sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480087/
https://www.ncbi.nlm.nih.gov/pubmed/30959806
http://dx.doi.org/10.3390/ijms20071704
work_keys_str_mv AT wuchengchao improvedpredictionofregulatoryelementusinghybridabeliancomplexityfeatureswithdnasequences
AT chenjin improvedpredictionofregulatoryelementusinghybridabeliancomplexityfeatureswithdnasequences
AT liuyunxia improvedpredictionofregulatoryelementusinghybridabeliancomplexityfeatureswithdnasequences
AT huxuehai improvedpredictionofregulatoryelementusinghybridabeliancomplexityfeatureswithdnasequences