Cargando…

EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features

Enhancers are cis elements that play an important role in regulating gene expression by enhancing it. Recent study of modifications revealed that enhancers are a large group of functional elements with many different subgroups, which have different biological activities and regulatory effects on tar...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Cangzhi, He, Wenying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5150536/
https://www.ncbi.nlm.nih.gov/pubmed/27941893
http://dx.doi.org/10.1038/srep38741
_version_ 1782474219637964800
author Jia, Cangzhi
He, Wenying
author_facet Jia, Cangzhi
He, Wenying
author_sort Jia, Cangzhi
collection PubMed
description Enhancers are cis elements that play an important role in regulating gene expression by enhancing it. Recent study of modifications revealed that enhancers are a large group of functional elements with many different subgroups, which have different biological activities and regulatory effects on target genes. As powerful auxiliary tools, several computational methods have been proposed to distinguish enhancers from other regulatory elements, but only one method has been considered to clustering them into subgroups. In this study, we developed a predictor (called EnhancerPred) to distinguish between enhancers and nonenhancers and to determine enhancers’ strength. A two-step wrapper-based feature selection method was applied in high dimension feature vector from bi-profile Bayes and pseudo-nucleotide composition. Finally, the combination of 104 features from bi-profile Bayes, 1 feature from nucleotide composition and 9 features from pseudo-nucleotide composition yielded the best performance for identifying enhancers and nonenhancers, with overall Acc of 77.39%. The combination of 89 features from bi-profile Bayes and 10 features from pseudo-nucleotide composition yielded the best performance for identifying strong and weak enhancers, with overall Acc of 68.19%. The process and steps of feature optimization illustrated that it is necessary to construct a particular model for identifying strong enhancers and weak enhancers.
format Online
Article
Text
id pubmed-5150536
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-51505362016-12-19 EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features Jia, Cangzhi He, Wenying Sci Rep Article Enhancers are cis elements that play an important role in regulating gene expression by enhancing it. Recent study of modifications revealed that enhancers are a large group of functional elements with many different subgroups, which have different biological activities and regulatory effects on target genes. As powerful auxiliary tools, several computational methods have been proposed to distinguish enhancers from other regulatory elements, but only one method has been considered to clustering them into subgroups. In this study, we developed a predictor (called EnhancerPred) to distinguish between enhancers and nonenhancers and to determine enhancers’ strength. A two-step wrapper-based feature selection method was applied in high dimension feature vector from bi-profile Bayes and pseudo-nucleotide composition. Finally, the combination of 104 features from bi-profile Bayes, 1 feature from nucleotide composition and 9 features from pseudo-nucleotide composition yielded the best performance for identifying enhancers and nonenhancers, with overall Acc of 77.39%. The combination of 89 features from bi-profile Bayes and 10 features from pseudo-nucleotide composition yielded the best performance for identifying strong and weak enhancers, with overall Acc of 68.19%. The process and steps of feature optimization illustrated that it is necessary to construct a particular model for identifying strong enhancers and weak enhancers. Nature Publishing Group 2016-12-12 /pmc/articles/PMC5150536/ /pubmed/27941893 http://dx.doi.org/10.1038/srep38741 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Jia, Cangzhi
He, Wenying
EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title_full EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title_fullStr EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title_full_unstemmed EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title_short EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
title_sort enhancerpred: a predictor for discovering enhancers based on the combination and selection of multiple features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5150536/
https://www.ncbi.nlm.nih.gov/pubmed/27941893
http://dx.doi.org/10.1038/srep38741
work_keys_str_mv AT jiacangzhi enhancerpredapredictorfordiscoveringenhancersbasedonthecombinationandselectionofmultiplefeatures
AT hewenying enhancerpredapredictorfordiscoveringenhancersbasedonthecombinationandselectionofmultiplefeatures