Cargando…

In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences

Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed po...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Yaping, Wang, Yunlong, Zhu, Qin, Wang, Jia, Li, Guoliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5007594/
https://www.ncbi.nlm.nih.gov/pubmed/27582178
http://dx.doi.org/10.1038/srep32476
_version_ 1782451239848509440
author Fang, Yaping
Wang, Yunlong
Zhu, Qin
Wang, Jia
Li, Guoliang
author_facet Fang, Yaping
Wang, Yunlong
Zhu, Qin
Wang, Jia
Li, Guoliang
author_sort Fang, Yaping
collection PubMed
description Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed positions or consensus sequence features, and biological experiments for enhancer identification are costly in terms of labor and expense. In this work, several models were built by using various sequence-based feature sets and their combinations for enhancer prediction. The selected features derived from a recursive feature elimination method showed that the model using a combination of 141 transcription factor binding motif occurrences from 1,422 transcription factor position weight matrices achieved a favorably high prediction accuracy superior to that of other reported methods. The models demonstrated good prediction accuracy for different enhancer datasets obtained from different cell lines/tissues. In addition, prediction accuracy was further improved by integration of chromatin state features. Our method is complementary to wet-lab experimental methods and provides an additional method to identify enhancers.
format Online
Article
Text
id pubmed-5007594
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-50075942016-09-08 In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences Fang, Yaping Wang, Yunlong Zhu, Qin Wang, Jia Li, Guoliang Sci Rep Article Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed positions or consensus sequence features, and biological experiments for enhancer identification are costly in terms of labor and expense. In this work, several models were built by using various sequence-based feature sets and their combinations for enhancer prediction. The selected features derived from a recursive feature elimination method showed that the model using a combination of 141 transcription factor binding motif occurrences from 1,422 transcription factor position weight matrices achieved a favorably high prediction accuracy superior to that of other reported methods. The models demonstrated good prediction accuracy for different enhancer datasets obtained from different cell lines/tissues. In addition, prediction accuracy was further improved by integration of chromatin state features. Our method is complementary to wet-lab experimental methods and provides an additional method to identify enhancers. Nature Publishing Group 2016-09-01 /pmc/articles/PMC5007594/ /pubmed/27582178 http://dx.doi.org/10.1038/srep32476 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Fang, Yaping
Wang, Yunlong
Zhu, Qin
Wang, Jia
Li, Guoliang
In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title_full In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title_fullStr In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title_full_unstemmed In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title_short In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
title_sort in silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5007594/
https://www.ncbi.nlm.nih.gov/pubmed/27582178
http://dx.doi.org/10.1038/srep32476
work_keys_str_mv AT fangyaping insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences
AT wangyunlong insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences
AT zhuqin insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences
AT wangjia insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences
AT liguoliang insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences