Cargando…
In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences
Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed po...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5007594/ https://www.ncbi.nlm.nih.gov/pubmed/27582178 http://dx.doi.org/10.1038/srep32476 |
_version_ | 1782451239848509440 |
---|---|
author | Fang, Yaping Wang, Yunlong Zhu, Qin Wang, Jia Li, Guoliang |
author_facet | Fang, Yaping Wang, Yunlong Zhu, Qin Wang, Jia Li, Guoliang |
author_sort | Fang, Yaping |
collection | PubMed |
description | Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed positions or consensus sequence features, and biological experiments for enhancer identification are costly in terms of labor and expense. In this work, several models were built by using various sequence-based feature sets and their combinations for enhancer prediction. The selected features derived from a recursive feature elimination method showed that the model using a combination of 141 transcription factor binding motif occurrences from 1,422 transcription factor position weight matrices achieved a favorably high prediction accuracy superior to that of other reported methods. The models demonstrated good prediction accuracy for different enhancer datasets obtained from different cell lines/tissues. In addition, prediction accuracy was further improved by integration of chromatin state features. Our method is complementary to wet-lab experimental methods and provides an additional method to identify enhancers. |
format | Online Article Text |
id | pubmed-5007594 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-50075942016-09-08 In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences Fang, Yaping Wang, Yunlong Zhu, Qin Wang, Jia Li, Guoliang Sci Rep Article Enhancers interact with gene promoters and form chromatin looping structures that serve important functions in various biological processes, such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally do not have fixed positions or consensus sequence features, and biological experiments for enhancer identification are costly in terms of labor and expense. In this work, several models were built by using various sequence-based feature sets and their combinations for enhancer prediction. The selected features derived from a recursive feature elimination method showed that the model using a combination of 141 transcription factor binding motif occurrences from 1,422 transcription factor position weight matrices achieved a favorably high prediction accuracy superior to that of other reported methods. The models demonstrated good prediction accuracy for different enhancer datasets obtained from different cell lines/tissues. In addition, prediction accuracy was further improved by integration of chromatin state features. Our method is complementary to wet-lab experimental methods and provides an additional method to identify enhancers. Nature Publishing Group 2016-09-01 /pmc/articles/PMC5007594/ /pubmed/27582178 http://dx.doi.org/10.1038/srep32476 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Fang, Yaping Wang, Yunlong Zhu, Qin Wang, Jia Li, Guoliang In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title | In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title_full | In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title_fullStr | In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title_full_unstemmed | In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title_short | In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
title_sort | in silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5007594/ https://www.ncbi.nlm.nih.gov/pubmed/27582178 http://dx.doi.org/10.1038/srep32476 |
work_keys_str_mv | AT fangyaping insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences AT wangyunlong insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences AT zhuqin insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences AT wangjia insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences AT liguoliang insilicoidentificationofenhancersonthebasisofacombinationoftranscriptionfactorbindingmotifoccurrences |