Cargando…
Machine learning predicts nucleosome binding modes of transcription factors
BACKGROUND: Most transcription factors (TFs) compete with nucleosomes to gain access to their cognate binding sites. Recent studies have identified several TF-nucleosome interaction modes including end binding (EB), oriented binding, periodic binding, dyad binding, groove binding, and gyre spanning....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008688/ https://www.ncbi.nlm.nih.gov/pubmed/33784978 http://dx.doi.org/10.1186/s12859-021-04093-9 |
_version_ | 1783672736978042880 |
---|---|
author | Kishan, K. C. Subramanya, Sridevi K. Li, Rui Cui, Feng |
author_facet | Kishan, K. C. Subramanya, Sridevi K. Li, Rui Cui, Feng |
author_sort | Kishan, K. C. |
collection | PubMed |
description | BACKGROUND: Most transcription factors (TFs) compete with nucleosomes to gain access to their cognate binding sites. Recent studies have identified several TF-nucleosome interaction modes including end binding (EB), oriented binding, periodic binding, dyad binding, groove binding, and gyre spanning. However, there are substantial experimental challenges in measuring nucleosome binding modes for thousands of TFs in different species. RESULTS: We present a computational prediction of the binding modes based on TF protein sequences. With a nested cross-validation procedure, our model outperforms several fine-tuned off-the-shelf machine learning (ML) methods in the multi-label classification task. Our binary classifier for the EB mode performs better than these ML methods with the area under precision-recall curve achieving 75%. The end preference of most TFs is consistent with low nucleosome occupancy around their binding site in GM12878 cells. The nucleosome occupancy data is used as an alternative dataset to confirm the superiority of our EB classifier. CONCLUSIONS: We develop the first ML-based approach for efficient and comprehensive analysis of nucleosome binding modes of TFs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04093-9. |
format | Online Article Text |
id | pubmed-8008688 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80086882021-03-31 Machine learning predicts nucleosome binding modes of transcription factors Kishan, K. C. Subramanya, Sridevi K. Li, Rui Cui, Feng BMC Bioinformatics Research Article BACKGROUND: Most transcription factors (TFs) compete with nucleosomes to gain access to their cognate binding sites. Recent studies have identified several TF-nucleosome interaction modes including end binding (EB), oriented binding, periodic binding, dyad binding, groove binding, and gyre spanning. However, there are substantial experimental challenges in measuring nucleosome binding modes for thousands of TFs in different species. RESULTS: We present a computational prediction of the binding modes based on TF protein sequences. With a nested cross-validation procedure, our model outperforms several fine-tuned off-the-shelf machine learning (ML) methods in the multi-label classification task. Our binary classifier for the EB mode performs better than these ML methods with the area under precision-recall curve achieving 75%. The end preference of most TFs is consistent with low nucleosome occupancy around their binding site in GM12878 cells. The nucleosome occupancy data is used as an alternative dataset to confirm the superiority of our EB classifier. CONCLUSIONS: We develop the first ML-based approach for efficient and comprehensive analysis of nucleosome binding modes of TFs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04093-9. BioMed Central 2021-03-30 /pmc/articles/PMC8008688/ /pubmed/33784978 http://dx.doi.org/10.1186/s12859-021-04093-9 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Kishan, K. C. Subramanya, Sridevi K. Li, Rui Cui, Feng Machine learning predicts nucleosome binding modes of transcription factors |
title | Machine learning predicts nucleosome binding modes of transcription factors |
title_full | Machine learning predicts nucleosome binding modes of transcription factors |
title_fullStr | Machine learning predicts nucleosome binding modes of transcription factors |
title_full_unstemmed | Machine learning predicts nucleosome binding modes of transcription factors |
title_short | Machine learning predicts nucleosome binding modes of transcription factors |
title_sort | machine learning predicts nucleosome binding modes of transcription factors |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008688/ https://www.ncbi.nlm.nih.gov/pubmed/33784978 http://dx.doi.org/10.1186/s12859-021-04093-9 |
work_keys_str_mv | AT kishankc machinelearningpredictsnucleosomebindingmodesoftranscriptionfactors AT subramanyasridevik machinelearningpredictsnucleosomebindingmodesoftranscriptionfactors AT lirui machinelearningpredictsnucleosomebindingmodesoftranscriptionfactors AT cuifeng machinelearningpredictsnucleosomebindingmodesoftranscriptionfactors |