Cargando…

An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation

Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Guohua, Zheng, Yang, Wu, Yao-Qun, Han, Guo-Sheng, Yu, Zu-Guo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033570/
https://www.ncbi.nlm.nih.gov/pubmed/32117407
http://dx.doi.org/10.3389/fgene.2019.01325
_version_ 1783499698568429568
author Huang, Guohua
Zheng, Yang
Wu, Yao-Qun
Han, Guo-Sheng
Yu, Zu-Guo
author_facet Huang, Guohua
Zheng, Yang
Wu, Yao-Qun
Han, Guo-Sheng
Yu, Zu-Guo
author_sort Huang, Guohua
collection PubMed
description Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones.
format Online
Article
Text
id pubmed-7033570
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70335702020-02-28 An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation Huang, Guohua Zheng, Yang Wu, Yao-Qun Han, Guo-Sheng Yu, Zu-Guo Front Genet Genetics Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones. Frontiers Media S.A. 2020-02-14 /pmc/articles/PMC7033570/ /pubmed/32117407 http://dx.doi.org/10.3389/fgene.2019.01325 Text en Copyright © 2020 Huang, Zheng, Wu, Han and Yu http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Huang, Guohua
Zheng, Yang
Wu, Yao-Qun
Han, Guo-Sheng
Yu, Zu-Guo
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_full An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_fullStr An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_full_unstemmed An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_short An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_sort information entropy-based approach for computationally identifying histone lysine butyrylation
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7033570/
https://www.ncbi.nlm.nih.gov/pubmed/32117407
http://dx.doi.org/10.3389/fgene.2019.01325
work_keys_str_mv AT huangguohua aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zhengyang aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT wuyaoqun aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT hanguosheng aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yuzuguo aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT huangguohua informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zhengyang informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT wuyaoqun informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT hanguosheng informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yuzuguo informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation