Cargando…
M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information
N(6)-methyladenosine (m(6)A) plays an important role in various biological processes. Identifying m(6)A site is a key step in exploring its biological functions. One of the biggest challenges in identifying m(6)A sites is how to extract features comprising rich categorical information to distinguish...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632114/ https://www.ncbi.nlm.nih.gov/pubmed/34161188 http://dx.doi.org/10.1080/15476286.2021.1930729 |
_version_ | 1784607694341013504 |
---|---|
author | Wang, Mingzhao Xie, Juanying Xu, Shengquan |
author_facet | Wang, Mingzhao Xie, Juanying Xu, Shengquan |
author_sort | Wang, Mingzhao |
collection | PubMed |
description | N(6)-methyladenosine (m(6)A) plays an important role in various biological processes. Identifying m(6)A site is a key step in exploring its biological functions. One of the biggest challenges in identifying m(6)A sites is how to extract features comprising rich categorical information to distinguish m(6)A and non-m(6)A sites. To address this challenge, we propose bidirectional dinucleotide and trinucleotide position-specific propensities, respectively, in this paper. Based on this, we propose two feature-encoding algorithms: Position-Specific Propensities and Pointwise Mutual Information (PSP-PMI) and Position-Specific Propensities and Pointwise Joint Mutual Information (PSP-PJMI). PSP-PMI is based on the bidirectional dinucleotide propensity and the pointwise mutual information, while PSP-PJMI is based on the bidirectional trinucleotide position-specific propensity and the proposed pointwise joint mutual information in this paper. We introduce parameters [Image: see text] and [Image: see text] in PSP-PMI and PSP-PJMI, respectively, to represent the distance from the nucleotide to its forward or backward adjacent nucleotide or dinucleotide, so as to extract features containing local and global classification information. Finally, we propose the M6A-BiNP predictor based on PSP-PMI or PSP-PJMI and SVM classifier. The 10-fold cross-validation experimental results on the benchmark datasets of non-single-base resolution and single-base resolution demonstrate that PSP-PMI and PSP-PJMI can extract features with strong capabilities to identify m(6)A and non-m(6)A sites. The M6A-BiNP predictor based on our proposed feature encoding algorithm PSP-PJMI is better than the state-of-the-art predictors, and it is so far the best model to identify m(6)A and non-m(6)A sites. |
format | Online Article Text |
id | pubmed-8632114 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-86321142021-12-01 M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information Wang, Mingzhao Xie, Juanying Xu, Shengquan RNA Biol Research Paper N(6)-methyladenosine (m(6)A) plays an important role in various biological processes. Identifying m(6)A site is a key step in exploring its biological functions. One of the biggest challenges in identifying m(6)A sites is how to extract features comprising rich categorical information to distinguish m(6)A and non-m(6)A sites. To address this challenge, we propose bidirectional dinucleotide and trinucleotide position-specific propensities, respectively, in this paper. Based on this, we propose two feature-encoding algorithms: Position-Specific Propensities and Pointwise Mutual Information (PSP-PMI) and Position-Specific Propensities and Pointwise Joint Mutual Information (PSP-PJMI). PSP-PMI is based on the bidirectional dinucleotide propensity and the pointwise mutual information, while PSP-PJMI is based on the bidirectional trinucleotide position-specific propensity and the proposed pointwise joint mutual information in this paper. We introduce parameters [Image: see text] and [Image: see text] in PSP-PMI and PSP-PJMI, respectively, to represent the distance from the nucleotide to its forward or backward adjacent nucleotide or dinucleotide, so as to extract features containing local and global classification information. Finally, we propose the M6A-BiNP predictor based on PSP-PMI or PSP-PJMI and SVM classifier. The 10-fold cross-validation experimental results on the benchmark datasets of non-single-base resolution and single-base resolution demonstrate that PSP-PMI and PSP-PJMI can extract features with strong capabilities to identify m(6)A and non-m(6)A sites. The M6A-BiNP predictor based on our proposed feature encoding algorithm PSP-PJMI is better than the state-of-the-art predictors, and it is so far the best model to identify m(6)A and non-m(6)A sites. Taylor & Francis 2021-06-23 /pmc/articles/PMC8632114/ /pubmed/34161188 http://dx.doi.org/10.1080/15476286.2021.1930729 Text en © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. |
spellingShingle | Research Paper Wang, Mingzhao Xie, Juanying Xu, Shengquan M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title | M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title_full | M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title_fullStr | M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title_full_unstemmed | M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title_short | M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
title_sort | m6a-binp: predicting n(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632114/ https://www.ncbi.nlm.nih.gov/pubmed/34161188 http://dx.doi.org/10.1080/15476286.2021.1930729 |
work_keys_str_mv | AT wangmingzhao m6abinppredictingn6methyladenosinesitesbasedonbidirectionalpositionspecificpropensitiesofpolynucleotidesandpointwisejointmutualinformation AT xiejuanying m6abinppredictingn6methyladenosinesitesbasedonbidirectionalpositionspecificpropensitiesofpolynucleotidesandpointwisejointmutualinformation AT xushengquan m6abinppredictingn6methyladenosinesitesbasedonbidirectionalpositionspecificpropensitiesofpolynucleotidesandpointwisejointmutualinformation |