M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information

N(6)-methyladenosine (m(6)A) plays an important role in various biological processes. Identifying m(6)A sites is a key step in exploring its biological functions. One of the biggest challenges in identifying m(6)A sites is how to extract features comprising rich categorical information to distinguish...

Bibliographic Details
Main authors: Wang, Mingzhao; Xie, Juanying; Xu, Shengquan
Format: Online Article Text
Language: English
Published: Taylor & Francis 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632114/
https://www.ncbi.nlm.nih.gov/pubmed/34161188
http://dx.doi.org/10.1080/15476286.2021.1930729
_version_ 1784607694341013504
author Wang, Mingzhao
Xie, Juanying
Xu, Shengquan
collection PubMed
description N(6)-methyladenosine (m(6)A) plays an important role in various biological processes. Identifying m(6)A sites is a key step in exploring its biological functions. One of the biggest challenges in identifying m(6)A sites is how to extract features that carry rich categorical information for distinguishing m(6)A from non-m(6)A sites. To address this challenge, we propose bidirectional dinucleotide and trinucleotide position-specific propensities. Based on these, we propose two feature-encoding algorithms: Position-Specific Propensities and Pointwise Mutual Information (PSP-PMI) and Position-Specific Propensities and Pointwise Joint Mutual Information (PSP-PJMI). PSP-PMI is based on the bidirectional dinucleotide propensity and pointwise mutual information, while PSP-PJMI is based on the bidirectional trinucleotide position-specific propensity and the pointwise joint mutual information proposed in this paper. We introduce parameters [Image: see text] and [Image: see text] in PSP-PMI and PSP-PJMI, respectively, to represent the distance from a nucleotide to its forward or backward adjacent nucleotide or dinucleotide, so as to extract features containing both local and global classification information. Finally, we propose the M6A-BiNP predictor, based on PSP-PMI or PSP-PJMI and an SVM classifier. The 10-fold cross-validation results on benchmark datasets of non-single-base and single-base resolution demonstrate that PSP-PMI and PSP-PJMI extract features with a strong capability to distinguish m(6)A from non-m(6)A sites. The M6A-BiNP predictor based on our proposed feature-encoding algorithm PSP-PJMI performs better than the state-of-the-art predictors, and it is so far the best model to identify m(6)A and non-m(6)A sites.
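note As a rough illustration of the pointwise mutual information ingredient named in the description, the sketch below estimates PMI(a; b) = log2( p(a, b) / (p(a) p(b)) ) between the nucleotide at one position and the nucleotide a fixed distance away, over a set of sequences. It is not the authors' PSP-PMI encoding: the function name gapped_pair_pmi, the parameters position and gap, and the toy sequences are assumptions made for this example, and the position-specific propensity and SVM steps are not shown.

```python
import math
from collections import Counter


def gapped_pair_pmi(sequences, position, gap):
    """Estimate PMI (base 2) between the nucleotide at `position` and the
    nucleotide `gap` steps away (negative `gap` looks backward).

    Hypothetical helper for illustration only; not the published PSP-PMI.
    Sequences too short to contain both positions are skipped.
    """
    pair_counts = Counter()
    first_counts = Counter()
    second_counts = Counter()
    total = 0
    for seq in sequences:
        other = position + gap
        if 0 <= position < len(seq) and 0 <= other < len(seq):
            a, b = seq[position], seq[other]
            pair_counts[(a, b)] += 1
            first_counts[a] += 1
            second_counts[b] += 1
            total += 1
    # PMI is only defined for pairs that were actually observed.
    return {
        (a, b): math.log2(
            (count / total)
            / ((first_counts[a] / total) * (second_counts[b] / total))
        )
        for (a, b), count in pair_counts.items()
    }


# Toy data (assumed, not from the paper): A is always followed by C, G by U.
dependent = ["AC", "AC", "GU", "GU"]
# Toy data where the two positions are statistically independent.
independent = ["AC", "AU", "GC", "GU"]

pmi_dependent = gapped_pair_pmi(dependent, position=0, gap=1)
pmi_independent = gapped_pair_pmi(independent, position=0, gap=1)
```

In the dependent toy set the observed pairs get a positive PMI, while in the independent set every pair scores zero; a negative gap scans in the backward direction, which is the sense in which such a feature could be called bidirectional.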
format Online
Article
Text
id pubmed-8632114
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-8632114 2021-12-01 M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information Wang, Mingzhao Xie, Juanying Xu, Shengquan RNA Biol Research Paper Taylor & Francis 2021-06-23 /pmc/articles/PMC8632114/ /pubmed/34161188 http://dx.doi.org/10.1080/15476286.2021.1930729 Text en © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.
title M6A-BiNP: predicting N(6)-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632114/
https://www.ncbi.nlm.nih.gov/pubmed/34161188
http://dx.doi.org/10.1080/15476286.2021.1930729