Cargando…

A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern

N6-methyladenosine (m(6)A) is one of the most common and abundant modifications in RNA, which is related to many biological processes in humans. Abnormal RNA modifications are often associated with a series of diseases, including tumors, neurogenic diseases, and embryonic retardation. Therefore, ide...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Y.Y., Liu, H.J., Song, X., Ju, Y., Peng, H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6849367/
https://www.ncbi.nlm.nih.gov/pubmed/31707204
http://dx.doi.org/10.1016/j.omtn.2019.10.001
_version_ 1783469197266780160
author Zhuang, Y.Y.
Liu, H.J.
Song, X.
Ju, Y.
Peng, H.
author_facet Zhuang, Y.Y.
Liu, H.J.
Song, X.
Ju, Y.
Peng, H.
author_sort Zhuang, Y.Y.
collection PubMed
description N6-methyladenosine (m(6)A) is one of the most common and abundant modifications in RNA, which is related to many biological processes in humans. Abnormal RNA modifications are often associated with a series of diseases, including tumors, neurogenic diseases, and embryonic retardation. Therefore, identifying m(6)A sites is of paramount importance in the post-genomic age. Although many lab-based methods have been proposed to annotate m(6)A sites, they are time consuming and cost ineffective. In view of the drawbacks of the intrinsic methods in RNA sequence recognition, computational methods are suggested as a supplement to identify m(6)A sites. In this study, we develop a novel feature extraction algorithm based on the frequent gapped k-mer pattern (FGKP) and apply the linear regression to construct the prediction model. The new predictor is used to identify m(6)A sites in the Saccharomyces cerevisiae database. It has been shown by the 10-fold cross-validation that the performance is better than that of recent methods. Comparative results indicate that our model has great potential to become a useful and effective tool for genome analysis and gain more insights for locating m(6)A sites.
format Online
Article
Text
id pubmed-6849367
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-68493672019-11-15 A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern Zhuang, Y.Y. Liu, H.J. Song, X. Ju, Y. Peng, H. Mol Ther Nucleic Acids Article N6-methyladenosine (m(6)A) is one of the most common and abundant modifications in RNA, which is related to many biological processes in humans. Abnormal RNA modifications are often associated with a series of diseases, including tumors, neurogenic diseases, and embryonic retardation. Therefore, identifying m(6)A sites is of paramount importance in the post-genomic age. Although many lab-based methods have been proposed to annotate m(6)A sites, they are time consuming and cost ineffective. In view of the drawbacks of the intrinsic methods in RNA sequence recognition, computational methods are suggested as a supplement to identify m(6)A sites. In this study, we develop a novel feature extraction algorithm based on the frequent gapped k-mer pattern (FGKP) and apply the linear regression to construct the prediction model. The new predictor is used to identify m(6)A sites in the Saccharomyces cerevisiae database. It has been shown by the 10-fold cross-validation that the performance is better than that of recent methods. Comparative results indicate that our model has great potential to become a useful and effective tool for genome analysis and gain more insights for locating m(6)A sites. American Society of Gene & Cell Therapy 2019-10-10 /pmc/articles/PMC6849367/ /pubmed/31707204 http://dx.doi.org/10.1016/j.omtn.2019.10.001 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhuang, Y.Y.
Liu, H.J.
Song, X.
Ju, Y.
Peng, H.
A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title_full A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title_fullStr A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title_full_unstemmed A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title_short A Linear Regression Predictor for Identifying N(6)-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
title_sort linear regression predictor for identifying n(6)-methyladenosine sites using frequent gapped k-mer pattern
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6849367/
https://www.ncbi.nlm.nih.gov/pubmed/31707204
http://dx.doi.org/10.1016/j.omtn.2019.10.001
work_keys_str_mv AT zhuangyy alinearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT liuhj alinearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT songx alinearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT juy alinearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT pengh alinearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT zhuangyy linearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT liuhj linearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT songx linearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT juy linearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern
AT pengh linearregressionpredictorforidentifyingn6methyladenosinesitesusingfrequentgappedkmerpattern