Cargando…

lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine

Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Lei, Liu, Hui, Zhang, Lin, Meng, Jia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4593643/
https://www.ncbi.nlm.nih.gov/pubmed/26437338
http://dx.doi.org/10.1371/journal.pone.0139654
_version_ 1782393353275441152
author Sun, Lei
Liu, Hui
Zhang, Lin
Meng, Jia
author_facet Sun, Lei
Liu, Hui
Zhang, Lin
Meng, Jia
author_sort Sun, Lei
collection PubMed
description Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, which is evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study.
format Online
Article
Text
id pubmed-4593643
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45936432015-10-14 lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine Sun, Lei Liu, Hui Zhang, Lin Meng, Jia PLoS One Research Article Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, which is evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study. Public Library of Science 2015-10-05 /pmc/articles/PMC4593643/ /pubmed/26437338 http://dx.doi.org/10.1371/journal.pone.0139654 Text en © 2015 Sun et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sun, Lei
Liu, Hui
Zhang, Lin
Meng, Jia
lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title_full lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title_fullStr lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title_full_unstemmed lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title_short lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine
title_sort lncrscan-svm: a tool for predicting long non-coding rnas using support vector machine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4593643/
https://www.ncbi.nlm.nih.gov/pubmed/26437338
http://dx.doi.org/10.1371/journal.pone.0139654
work_keys_str_mv AT sunlei lncrscansvmatoolforpredictinglongnoncodingrnasusingsupportvectormachine
AT liuhui lncrscansvmatoolforpredictinglongnoncodingrnasusingsupportvectormachine
AT zhanglin lncrscansvmatoolforpredictinglongnoncodingrnasusingsupportvectormachine
AT mengjia lncrscansvmatoolforpredictinglongnoncodingrnasusingsupportvectormachine