Cargando…

LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature

As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discov...

Descripción completa

Detalles Bibliográficos
Autores principales: Pian, Cong, Zhang, Guangle, Chen, Zhi, Chen, Yuanyuan, Zhang, Jin, Yang, Tao, Zhang, Liangyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4882039/
https://www.ncbi.nlm.nih.gov/pubmed/27228152
http://dx.doi.org/10.1371/journal.pone.0154567
_version_ 1782434067184091136
author Pian, Cong
Zhang, Guangle
Chen, Zhi
Chen, Yuanyuan
Zhang, Jin
Yang, Tao
Zhang, Liangyun
author_facet Pian, Cong
Zhang, Guangle
Chen, Zhi
Chen, Yuanyuan
Zhang, Jin
Yang, Tao
Zhang, Liangyun
author_sort Pian, Cong
collection PubMed
description As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR. LncRNApred is effective for classifying lncRNAs and protein coding transcripts accurately and quickly. Moreover,our RF model only requests the training using data on human coding and non-coding transcripts. Other species can also be predicted by using LncRNApred. The result shows that our method is more effective compared with the Coding Potential Calculate (CPC). The web server of LncRNApred is available for free at http://mm20132014.wicp.net:57203/LncRNApred/home.jsp.
format Online
Article
Text
id pubmed-4882039
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48820392016-06-10 LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature Pian, Cong Zhang, Guangle Chen, Zhi Chen, Yuanyuan Zhang, Jin Yang, Tao Zhang, Liangyun PLoS One Research Article As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR. LncRNApred is effective for classifying lncRNAs and protein coding transcripts accurately and quickly. Moreover,our RF model only requests the training using data on human coding and non-coding transcripts. Other species can also be predicted by using LncRNApred. The result shows that our method is more effective compared with the Coding Potential Calculate (CPC). The web server of LncRNApred is available for free at http://mm20132014.wicp.net:57203/LncRNApred/home.jsp. Public Library of Science 2016-05-26 /pmc/articles/PMC4882039/ /pubmed/27228152 http://dx.doi.org/10.1371/journal.pone.0154567 Text en © 2016 Pian et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pian, Cong
Zhang, Guangle
Chen, Zhi
Chen, Yuanyuan
Zhang, Jin
Yang, Tao
Zhang, Liangyun
LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title_full LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title_fullStr LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title_full_unstemmed LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title_short LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
title_sort lncrnapred: classification of long non-coding rnas and protein-coding transcripts by the ensemble algorithm with a new hybrid feature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4882039/
https://www.ncbi.nlm.nih.gov/pubmed/27228152
http://dx.doi.org/10.1371/journal.pone.0154567
work_keys_str_mv AT piancong lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT zhangguangle lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT chenzhi lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT chenyuanyuan lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT zhangjin lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT yangtao lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature
AT zhangliangyun lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature