Cargando…
LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature
As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discov...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4882039/ https://www.ncbi.nlm.nih.gov/pubmed/27228152 http://dx.doi.org/10.1371/journal.pone.0154567 |
_version_ | 1782434067184091136 |
---|---|
author | Pian, Cong Zhang, Guangle Chen, Zhi Chen, Yuanyuan Zhang, Jin Yang, Tao Zhang, Liangyun |
author_facet | Pian, Cong Zhang, Guangle Chen, Zhi Chen, Yuanyuan Zhang, Jin Yang, Tao Zhang, Liangyun |
author_sort | Pian, Cong |
collection | PubMed |
description | As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR. LncRNApred is effective for classifying lncRNAs and protein coding transcripts accurately and quickly. Moreover,our RF model only requests the training using data on human coding and non-coding transcripts. Other species can also be predicted by using LncRNApred. The result shows that our method is more effective compared with the Coding Potential Calculate (CPC). The web server of LncRNApred is available for free at http://mm20132014.wicp.net:57203/LncRNApred/home.jsp. |
format | Online Article Text |
id | pubmed-4882039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-48820392016-06-10 LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature Pian, Cong Zhang, Guangle Chen, Zhi Chen, Yuanyuan Zhang, Jin Yang, Tao Zhang, Liangyun PLoS One Research Article As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR. LncRNApred is effective for classifying lncRNAs and protein coding transcripts accurately and quickly. Moreover,our RF model only requests the training using data on human coding and non-coding transcripts. Other species can also be predicted by using LncRNApred. The result shows that our method is more effective compared with the Coding Potential Calculate (CPC). The web server of LncRNApred is available for free at http://mm20132014.wicp.net:57203/LncRNApred/home.jsp. Public Library of Science 2016-05-26 /pmc/articles/PMC4882039/ /pubmed/27228152 http://dx.doi.org/10.1371/journal.pone.0154567 Text en © 2016 Pian et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Pian, Cong Zhang, Guangle Chen, Zhi Chen, Yuanyuan Zhang, Jin Yang, Tao Zhang, Liangyun LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title | LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title_full | LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title_fullStr | LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title_full_unstemmed | LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title_short | LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature |
title_sort | lncrnapred: classification of long non-coding rnas and protein-coding transcripts by the ensemble algorithm with a new hybrid feature |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4882039/ https://www.ncbi.nlm.nih.gov/pubmed/27228152 http://dx.doi.org/10.1371/journal.pone.0154567 |
work_keys_str_mv | AT piancong lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT zhangguangle lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT chenzhi lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT chenyuanyuan lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT zhangjin lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT yangtao lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature AT zhangliangyun lncrnapredclassificationoflongnoncodingrnasandproteincodingtranscriptsbytheensemblealgorithmwithanewhybridfeature |