Cargando…
LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstabl...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954391/ https://www.ncbi.nlm.nih.gov/pubmed/30084867 http://dx.doi.org/10.1093/bib/bby065 |
_version_ | 1783486787937632256 |
---|---|
author | Han, Siyu Liang, Yanchun Ma, Qin Xu, Yangyi Zhang, Yu Du, Wei Wang, Cankun Li, Ying |
author_facet | Han, Siyu Liang, Yanchun Ma, Qin Xu, Yangyi Zhang, Yu Du, Wei Wang, Cankun Li, Ying |
author_sort | Han, Siyu |
collection | PubMed |
description | Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstable performances on different species. Moreover, the majority of tools cannot be re-trained or tailored by users and neither can the features be customized or integrated to meet researchers’ requirements. In this study, features extracted from sequence-intrinsic composition, secondary structure and physicochemical property are comprehensively reviewed and evaluated. An integrated platform named LncFinder is also developed to enhance the performance and promote the research of lncRNA identification. LncFinder includes a novel lncRNA predictor using the heterologous features we designed. Experimental results show that our method outperforms several state-of-the-art tools on multiple species with more robust and satisfactory results. Researchers can additionally employ LncFinder to extract various classic features, build classifier with numerous machine learning algorithms and evaluate classifier performance effectively and efficiently. LncFinder can reveal the properties of lncRNA and mRNA from various perspectives and further inspire lncRNA–protein interaction prediction and lncRNA evolution analysis. It is anticipated that LncFinder can significantly facilitate lncRNA-related research, especially for the poorly explored species. LncFinder is released as R package (https://CRAN.R-project.org/package=LncFinder). A web server (http://bmbl.sdstate.edu/lncfinder/) is also developed to maximize its availability. |
format | Online Article Text |
id | pubmed-6954391 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-69543912020-01-16 LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property Han, Siyu Liang, Yanchun Ma, Qin Xu, Yangyi Zhang, Yu Du, Wei Wang, Cankun Li, Ying Brief Bioinform Paper Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstable performances on different species. Moreover, the majority of tools cannot be re-trained or tailored by users and neither can the features be customized or integrated to meet researchers’ requirements. In this study, features extracted from sequence-intrinsic composition, secondary structure and physicochemical property are comprehensively reviewed and evaluated. An integrated platform named LncFinder is also developed to enhance the performance and promote the research of lncRNA identification. LncFinder includes a novel lncRNA predictor using the heterologous features we designed. Experimental results show that our method outperforms several state-of-the-art tools on multiple species with more robust and satisfactory results. Researchers can additionally employ LncFinder to extract various classic features, build classifier with numerous machine learning algorithms and evaluate classifier performance effectively and efficiently. LncFinder can reveal the properties of lncRNA and mRNA from various perspectives and further inspire lncRNA–protein interaction prediction and lncRNA evolution analysis. It is anticipated that LncFinder can significantly facilitate lncRNA-related research, especially for the poorly explored species. LncFinder is released as R package (https://CRAN.R-project.org/package=LncFinder). A web server (http://bmbl.sdstate.edu/lncfinder/) is also developed to maximize its availability. Oxford University Press 2018-07-31 /pmc/articles/PMC6954391/ /pubmed/30084867 http://dx.doi.org/10.1093/bib/bby065 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Paper Han, Siyu Liang, Yanchun Ma, Qin Xu, Yangyi Zhang, Yu Du, Wei Wang, Cankun Li, Ying LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title | LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title_full | LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title_fullStr | LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title_full_unstemmed | LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title_short | LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property |
title_sort | lncfinder: an integrated platform for long non-coding rna identification utilizing sequence intrinsic composition, structural information and physicochemical property |
topic | Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954391/ https://www.ncbi.nlm.nih.gov/pubmed/30084867 http://dx.doi.org/10.1093/bib/bby065 |
work_keys_str_mv | AT hansiyu lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT liangyanchun lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT maqin lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT xuyangyi lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT zhangyu lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT duwei lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT wangcankun lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty AT liying lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty |