Cargando…

LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property

Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Siyu, Liang, Yanchun, Ma, Qin, Xu, Yangyi, Zhang, Yu, Du, Wei, Wang, Cankun, Li, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954391/
https://www.ncbi.nlm.nih.gov/pubmed/30084867
http://dx.doi.org/10.1093/bib/bby065
_version_ 1783486787937632256
author Han, Siyu
Liang, Yanchun
Ma, Qin
Xu, Yangyi
Zhang, Yu
Du, Wei
Wang, Cankun
Li, Ying
author_facet Han, Siyu
Liang, Yanchun
Ma, Qin
Xu, Yangyi
Zhang, Yu
Du, Wei
Wang, Cankun
Li, Ying
author_sort Han, Siyu
collection PubMed
description Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstable performances on different species. Moreover, the majority of tools cannot be re-trained or tailored by users and neither can the features be customized or integrated to meet researchers’ requirements. In this study, features extracted from sequence-intrinsic composition, secondary structure and physicochemical property are comprehensively reviewed and evaluated. An integrated platform named LncFinder is also developed to enhance the performance and promote the research of lncRNA identification. LncFinder includes a novel lncRNA predictor using the heterologous features we designed. Experimental results show that our method outperforms several state-of-the-art tools on multiple species with more robust and satisfactory results. Researchers can additionally employ LncFinder to extract various classic features, build classifier with numerous machine learning algorithms and evaluate classifier performance effectively and efficiently. LncFinder can reveal the properties of lncRNA and mRNA from various perspectives and further inspire lncRNA–protein interaction prediction and lncRNA evolution analysis. It is anticipated that LncFinder can significantly facilitate lncRNA-related research, especially for the poorly explored species. LncFinder is released as R package (https://CRAN.R-project.org/package=LncFinder). A web server (http://bmbl.sdstate.edu/lncfinder/) is also developed to maximize its availability.
format Online
Article
Text
id pubmed-6954391
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69543912020-01-16 LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property Han, Siyu Liang, Yanchun Ma, Qin Xu, Yangyi Zhang, Yu Du, Wei Wang, Cankun Li, Ying Brief Bioinform Paper Discovering new long non-coding RNAs (lncRNAs) has been a fundamental step in lncRNA-related research. Nowadays, many machine learning-based tools have been developed for lncRNA identification. However, many methods predict lncRNAs using sequence-derived features alone, which tend to display unstable performances on different species. Moreover, the majority of tools cannot be re-trained or tailored by users and neither can the features be customized or integrated to meet researchers’ requirements. In this study, features extracted from sequence-intrinsic composition, secondary structure and physicochemical property are comprehensively reviewed and evaluated. An integrated platform named LncFinder is also developed to enhance the performance and promote the research of lncRNA identification. LncFinder includes a novel lncRNA predictor using the heterologous features we designed. Experimental results show that our method outperforms several state-of-the-art tools on multiple species with more robust and satisfactory results. Researchers can additionally employ LncFinder to extract various classic features, build classifier with numerous machine learning algorithms and evaluate classifier performance effectively and efficiently. LncFinder can reveal the properties of lncRNA and mRNA from various perspectives and further inspire lncRNA–protein interaction prediction and lncRNA evolution analysis. It is anticipated that LncFinder can significantly facilitate lncRNA-related research, especially for the poorly explored species. LncFinder is released as R package (https://CRAN.R-project.org/package=LncFinder). A web server (http://bmbl.sdstate.edu/lncfinder/) is also developed to maximize its availability. Oxford University Press 2018-07-31 /pmc/articles/PMC6954391/ /pubmed/30084867 http://dx.doi.org/10.1093/bib/bby065 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Paper
Han, Siyu
Liang, Yanchun
Ma, Qin
Xu, Yangyi
Zhang, Yu
Du, Wei
Wang, Cankun
Li, Ying
LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title_full LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title_fullStr LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title_full_unstemmed LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title_short LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property
title_sort lncfinder: an integrated platform for long non-coding rna identification utilizing sequence intrinsic composition, structural information and physicochemical property
topic Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954391/
https://www.ncbi.nlm.nih.gov/pubmed/30084867
http://dx.doi.org/10.1093/bib/bby065
work_keys_str_mv AT hansiyu lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT liangyanchun lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT maqin lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT xuyangyi lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT zhangyu lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT duwei lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT wangcankun lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty
AT liying lncfinderanintegratedplatformforlongnoncodingrnaidentificationutilizingsequenceintrinsiccompositionstructuralinformationandphysicochemicalproperty