Cargando…

Automatically clustering large-scale miRNA sequences: methods and experiments

BACKGROUND: Since the initial annotation of microRNAs (miRNAs) in 2001, many studies have sought to identify additional miRNAs experimentally or computationally in various species. MiRNAs act with the Argonaut family of proteins to regulate target messenger RNAs (mRNAs) post-transcriptionally. Curre...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Linxia, Ding, Jiandong, Jin, Ting, Guan, Jihong, Zhou, Shuigeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535721/
https://www.ncbi.nlm.nih.gov/pubmed/23282099
http://dx.doi.org/10.1186/1471-2164-13-S8-S15
_version_ 1782254704972005376
author Wan, Linxia
Ding, Jiandong
Jin, Ting
Guan, Jihong
Zhou, Shuigeng
author_facet Wan, Linxia
Ding, Jiandong
Jin, Ting
Guan, Jihong
Zhou, Shuigeng
author_sort Wan, Linxia
collection PubMed
description BACKGROUND: Since the initial annotation of microRNAs (miRNAs) in 2001, many studies have sought to identify additional miRNAs experimentally or computationally in various species. MiRNAs act with the Argonaut family of proteins to regulate target messenger RNAs (mRNAs) post-transcriptionally. Currently, researches mainly focus on single miRNA function study. Considering that members in the same miRNA family might participate in the same pathway or regulate the same target(s) and thus share similar biological functions, people can explore useful knowledge from high quality miRNA family architecture. RESULTS: In this article, we developed an unsupervised clustering-based method miRCluster to automatically group miRNAs. In order to evaluate this method, several data sets were constructed from the online database miRBase. Results showed that miRCluster can efficiently arrange miRNAs (e.g identify 354 families in miRBase16 with an accuracy of 92.08%, and can recognize 9 of all 10 newly-added families in miRBase 17). By far, ~30% mature miRNAs registered in miRBase are unclassified. With miRCluster, over 85% unclassified miRNAs can be assigned to certain families, while ~44% of these miRNAs distributed in ~300novel families. CONCLUSIONS: In short, miRCluster is an automatic and efficient miRNA family identification method, which does not require any prior knowledge. It can be helpful in real use, especially when exploring functions of novel miRNAs. All relevant materials could be freely accessed online (http://admis.fudan.edu.cn/projects/miRCluster).
format Online
Article
Text
id pubmed-3535721
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35357212013-01-04 Automatically clustering large-scale miRNA sequences: methods and experiments Wan, Linxia Ding, Jiandong Jin, Ting Guan, Jihong Zhou, Shuigeng BMC Genomics Research BACKGROUND: Since the initial annotation of microRNAs (miRNAs) in 2001, many studies have sought to identify additional miRNAs experimentally or computationally in various species. MiRNAs act with the Argonaut family of proteins to regulate target messenger RNAs (mRNAs) post-transcriptionally. Currently, researches mainly focus on single miRNA function study. Considering that members in the same miRNA family might participate in the same pathway or regulate the same target(s) and thus share similar biological functions, people can explore useful knowledge from high quality miRNA family architecture. RESULTS: In this article, we developed an unsupervised clustering-based method miRCluster to automatically group miRNAs. In order to evaluate this method, several data sets were constructed from the online database miRBase. Results showed that miRCluster can efficiently arrange miRNAs (e.g identify 354 families in miRBase16 with an accuracy of 92.08%, and can recognize 9 of all 10 newly-added families in miRBase 17). By far, ~30% mature miRNAs registered in miRBase are unclassified. With miRCluster, over 85% unclassified miRNAs can be assigned to certain families, while ~44% of these miRNAs distributed in ~300novel families. CONCLUSIONS: In short, miRCluster is an automatic and efficient miRNA family identification method, which does not require any prior knowledge. It can be helpful in real use, especially when exploring functions of novel miRNAs. All relevant materials could be freely accessed online (http://admis.fudan.edu.cn/projects/miRCluster). BioMed Central 2012-12-17 /pmc/articles/PMC3535721/ /pubmed/23282099 http://dx.doi.org/10.1186/1471-2164-13-S8-S15 Text en Copyright ©2012 Wan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wan, Linxia
Ding, Jiandong
Jin, Ting
Guan, Jihong
Zhou, Shuigeng
Automatically clustering large-scale miRNA sequences: methods and experiments
title Automatically clustering large-scale miRNA sequences: methods and experiments
title_full Automatically clustering large-scale miRNA sequences: methods and experiments
title_fullStr Automatically clustering large-scale miRNA sequences: methods and experiments
title_full_unstemmed Automatically clustering large-scale miRNA sequences: methods and experiments
title_short Automatically clustering large-scale miRNA sequences: methods and experiments
title_sort automatically clustering large-scale mirna sequences: methods and experiments
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535721/
https://www.ncbi.nlm.nih.gov/pubmed/23282099
http://dx.doi.org/10.1186/1471-2164-13-S8-S15
work_keys_str_mv AT wanlinxia automaticallyclusteringlargescalemirnasequencesmethodsandexperiments
AT dingjiandong automaticallyclusteringlargescalemirnasequencesmethodsandexperiments
AT jinting automaticallyclusteringlargescalemirnasequencesmethodsandexperiments
AT guanjihong automaticallyclusteringlargescalemirnasequencesmethodsandexperiments
AT zhoushuigeng automaticallyclusteringlargescalemirnasequencesmethodsandexperiments