
The Sequence Structures of Human MicroRNA Molecules and Their Implications

The count of the nucleotides in a cloned, short genomic sequence has become an important criterion to annotate such a sequence as a miRNA molecule. While the majority of human mature miRNA sequences consist of 22 nucleotides, there exists discrepancy in the characteristic lengths of the miRNA sequen...


Bibliographic Details
Main Authors: Fang, Zhide; Du, Ruofei; Edwards, Andrea; Flemington, Erik K.; Zhang, Kun
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548844/
https://www.ncbi.nlm.nih.gov/pubmed/23349828
http://dx.doi.org/10.1371/journal.pone.0054215
author Fang, Zhide
Du, Ruofei
Edwards, Andrea
Flemington, Erik K.
Zhang, Kun
collection PubMed
description The count of the nucleotides in a cloned, short genomic sequence has become an important criterion to annotate such a sequence as a miRNA molecule. While the majority of human mature miRNA sequences consist of 22 nucleotides, there exists discrepancy in the characteristic lengths of the miRNA sequences. There is also a lack of systematic studies on such length distribution and on the biological factors that are related to or may affect this length. In this paper, we intend to fill this gap by investigating the sequence structure of human miRNA molecules using statistics tools. We demonstrate that the traditional discrete probability distributions do not model the length distribution of the human mature miRNAs well, and we obtain the statistical distribution model with a decent fit. We observe that the four nucleotide bases in a miRNA sequence are not randomly distributed, implying that possible structural patterns such as dinucleotide (trinucleotide or higher order) may exist. Furthermore, we study the relationships of this length distribution to multiple important factors such as evolutionary conservation, tumorigenesis, the length of precursor loop structures, and the number of predicted targets. The association between the miRNA sequence length and the distributions of target site counts in corresponding predicted genes is also presented. This study results in several novel findings worthy of further investigation that include: (1) rapid evolution introduces variation to the miRNA sequence length distribution; (2) miRNAs with extreme sequence lengths are unlikely to be cancer-related; and (3) the miRNA sequence length is positively correlated to the precursor length and the number of predicted target genes.
format Online
Article
Text
id pubmed-3548844
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3548844 2013-01-24 PLoS One Research Article
Public Library of Science 2013-01-18 /pmc/articles/PMC3548844/ /pubmed/23349828 http://dx.doi.org/10.1371/journal.pone.0054215 Text en © 2013 Fang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title The Sequence Structures of Human MicroRNA Molecules and Their Implications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548844/
https://www.ncbi.nlm.nih.gov/pubmed/23349828
http://dx.doi.org/10.1371/journal.pone.0054215