Cargando…
Computational identification of Drosophila microRNA genes
BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA gen...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2003
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC193629/ https://www.ncbi.nlm.nih.gov/pubmed/12844358 http://dx.doi.org/10.1186/gb-2003-4-7-r42 |
_version_ | 1782120902388875264 |
---|---|
author | Lai, Eric C Tomancak, Pavel Williams, Robert W Rubin, Gerald M |
author_facet | Lai, Eric C Tomancak, Pavel Williams, Robert W Rubin, Gerald M |
author_sort | Lai, Eric C |
collection | PubMed |
description | BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA genes might be identified using a computational approach. First, miRNAs generally derive from precursor transcripts of 70-100 nucleotides with extended stem-loop structure. Second, miRNAs are usually highly conserved between the genomes of related species. Third, miRNAs display a characteristic pattern of evolutionary divergence. RESULTS: We developed an informatic procedure called 'miRseeker', which analyzed the completed euchromatic sequences of Drosophila melanogaster and D. pseudoobscura for conserved sequences that adopt an extended stem-loop structure and display a pattern of nucleotide divergence characteristic of known miRNAs. The sensitivity of this computational procedure was demonstrated by the presence of 75% (18/24) of previously identified Drosophila miRNAs within the top 124 candidates. In total, we identified 48 novel miRNA candidates that were strongly conserved in more distant insect, nematode, or vertebrate genomes. We verified expression for a total of 24 novel miRNA genes, including 20 of 27 candidates conserved in a third species and 4 of 11 high-scoring, Drosophila-specific candidates. Our analyses lead us to estimate that drosophilid genomes contain around 110 miRNA genes. CONCLUSIONS: Our computational strategy succeeded in identifying bona fide miRNA genes and suggests that miRNAs constitute nearly 1% of predicted protein-coding genes in Drosophila, a percentage similar to the percentage of miRNAs recently attributed to other metazoan genomes. |
format | Text |
id | pubmed-193629 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1936292003-09-11 Computational identification of Drosophila microRNA genes Lai, Eric C Tomancak, Pavel Williams, Robert W Rubin, Gerald M Genome Biol Research BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA genes might be identified using a computational approach. First, miRNAs generally derive from precursor transcripts of 70-100 nucleotides with extended stem-loop structure. Second, miRNAs are usually highly conserved between the genomes of related species. Third, miRNAs display a characteristic pattern of evolutionary divergence. RESULTS: We developed an informatic procedure called 'miRseeker', which analyzed the completed euchromatic sequences of Drosophila melanogaster and D. pseudoobscura for conserved sequences that adopt an extended stem-loop structure and display a pattern of nucleotide divergence characteristic of known miRNAs. The sensitivity of this computational procedure was demonstrated by the presence of 75% (18/24) of previously identified Drosophila miRNAs within the top 124 candidates. In total, we identified 48 novel miRNA candidates that were strongly conserved in more distant insect, nematode, or vertebrate genomes. We verified expression for a total of 24 novel miRNA genes, including 20 of 27 candidates conserved in a third species and 4 of 11 high-scoring, Drosophila-specific candidates. Our analyses lead us to estimate that drosophilid genomes contain around 110 miRNA genes. CONCLUSIONS: Our computational strategy succeeded in identifying bona fide miRNA genes and suggests that miRNAs constitute nearly 1% of predicted protein-coding genes in Drosophila, a percentage similar to the percentage of miRNAs recently attributed to other metazoan genomes. BioMed Central 2003 2003-06-30 /pmc/articles/PMC193629/ /pubmed/12844358 http://dx.doi.org/10.1186/gb-2003-4-7-r42 Text en Copyright © 2003 Lai et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Research Lai, Eric C Tomancak, Pavel Williams, Robert W Rubin, Gerald M Computational identification of Drosophila microRNA genes |
title | Computational identification of Drosophila microRNA genes |
title_full | Computational identification of Drosophila microRNA genes |
title_fullStr | Computational identification of Drosophila microRNA genes |
title_full_unstemmed | Computational identification of Drosophila microRNA genes |
title_short | Computational identification of Drosophila microRNA genes |
title_sort | computational identification of drosophila microrna genes |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC193629/ https://www.ncbi.nlm.nih.gov/pubmed/12844358 http://dx.doi.org/10.1186/gb-2003-4-7-r42 |
work_keys_str_mv | AT laiericc computationalidentificationofdrosophilamicrornagenes AT tomancakpavel computationalidentificationofdrosophilamicrornagenes AT williamsrobertw computationalidentificationofdrosophilamicrornagenes AT rubingeraldm computationalidentificationofdrosophilamicrornagenes |