Cargando…

Computational identification of Drosophila microRNA genes

BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Eric C, Tomancak, Pavel, Williams, Robert W, Rubin, Gerald M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC193629/
https://www.ncbi.nlm.nih.gov/pubmed/12844358
http://dx.doi.org/10.1186/gb-2003-4-7-r42
_version_ 1782120902388875264
author Lai, Eric C
Tomancak, Pavel
Williams, Robert W
Rubin, Gerald M
author_facet Lai, Eric C
Tomancak, Pavel
Williams, Robert W
Rubin, Gerald M
author_sort Lai, Eric C
collection PubMed
description BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA genes might be identified using a computational approach. First, miRNAs generally derive from precursor transcripts of 70-100 nucleotides with extended stem-loop structure. Second, miRNAs are usually highly conserved between the genomes of related species. Third, miRNAs display a characteristic pattern of evolutionary divergence. RESULTS: We developed an informatic procedure called 'miRseeker', which analyzed the completed euchromatic sequences of Drosophila melanogaster and D. pseudoobscura for conserved sequences that adopt an extended stem-loop structure and display a pattern of nucleotide divergence characteristic of known miRNAs. The sensitivity of this computational procedure was demonstrated by the presence of 75% (18/24) of previously identified Drosophila miRNAs within the top 124 candidates. In total, we identified 48 novel miRNA candidates that were strongly conserved in more distant insect, nematode, or vertebrate genomes. We verified expression for a total of 24 novel miRNA genes, including 20 of 27 candidates conserved in a third species and 4 of 11 high-scoring, Drosophila-specific candidates. Our analyses lead us to estimate that drosophilid genomes contain around 110 miRNA genes. CONCLUSIONS: Our computational strategy succeeded in identifying bona fide miRNA genes and suggests that miRNAs constitute nearly 1% of predicted protein-coding genes in Drosophila, a percentage similar to the percentage of miRNAs recently attributed to other metazoan genomes.
format Text
id pubmed-193629
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1936292003-09-11 Computational identification of Drosophila microRNA genes Lai, Eric C Tomancak, Pavel Williams, Robert W Rubin, Gerald M Genome Biol Research BACKGROUND: MicroRNAs (miRNAs) are a large family of 21-22 nucleotide non-coding RNAs with presumed post-transcriptional regulatory activity. Most miRNAs were identified by direct cloning of small RNAs, an approach that favors detection of abundant miRNAs. Three observations suggested that miRNA genes might be identified using a computational approach. First, miRNAs generally derive from precursor transcripts of 70-100 nucleotides with extended stem-loop structure. Second, miRNAs are usually highly conserved between the genomes of related species. Third, miRNAs display a characteristic pattern of evolutionary divergence. RESULTS: We developed an informatic procedure called 'miRseeker', which analyzed the completed euchromatic sequences of Drosophila melanogaster and D. pseudoobscura for conserved sequences that adopt an extended stem-loop structure and display a pattern of nucleotide divergence characteristic of known miRNAs. The sensitivity of this computational procedure was demonstrated by the presence of 75% (18/24) of previously identified Drosophila miRNAs within the top 124 candidates. In total, we identified 48 novel miRNA candidates that were strongly conserved in more distant insect, nematode, or vertebrate genomes. We verified expression for a total of 24 novel miRNA genes, including 20 of 27 candidates conserved in a third species and 4 of 11 high-scoring, Drosophila-specific candidates. Our analyses lead us to estimate that drosophilid genomes contain around 110 miRNA genes. CONCLUSIONS: Our computational strategy succeeded in identifying bona fide miRNA genes and suggests that miRNAs constitute nearly 1% of predicted protein-coding genes in Drosophila, a percentage similar to the percentage of miRNAs recently attributed to other metazoan genomes. BioMed Central 2003 2003-06-30 /pmc/articles/PMC193629/ /pubmed/12844358 http://dx.doi.org/10.1186/gb-2003-4-7-r42 Text en Copyright © 2003 Lai et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research
Lai, Eric C
Tomancak, Pavel
Williams, Robert W
Rubin, Gerald M
Computational identification of Drosophila microRNA genes
title Computational identification of Drosophila microRNA genes
title_full Computational identification of Drosophila microRNA genes
title_fullStr Computational identification of Drosophila microRNA genes
title_full_unstemmed Computational identification of Drosophila microRNA genes
title_short Computational identification of Drosophila microRNA genes
title_sort computational identification of drosophila microrna genes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC193629/
https://www.ncbi.nlm.nih.gov/pubmed/12844358
http://dx.doi.org/10.1186/gb-2003-4-7-r42
work_keys_str_mv AT laiericc computationalidentificationofdrosophilamicrornagenes
AT tomancakpavel computationalidentificationofdrosophilamicrornagenes
AT williamsrobertw computationalidentificationofdrosophilamicrornagenes
AT rubingeraldm computationalidentificationofdrosophilamicrornagenes