Cargando…

Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila

We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction perform...

Descripción completa

Detalles Bibliográficos
Autores principales: Bradley, Robert K., Uzilov, Andrew V., Skinner, Mitchell E., Bendaña, Yuri R., Barquist, Lars, Holmes, Ian
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721679/
https://www.ncbi.nlm.nih.gov/pubmed/19668382
http://dx.doi.org/10.1371/journal.pone.0006478
_version_ 1782170232314396672
author Bradley, Robert K.
Uzilov, Andrew V.
Skinner, Mitchell E.
Bendaña, Yuri R.
Barquist, Lars
Holmes, Ian
author_facet Bradley, Robert K.
Uzilov, Andrew V.
Skinner, Mitchell E.
Bendaña, Yuri R.
Barquist, Lars
Holmes, Ian
author_sort Bradley, Robert K.
collection PubMed
description We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3′ end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA.
format Text
id pubmed-2721679
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27216792009-08-11 Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila Bradley, Robert K. Uzilov, Andrew V. Skinner, Mitchell E. Bendaña, Yuri R. Barquist, Lars Holmes, Ian PLoS One Research Article We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3′ end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA. Public Library of Science 2009-08-11 /pmc/articles/PMC2721679/ /pubmed/19668382 http://dx.doi.org/10.1371/journal.pone.0006478 Text en Bradley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bradley, Robert K.
Uzilov, Andrew V.
Skinner, Mitchell E.
Bendaña, Yuri R.
Barquist, Lars
Holmes, Ian
Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title_full Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title_fullStr Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title_full_unstemmed Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title_short Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
title_sort evolutionary modeling and prediction of non-coding rnas in drosophila
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721679/
https://www.ncbi.nlm.nih.gov/pubmed/19668382
http://dx.doi.org/10.1371/journal.pone.0006478
work_keys_str_mv AT bradleyrobertk evolutionarymodelingandpredictionofnoncodingrnasindrosophila
AT uzilovandrewv evolutionarymodelingandpredictionofnoncodingrnasindrosophila
AT skinnermitchelle evolutionarymodelingandpredictionofnoncodingrnasindrosophila
AT bendanayurir evolutionarymodelingandpredictionofnoncodingrnasindrosophila
AT barquistlars evolutionarymodelingandpredictionofnoncodingrnasindrosophila
AT holmesian evolutionarymodelingandpredictionofnoncodingrnasindrosophila