Cargando…
Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction perform...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721679/ https://www.ncbi.nlm.nih.gov/pubmed/19668382 http://dx.doi.org/10.1371/journal.pone.0006478 |
_version_ | 1782170232314396672 |
---|---|
author | Bradley, Robert K. Uzilov, Andrew V. Skinner, Mitchell E. Bendaña, Yuri R. Barquist, Lars Holmes, Ian |
author_facet | Bradley, Robert K. Uzilov, Andrew V. Skinner, Mitchell E. Bendaña, Yuri R. Barquist, Lars Holmes, Ian |
author_sort | Bradley, Robert K. |
collection | PubMed |
description | We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3′ end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA. |
format | Text |
id | pubmed-2721679 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-27216792009-08-11 Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila Bradley, Robert K. Uzilov, Andrew V. Skinner, Mitchell E. Bendaña, Yuri R. Barquist, Lars Holmes, Ian PLoS One Research Article We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3′ end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA. Public Library of Science 2009-08-11 /pmc/articles/PMC2721679/ /pubmed/19668382 http://dx.doi.org/10.1371/journal.pone.0006478 Text en Bradley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Bradley, Robert K. Uzilov, Andrew V. Skinner, Mitchell E. Bendaña, Yuri R. Barquist, Lars Holmes, Ian Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila |
title | Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
|
title_full | Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
|
title_fullStr | Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
|
title_full_unstemmed | Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
|
title_short | Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila
|
title_sort | evolutionary modeling and prediction of non-coding rnas in drosophila |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721679/ https://www.ncbi.nlm.nih.gov/pubmed/19668382 http://dx.doi.org/10.1371/journal.pone.0006478 |
work_keys_str_mv | AT bradleyrobertk evolutionarymodelingandpredictionofnoncodingrnasindrosophila AT uzilovandrewv evolutionarymodelingandpredictionofnoncodingrnasindrosophila AT skinnermitchelle evolutionarymodelingandpredictionofnoncodingrnasindrosophila AT bendanayurir evolutionarymodelingandpredictionofnoncodingrnasindrosophila AT barquistlars evolutionarymodelingandpredictionofnoncodingrnasindrosophila AT holmesian evolutionarymodelingandpredictionofnoncodingrnasindrosophila |