Cargando…
Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes
BACKGROUND: Detection of non coding RNA (ncRNA) molecules is a major bioinformatics challenge. This challenge is particularly difficult when attempting to detect H/ACA molecules which are involved in converting uridine to pseudouridine on rRNA in trypanosomes, because these organisms have unique H/A...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613932/ https://www.ncbi.nlm.nih.gov/pubmed/18986541 http://dx.doi.org/10.1186/1471-2105-9-471 |
_version_ | 1782163214957543424 |
---|---|
author | Myslyuk, Inna Doniger, Tirza Horesh, Yair Hury, Avraham Hoffer, Ran Ziporen, Yaara Michaeli, Shulamit Unger, Ron |
author_facet | Myslyuk, Inna Doniger, Tirza Horesh, Yair Hury, Avraham Hoffer, Ran Ziporen, Yaara Michaeli, Shulamit Unger, Ron |
author_sort | Myslyuk, Inna |
collection | PubMed |
description | BACKGROUND: Detection of non coding RNA (ncRNA) molecules is a major bioinformatics challenge. This challenge is particularly difficult when attempting to detect H/ACA molecules which are involved in converting uridine to pseudouridine on rRNA in trypanosomes, because these organisms have unique H/ACA molecules (termed H/ACA-like) that lack several of the features that characterize H/ACA molecules in most other organisms. RESULTS: We present here a computational tool called Psiscan, which was designed to detect H/ACA-like molecules in trypanosomes. We started by analyzing known H/ACA-like molecules and characterized their crucial elements both computationally and experimentally. Next, we set up constraints based on this analysis and additional phylogenic and functional data to rapidly scan three trypanosome genomes (T. brucei, T. cruzi and L. major) for sequences that observe these constraints and are conserved among the species. In the next step, we used minimal energy calculation to select the molecules that are predicted to fold into a lowest energy structure that is consistent with the constraints. In the final computational step, we used a Support Vector Machine that was trained on known H/ACA-like molecules as positive examples and on negative examples of molecules that were identified by the computational analyses but were shown experimentally not to be H/ACA-like molecules. The leading candidate molecules predicted by the SVM model were then subjected to experimental validation. CONCLUSION: The experimental validation showed 11 molecules to be expressed (4 out of 25 in the intermediate stage and 7 out of 19 in the final validation after the machine learning stage). Five of these 11 molecules were further shown to be bona fide H/ACA-like molecules. As snoRNA in trypanosomes are organized in clusters, the new H/ACA-like molecules could be used as starting points to manually search for additional molecules in their neighbourhood. All together this study increased our repertoire by fourteen H/ACA-like and six C/D snoRNAs molecules from T. brucei and L. Major. In addition the experimental analysis revealed that six ncRNA molecules that are expressed are not downregulated in CBF5 silenced cells, suggesting that they have structural features of H/ACA-like molecules but do not have their standard function. We termed this novel class of molecules AGA-like, and we are exploring their function. This study demonstrates the power of tight collaboration between computational and experimental approaches in a combined effort to reveal the repertoire of ncRNA molecles. |
format | Text |
id | pubmed-2613932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26139322009-01-12 Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes Myslyuk, Inna Doniger, Tirza Horesh, Yair Hury, Avraham Hoffer, Ran Ziporen, Yaara Michaeli, Shulamit Unger, Ron BMC Bioinformatics Research Article BACKGROUND: Detection of non coding RNA (ncRNA) molecules is a major bioinformatics challenge. This challenge is particularly difficult when attempting to detect H/ACA molecules which are involved in converting uridine to pseudouridine on rRNA in trypanosomes, because these organisms have unique H/ACA molecules (termed H/ACA-like) that lack several of the features that characterize H/ACA molecules in most other organisms. RESULTS: We present here a computational tool called Psiscan, which was designed to detect H/ACA-like molecules in trypanosomes. We started by analyzing known H/ACA-like molecules and characterized their crucial elements both computationally and experimentally. Next, we set up constraints based on this analysis and additional phylogenic and functional data to rapidly scan three trypanosome genomes (T. brucei, T. cruzi and L. major) for sequences that observe these constraints and are conserved among the species. In the next step, we used minimal energy calculation to select the molecules that are predicted to fold into a lowest energy structure that is consistent with the constraints. In the final computational step, we used a Support Vector Machine that was trained on known H/ACA-like molecules as positive examples and on negative examples of molecules that were identified by the computational analyses but were shown experimentally not to be H/ACA-like molecules. The leading candidate molecules predicted by the SVM model were then subjected to experimental validation. CONCLUSION: The experimental validation showed 11 molecules to be expressed (4 out of 25 in the intermediate stage and 7 out of 19 in the final validation after the machine learning stage). Five of these 11 molecules were further shown to be bona fide H/ACA-like molecules. As snoRNA in trypanosomes are organized in clusters, the new H/ACA-like molecules could be used as starting points to manually search for additional molecules in their neighbourhood. All together this study increased our repertoire by fourteen H/ACA-like and six C/D snoRNAs molecules from T. brucei and L. Major. In addition the experimental analysis revealed that six ncRNA molecules that are expressed are not downregulated in CBF5 silenced cells, suggesting that they have structural features of H/ACA-like molecules but do not have their standard function. We termed this novel class of molecules AGA-like, and we are exploring their function. This study demonstrates the power of tight collaboration between computational and experimental approaches in a combined effort to reveal the repertoire of ncRNA molecles. BioMed Central 2008-11-05 /pmc/articles/PMC2613932/ /pubmed/18986541 http://dx.doi.org/10.1186/1471-2105-9-471 Text en Copyright © 2008 Myslyuk et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Myslyuk, Inna Doniger, Tirza Horesh, Yair Hury, Avraham Hoffer, Ran Ziporen, Yaara Michaeli, Shulamit Unger, Ron Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title | Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title_full | Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title_fullStr | Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title_full_unstemmed | Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title_short | Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes |
title_sort | psiscan: a computational approach to identify h/aca-like and aga-like non-coding rna in trypanosomatid genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613932/ https://www.ncbi.nlm.nih.gov/pubmed/18986541 http://dx.doi.org/10.1186/1471-2105-9-471 |
work_keys_str_mv | AT myslyukinna psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT donigertirza psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT horeshyair psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT huryavraham psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT hofferran psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT ziporenyaara psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT michaelishulamit psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes AT ungerron psiscanacomputationalapproachtoidentifyhacalikeandagalikenoncodingrnaintrypanosomatidgenomes |