Cargando…

Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches

Motivation: The computational search for novel microRNA (miRNA) precursors often involves some sort of structural analysis with the aim of identifying which type of structures are prone to being recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendes, Nuno D., Heyne, Steffen, Freitas, Ana T., Sagot, Marie-France, Backofen, Rolf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516144/
https://www.ncbi.nlm.nih.gov/pubmed/23052038
http://dx.doi.org/10.1093/bioinformatics/bts574
_version_ 1782252282564313088
author Mendes, Nuno D.
Heyne, Steffen
Freitas, Ana T.
Sagot, Marie-France
Backofen, Rolf
author_facet Mendes, Nuno D.
Heyne, Steffen
Freitas, Ana T.
Sagot, Marie-France
Backofen, Rolf
author_sort Mendes, Nuno D.
collection PubMed
description Motivation: The computational search for novel microRNA (miRNA) precursors often involves some sort of structural analysis with the aim of identifying which type of structures are prone to being recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem is to perform clustering over the candidate structures along with known miRNA precursor structures. Mixed clusters allow then the identification of candidates that are similar to known precursors. Given the large number of pre-miRNA candidates that can be identified in single-genome approaches, even after applying several filters for precursor robustness and stability, a conventional structural clustering approach is unfeasible. Results: We propose a method to represent candidate structures in a feature space, which summarizes key sequence/structure characteristics of each candidate. We demonstrate that proximity in this feature space is related to sequence/structure similarity, and we select candidates that have a high similarity to known precursors. Additional filtering steps are then applied to further reduce the number of candidates to those with greater transcriptional potential. Our method is compared with another single-genome method (TripletSVM) in two datasets, showing better performance in one and comparable performance in the other, for larger training sets. Additionally, we show that our approach allows for a better interpretation of the results. Availability and Implementation: The MinDist method is implemented using Perl scripts and is freely available at http://www.cravela.org/?mindist=1. Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3516144
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35161442012-12-12 Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches Mendes, Nuno D. Heyne, Steffen Freitas, Ana T. Sagot, Marie-France Backofen, Rolf Bioinformatics Original Papers Motivation: The computational search for novel microRNA (miRNA) precursors often involves some sort of structural analysis with the aim of identifying which type of structures are prone to being recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem is to perform clustering over the candidate structures along with known miRNA precursor structures. Mixed clusters allow then the identification of candidates that are similar to known precursors. Given the large number of pre-miRNA candidates that can be identified in single-genome approaches, even after applying several filters for precursor robustness and stability, a conventional structural clustering approach is unfeasible. Results: We propose a method to represent candidate structures in a feature space, which summarizes key sequence/structure characteristics of each candidate. We demonstrate that proximity in this feature space is related to sequence/structure similarity, and we select candidates that have a high similarity to known precursors. Additional filtering steps are then applied to further reduce the number of candidates to those with greater transcriptional potential. Our method is compared with another single-genome method (TripletSVM) in two datasets, showing better performance in one and comparable performance in the other, for larger training sets. Additionally, we show that our approach allows for a better interpretation of the results. Availability and Implementation: The MinDist method is implemented using Perl scripts and is freely available at http://www.cravela.org/?mindist=1. Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-12-01 2012-10-10 /pmc/articles/PMC3516144/ /pubmed/23052038 http://dx.doi.org/10.1093/bioinformatics/bts574 Text en © The Author 2012. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Original Papers
Mendes, Nuno D.
Heyne, Steffen
Freitas, Ana T.
Sagot, Marie-France
Backofen, Rolf
Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title_full Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title_fullStr Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title_full_unstemmed Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title_short Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
title_sort navigating the unexplored seascape of pre-mirna candidates in single-genome approaches
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516144/
https://www.ncbi.nlm.nih.gov/pubmed/23052038
http://dx.doi.org/10.1093/bioinformatics/bts574
work_keys_str_mv AT mendesnunod navigatingtheunexploredseascapeofpremirnacandidatesinsinglegenomeapproaches
AT heynesteffen navigatingtheunexploredseascapeofpremirnacandidatesinsinglegenomeapproaches
AT freitasanat navigatingtheunexploredseascapeofpremirnacandidatesinsinglegenomeapproaches
AT sagotmariefrance navigatingtheunexploredseascapeofpremirnacandidatesinsinglegenomeapproaches
AT backofenrolf navigatingtheunexploredseascapeofpremirnacandidatesinsinglegenomeapproaches