Cargando…

miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure

In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure,...

Descripción completa

Detalles Bibliográficos
Autores principales: Van Peer, Gert, De Paepe, Ayla, Stock, Michiel, Anckaert, Jasper, Volders, Pieter-Jan, Vandesompele, Jo, De Baets, Bernard, Waegeman, Willem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397177/
https://www.ncbi.nlm.nih.gov/pubmed/27986855
http://dx.doi.org/10.1093/nar/gkw1260
_version_ 1783230218842931200
author Van Peer, Gert
De Paepe, Ayla
Stock, Michiel
Anckaert, Jasper
Volders, Pieter-Jan
Vandesompele, Jo
De Baets, Bernard
Waegeman, Willem
author_facet Van Peer, Gert
De Paepe, Ayla
Stock, Michiel
Anckaert, Jasper
Volders, Pieter-Jan
Vandesompele, Jo
De Baets, Bernard
Waegeman, Willem
author_sort Van Peer, Gert
collection PubMed
description In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure, consisting of feature vectors of unequal length as a consequence of the varying number of miRNA binding sites in different mRNAs. To circumvent this problem, we developed a two-layered, stacked model, in which the influence of binding site context is separately modeled. Using logistic regression and random forests, we applied the stacked model approach to a unique data set of 7990 probed miRNA–mRNA interactions, hereby including the largest number of miRNAs in model training to date. Compared to lower-complexity models, a particular stacked model, named miSTAR (miRNA stacked model target prediction; www.mi-star.org), displays a higher general performance and precision on top scoring predictions. More importantly, our model outperforms published and widely used miRNA target prediction algorithms. Finally, we highlight flaws in cross-validation schemes for evaluation of miRNA target prediction models and adopt a more fair and stringent approach.
format Online
Article
Text
id pubmed-5397177
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-53971772017-04-24 miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure Van Peer, Gert De Paepe, Ayla Stock, Michiel Anckaert, Jasper Volders, Pieter-Jan Vandesompele, Jo De Baets, Bernard Waegeman, Willem Nucleic Acids Res Methods Online In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure, consisting of feature vectors of unequal length as a consequence of the varying number of miRNA binding sites in different mRNAs. To circumvent this problem, we developed a two-layered, stacked model, in which the influence of binding site context is separately modeled. Using logistic regression and random forests, we applied the stacked model approach to a unique data set of 7990 probed miRNA–mRNA interactions, hereby including the largest number of miRNAs in model training to date. Compared to lower-complexity models, a particular stacked model, named miSTAR (miRNA stacked model target prediction; www.mi-star.org), displays a higher general performance and precision on top scoring predictions. More importantly, our model outperforms published and widely used miRNA target prediction algorithms. Finally, we highlight flaws in cross-validation schemes for evaluation of miRNA target prediction models and adopt a more fair and stringent approach. Oxford University Press 2017-04-20 2016-12-16 /pmc/articles/PMC5397177/ /pubmed/27986855 http://dx.doi.org/10.1093/nar/gkw1260 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Van Peer, Gert
De Paepe, Ayla
Stock, Michiel
Anckaert, Jasper
Volders, Pieter-Jan
Vandesompele, Jo
De Baets, Bernard
Waegeman, Willem
miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title_full miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title_fullStr miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title_full_unstemmed miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title_short miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure
title_sort mistar: mirna target prediction through modeling quantitative and qualitative mirna binding site information in a stacked model structure
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397177/
https://www.ncbi.nlm.nih.gov/pubmed/27986855
http://dx.doi.org/10.1093/nar/gkw1260
work_keys_str_mv AT vanpeergert mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT depaepeayla mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT stockmichiel mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT anckaertjasper mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT volderspieterjan mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT vandesompelejo mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT debaetsbernard mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure
AT waegemanwillem mistarmirnatargetpredictionthroughmodelingquantitativeandqualitativemirnabindingsiteinformationinastackedmodelstructure