Cargando…
Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species
BACKGROUND: MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally via base-pairing with complementary sequences on messenger RNAs (mRNAs). Due to the technical challenges involved in the application of high-throughput experimental methods, datasets of dire...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146624/ https://www.ncbi.nlm.nih.gov/pubmed/34030625 http://dx.doi.org/10.1186/s12859-021-04164-x |
_version_ | 1783697439094472704 |
---|---|
author | Ben Or, Gilad Veksler-Lublinsky, Isana |
author_facet | Ben Or, Gilad Veksler-Lublinsky, Isana |
author_sort | Ben Or, Gilad |
collection | PubMed |
description | BACKGROUND: MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally via base-pairing with complementary sequences on messenger RNAs (mRNAs). Due to the technical challenges involved in the application of high-throughput experimental methods, datasets of direct bona fide miRNA targets exist only for a few model organisms. Machine learning (ML)-based target prediction models were successfully trained and tested on some of these datasets. There is a need to further apply the trained models to organisms in which experimental training data are unavailable. However, it is largely unknown how the features of miRNA–target interactions evolve and whether some features have remained fixed during evolution, raising questions regarding the general, cross-species applicability of currently available ML methods. RESULTS: We examined the evolution of miRNA–target interaction rules and used data science and ML approaches to investigate whether these rules are transferable between species. We analyzed eight datasets of direct miRNA–target interactions in four species (human, mouse, worm, cattle). Using ML classifiers, we achieved high accuracy for intra-dataset classification and found that the most influential features of all datasets overlap significantly. To explore the relationships between datasets, we measured the divergence of their miRNA seed sequences and evaluated the performance of cross-dataset classification. We found that both measures coincide with the evolutionary distance between the compared species. CONCLUSIONS: The transferability of miRNA–targeting rules between species depends on several factors, the most associated factors being the composition of seed families and evolutionary distance. Furthermore, our feature-importance results suggest that some miRNA–target features have evolved while others remained fixed during the evolution of the species. Our findings lay the foundation for the future development of target prediction tools that could be applied to “non-model” organisms for which minimal experimental data are available. AVAILABILITY AND IMPLEMENTATION: The code is freely available at https://github.com/gbenor/TPVOD. SUPPLEMENTARY INFORMATION: The online version supplementary material available at 10.1186/s12859-021-04164-x. |
format | Online Article Text |
id | pubmed-8146624 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81466242021-05-25 Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species Ben Or, Gilad Veksler-Lublinsky, Isana BMC Bioinformatics Research Article BACKGROUND: MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally via base-pairing with complementary sequences on messenger RNAs (mRNAs). Due to the technical challenges involved in the application of high-throughput experimental methods, datasets of direct bona fide miRNA targets exist only for a few model organisms. Machine learning (ML)-based target prediction models were successfully trained and tested on some of these datasets. There is a need to further apply the trained models to organisms in which experimental training data are unavailable. However, it is largely unknown how the features of miRNA–target interactions evolve and whether some features have remained fixed during evolution, raising questions regarding the general, cross-species applicability of currently available ML methods. RESULTS: We examined the evolution of miRNA–target interaction rules and used data science and ML approaches to investigate whether these rules are transferable between species. We analyzed eight datasets of direct miRNA–target interactions in four species (human, mouse, worm, cattle). Using ML classifiers, we achieved high accuracy for intra-dataset classification and found that the most influential features of all datasets overlap significantly. To explore the relationships between datasets, we measured the divergence of their miRNA seed sequences and evaluated the performance of cross-dataset classification. We found that both measures coincide with the evolutionary distance between the compared species. CONCLUSIONS: The transferability of miRNA–targeting rules between species depends on several factors, the most associated factors being the composition of seed families and evolutionary distance. Furthermore, our feature-importance results suggest that some miRNA–target features have evolved while others remained fixed during the evolution of the species. Our findings lay the foundation for the future development of target prediction tools that could be applied to “non-model” organisms for which minimal experimental data are available. AVAILABILITY AND IMPLEMENTATION: The code is freely available at https://github.com/gbenor/TPVOD. SUPPLEMENTARY INFORMATION: The online version supplementary material available at 10.1186/s12859-021-04164-x. BioMed Central 2021-05-24 /pmc/articles/PMC8146624/ /pubmed/34030625 http://dx.doi.org/10.1186/s12859-021-04164-x Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Ben Or, Gilad Veksler-Lublinsky, Isana Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title | Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title_full | Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title_fullStr | Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title_full_unstemmed | Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title_short | Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species |
title_sort | comprehensive machine-learning-based analysis of microrna–target interactions reveals variable transferability of interaction rules across species |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146624/ https://www.ncbi.nlm.nih.gov/pubmed/34030625 http://dx.doi.org/10.1186/s12859-021-04164-x |
work_keys_str_mv | AT benorgilad comprehensivemachinelearningbasedanalysisofmicrornatargetinteractionsrevealsvariabletransferabilityofinteractionrulesacrossspecies AT vekslerlublinskyisana comprehensivemachinelearningbasedanalysisofmicrornatargetinteractionsrevealsvariabletransferabilityofinteractionrulesacrossspecies |