Cargando…

smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites

Degradome sequencing is commonly used to generate high-throughput information on mRNA cleavage sites mediated by small RNAs (sRNA). In our datasets of potato (Solanum tuberosum, St) and Phytophthora infestans (Pi), initial predictions generated high numbers of cleavage site predictions, which highli...

Descripción completa

Detalles Bibliográficos
Autores principales: Persson Hodén, Kristian, Hu, Xinyi, Martinez, German, Dixelius, Christina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8073297/
https://www.ncbi.nlm.nih.gov/pubmed/33924042
http://dx.doi.org/10.3390/ijms22084267
_version_ 1783684097685585920
author Persson Hodén, Kristian
Hu, Xinyi
Martinez, German
Dixelius, Christina
author_facet Persson Hodén, Kristian
Hu, Xinyi
Martinez, German
Dixelius, Christina
author_sort Persson Hodén, Kristian
collection PubMed
description Degradome sequencing is commonly used to generate high-throughput information on mRNA cleavage sites mediated by small RNAs (sRNA). In our datasets of potato (Solanum tuberosum, St) and Phytophthora infestans (Pi), initial predictions generated high numbers of cleavage site predictions, which highlighted the need of improved analytic tools. Here, we present an R package based on a deep learning convolutional neural network (CNN) in a machine learning environment to optimize discrimination of false from true cleavage sites. When applying smartPARE to our datasets on potato during the infection process by the late blight pathogen, 7.3% of all cleavage windows represented true cleavages distributed on 214 sites in P. infestans and 444 sites in potato. The sRNA landscape of the two organisms is complex with uneven sRNA production and cleavage regions widespread in the two genomes. Multiple targets and several cases of complex regulatory cascades, particularly in potato, was revealed. We conclude that our new analytic approach is useful for anyone working on complex biological systems and with the interest of identifying cleavage sites particularly inferred by sRNA classes beyond miRNAs.
format Online
Article
Text
id pubmed-8073297
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80732972021-04-27 smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites Persson Hodén, Kristian Hu, Xinyi Martinez, German Dixelius, Christina Int J Mol Sci Article Degradome sequencing is commonly used to generate high-throughput information on mRNA cleavage sites mediated by small RNAs (sRNA). In our datasets of potato (Solanum tuberosum, St) and Phytophthora infestans (Pi), initial predictions generated high numbers of cleavage site predictions, which highlighted the need of improved analytic tools. Here, we present an R package based on a deep learning convolutional neural network (CNN) in a machine learning environment to optimize discrimination of false from true cleavage sites. When applying smartPARE to our datasets on potato during the infection process by the late blight pathogen, 7.3% of all cleavage windows represented true cleavages distributed on 214 sites in P. infestans and 444 sites in potato. The sRNA landscape of the two organisms is complex with uneven sRNA production and cleavage regions widespread in the two genomes. Multiple targets and several cases of complex regulatory cascades, particularly in potato, was revealed. We conclude that our new analytic approach is useful for anyone working on complex biological systems and with the interest of identifying cleavage sites particularly inferred by sRNA classes beyond miRNAs. MDPI 2021-04-20 /pmc/articles/PMC8073297/ /pubmed/33924042 http://dx.doi.org/10.3390/ijms22084267 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Persson Hodén, Kristian
Hu, Xinyi
Martinez, German
Dixelius, Christina
smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title_full smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title_fullStr smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title_full_unstemmed smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title_short smartPARE: An R Package for Efficient Identification of True mRNA Cleavage Sites
title_sort smartpare: an r package for efficient identification of true mrna cleavage sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8073297/
https://www.ncbi.nlm.nih.gov/pubmed/33924042
http://dx.doi.org/10.3390/ijms22084267
work_keys_str_mv AT perssonhodenkristian smartpareanrpackageforefficientidentificationoftruemrnacleavagesites
AT huxinyi smartpareanrpackageforefficientidentificationoftruemrnacleavagesites
AT martinezgerman smartpareanrpackageforefficientidentificationoftruemrnacleavagesites
AT dixeliuschristina smartpareanrpackageforefficientidentificationoftruemrnacleavagesites