tRFTars: predicting the targets of tRNA-derived fragments

BACKGROUND: tRNA-derived fragments (tRFs) are 14–40-nucleotide-long, small non-coding RNAs derived from specific tRNA cleavage events with key regulatory functions in many biological processes. Many studies have shown that tRFs are associated with Argonaute (AGO) complexes and inhibit gene expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Qiong, Gao, Peng, Huang, Xuanzhang, Chen, Xiaowan, Chen, Quan, Lv, Xinger, Fu, Yu, Song, Yongxi, Wang, Zhenning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7908658/
https://www.ncbi.nlm.nih.gov/pubmed/33632236
http://dx.doi.org/10.1186/s12967-021-02731-7
Descripción
Sumario:BACKGROUND: tRNA-derived fragments (tRFs) are 14–40-nucleotide-long, small non-coding RNAs derived from specific tRNA cleavage events with key regulatory functions in many biological processes. Many studies have shown that tRFs are associated with Argonaute (AGO) complexes and inhibit gene expression in the same manner as miRNAs. However, there are currently no tools for accurately predicting tRF target genes. METHODS: We used tRF-mRNA pairs identified by crosslinking, ligation, and sequencing of hybrids (CLASH) and covalent ligation of endogenous AGO-bound RNAs (CLEAR)-CLIP to assess features that may participate in tRF targeting, including the sequence context of each site and tRF-mRNA interactions. We applied genetic algorithm (GA) to select key features and support vector machine (SVM) to construct tRF prediction models. RESULTS: We first identified features that globally influenced tRF targeting. Among these features, the most significant were the minimum free folding energy (MFE), position 8 match, number of bases paired in the tRF-mRNA duplex, and length of the tRF, which were consistent with previous findings. Our constructed model yielded an area under the receiver operating characteristic (ROC) curve (AUC) = 0.980 (0.977–0.983) in the training process and an AUC = 0.847 (0.83–0.861) in the test process. The model was applied to all the sites with perfect Watson–Crick complementarity to the seed in the 3′ untranslated region (3′-UTR) of the human genome. Seven of nine target/nontarget genes of tRFs confirmed by reporter assay were predicted. We also validated the predictions via quantitative real-time PCR (qRT-PCR). Thirteen potential target genes from the top of the predictions were significantly down-regulated at the mRNA levels by overexpression of the tRFs (tRF-3001a, tRF-3003a or tRF-3009a). CONCLUSIONS: Predictions can be obtained online, tRFTars, freely available at http://trftars.cmuzhenninglab.org:3838/tar/, which is the first tool to predict targets of tRFs in humans with a user-friendly interface.