Cargando…

A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq

Alternative polyadenylation (APA) plays important roles in modulating mRNA stability, translation, and subcellular localization, and contributes extensively to shaping eukaryotic transcriptome complexity and proteome diversity. Identification of poly(A) sites (pAs) on a genome-wide scale is a critic...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Wenbin, Lian, Qiwei, Ye, Congting, Wu, Xiaohui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372920/
https://www.ncbi.nlm.nih.gov/pubmed/36167284
http://dx.doi.org/10.1016/j.gpb.2022.09.005
_version_ 1785078454507536384
author Ye, Wenbin
Lian, Qiwei
Ye, Congting
Wu, Xiaohui
author_facet Ye, Wenbin
Lian, Qiwei
Ye, Congting
Wu, Xiaohui
author_sort Ye, Wenbin
collection PubMed
description Alternative polyadenylation (APA) plays important roles in modulating mRNA stability, translation, and subcellular localization, and contributes extensively to shaping eukaryotic transcriptome complexity and proteome diversity. Identification of poly(A) sites (pAs) on a genome-wide scale is a critical step toward understanding the underlying mechanism of APA-mediated gene regulation. A number of established computational tools have been proposed to predict pAs from diverse genomic data. Here we provided an exhaustive overview of computational approaches for predicting pAs from DNA sequences, bulk RNA sequencing (RNA-seq) data, and single-cell RNA sequencing (scRNA-seq) data. Particularly, we examined several representative tools using bulk RNA-seq and scRNA-seq data from peripheral blood mononuclear cells and put forward operable suggestions on how to assess the reliability of pAs predicted by different tools. We also proposed practical guidelines on choosing appropriate methods applicable to diverse scenarios. Moreover, we discussed in depth the challenges in improving the performance of pA prediction and benchmarking different methods. Additionally, we highlighted outstanding challenges and opportunities using new machine learning and integrative multi-omics techniques, and provided our perspective on how computational methodologies might evolve in the future for non-3′ untranslated region, tissue-specific, cross-species, and single-cell pA prediction.
format Online
Article
Text
id pubmed-10372920
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103729202023-07-28 A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq Ye, Wenbin Lian, Qiwei Ye, Congting Wu, Xiaohui Genomics Proteomics Bioinformatics Review Alternative polyadenylation (APA) plays important roles in modulating mRNA stability, translation, and subcellular localization, and contributes extensively to shaping eukaryotic transcriptome complexity and proteome diversity. Identification of poly(A) sites (pAs) on a genome-wide scale is a critical step toward understanding the underlying mechanism of APA-mediated gene regulation. A number of established computational tools have been proposed to predict pAs from diverse genomic data. Here we provided an exhaustive overview of computational approaches for predicting pAs from DNA sequences, bulk RNA sequencing (RNA-seq) data, and single-cell RNA sequencing (scRNA-seq) data. Particularly, we examined several representative tools using bulk RNA-seq and scRNA-seq data from peripheral blood mononuclear cells and put forward operable suggestions on how to assess the reliability of pAs predicted by different tools. We also proposed practical guidelines on choosing appropriate methods applicable to diverse scenarios. Moreover, we discussed in depth the challenges in improving the performance of pA prediction and benchmarking different methods. Additionally, we highlighted outstanding challenges and opportunities using new machine learning and integrative multi-omics techniques, and provided our perspective on how computational methodologies might evolve in the future for non-3′ untranslated region, tissue-specific, cross-species, and single-cell pA prediction. Elsevier 2023-02 2022-09-24 /pmc/articles/PMC10372920/ /pubmed/36167284 http://dx.doi.org/10.1016/j.gpb.2022.09.005 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review
Ye, Wenbin
Lian, Qiwei
Ye, Congting
Wu, Xiaohui
A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title_full A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title_fullStr A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title_full_unstemmed A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title_short A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
title_sort survey on methods for predicting polyadenylation sites from dna sequences, bulk rna-seq, and single-cell rna-seq
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372920/
https://www.ncbi.nlm.nih.gov/pubmed/36167284
http://dx.doi.org/10.1016/j.gpb.2022.09.005
work_keys_str_mv AT yewenbin asurveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT lianqiwei asurveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT yecongting asurveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT wuxiaohui asurveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT yewenbin surveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT lianqiwei surveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT yecongting surveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq
AT wuxiaohui surveyonmethodsforpredictingpolyadenylationsitesfromdnasequencesbulkrnaseqandsinglecellrnaseq