Cargando…

Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure

Rhesus macaque is a unique nonhuman primate model for human evolutionary and translational study, but the error-prone gene models critically limit its applications. Here, we de novo defined full-length macaque gene models based on single molecule, long-read transcriptome sequencing in four macaque t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yumei, Shen, Qing Sunny, Peng, Qi, Ding, Wanqiu, Zhang, Jie, Zhong, Xiaoming, An, Ni A, Ji, Mingjun, Zhou, Wei-Zhen, Li, Chuan-Yun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574621/
https://www.ncbi.nlm.nih.gov/pubmed/33973996
http://dx.doi.org/10.1093/bib/bbab157
_version_ 1784595580093202432
author Li, Yumei
Shen, Qing Sunny
Peng, Qi
Ding, Wanqiu
Zhang, Jie
Zhong, Xiaoming
An, Ni A
Ji, Mingjun
Zhou, Wei-Zhen
Li, Chuan-Yun
author_facet Li, Yumei
Shen, Qing Sunny
Peng, Qi
Ding, Wanqiu
Zhang, Jie
Zhong, Xiaoming
An, Ni A
Ji, Mingjun
Zhou, Wei-Zhen
Li, Chuan-Yun
author_sort Li, Yumei
collection PubMed
description Rhesus macaque is a unique nonhuman primate model for human evolutionary and translational study, but the error-prone gene models critically limit its applications. Here, we de novo defined full-length macaque gene models based on single molecule, long-read transcriptome sequencing in four macaque tissues (frontal cortex, cerebellum, heart and testis). Overall, 8 588 227 poly(A)-bearing complementary DNA reads with a mean length of 14 106 nt were generated to compile the backbone of macaque transcripts, with the fine-scale structures further refined by RNA sequencing and cap analysis gene expression sequencing data. In total, 51 605 macaque gene models were accurately defined, covering 89.7% of macaque or 75.7% of human orthologous genes. Based on the full-length gene models, we performed a human–macaque comparative analysis on polyadenylation (PA) regulation. Using macaque and mouse as outgroup species, we identified 79 distal PA events newly originated in humans and found that the strengthening of the distal PA sites, rather than the weakening of the proximal sites, predominantly contributes to the origination of these human-specific isoforms. Notably, these isoforms are selectively constrained in general and contribute to the temporospatially specific reduction of gene expression, through the tinkering of previously existed mechanisms of nuclear retention and microRNA (miRNA) regulation. Overall, the protocol and resource highlight the application of bioinformatics in integrating multilayer genomics data to provide an intact reference for model animal studies, and the isoform switching detected may constitute a hitherto underestimated regulatory layer in shaping the human-specific transcriptome and phenotypic changes.
format Online
Article
Text
id pubmed-8574621
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85746212021-11-09 Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure Li, Yumei Shen, Qing Sunny Peng, Qi Ding, Wanqiu Zhang, Jie Zhong, Xiaoming An, Ni A Ji, Mingjun Zhou, Wei-Zhen Li, Chuan-Yun Brief Bioinform Problem Solving Protocol Rhesus macaque is a unique nonhuman primate model for human evolutionary and translational study, but the error-prone gene models critically limit its applications. Here, we de novo defined full-length macaque gene models based on single molecule, long-read transcriptome sequencing in four macaque tissues (frontal cortex, cerebellum, heart and testis). Overall, 8 588 227 poly(A)-bearing complementary DNA reads with a mean length of 14 106 nt were generated to compile the backbone of macaque transcripts, with the fine-scale structures further refined by RNA sequencing and cap analysis gene expression sequencing data. In total, 51 605 macaque gene models were accurately defined, covering 89.7% of macaque or 75.7% of human orthologous genes. Based on the full-length gene models, we performed a human–macaque comparative analysis on polyadenylation (PA) regulation. Using macaque and mouse as outgroup species, we identified 79 distal PA events newly originated in humans and found that the strengthening of the distal PA sites, rather than the weakening of the proximal sites, predominantly contributes to the origination of these human-specific isoforms. Notably, these isoforms are selectively constrained in general and contribute to the temporospatially specific reduction of gene expression, through the tinkering of previously existed mechanisms of nuclear retention and microRNA (miRNA) regulation. Overall, the protocol and resource highlight the application of bioinformatics in integrating multilayer genomics data to provide an intact reference for model animal studies, and the isoform switching detected may constitute a hitherto underestimated regulatory layer in shaping the human-specific transcriptome and phenotypic changes. Oxford University Press 2021-05-11 /pmc/articles/PMC8574621/ /pubmed/33973996 http://dx.doi.org/10.1093/bib/bbab157 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Li, Yumei
Shen, Qing Sunny
Peng, Qi
Ding, Wanqiu
Zhang, Jie
Zhong, Xiaoming
An, Ni A
Ji, Mingjun
Zhou, Wei-Zhen
Li, Chuan-Yun
Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title_full Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title_fullStr Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title_full_unstemmed Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title_short Polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
title_sort polyadenylation-related isoform switching in human evolution revealed by full-length transcript structure
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574621/
https://www.ncbi.nlm.nih.gov/pubmed/33973996
http://dx.doi.org/10.1093/bib/bbab157
work_keys_str_mv AT liyumei polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT shenqingsunny polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT pengqi polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT dingwanqiu polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT zhangjie polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT zhongxiaoming polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT annia polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT jimingjun polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT zhouweizhen polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure
AT lichuanyun polyadenylationrelatedisoformswitchinginhumanevolutionrevealedbyfulllengthtranscriptstructure