Cargando…

GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes

The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Yi, Chen, Chunyan, Shen, Hao, He, Bin Z., Yu, Daqi, Jiang, Shuai, Zhao, Shilei, Gao, Zhiqiang, Zhu, Zhenglin, Chen, Xi, Fu, Yan, Chen, Hua, Gao, Ge, Long, Manyuan, Zhang, Yong E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6442393/
https://www.ncbi.nlm.nih.gov/pubmed/30862647
http://dx.doi.org/10.1101/gr.238733.118
_version_ 1783407700457029632
author Shao, Yi
Chen, Chunyan
Shen, Hao
He, Bin Z.
Yu, Daqi
Jiang, Shuai
Zhao, Shilei
Gao, Zhiqiang
Zhu, Zhenglin
Chen, Xi
Fu, Yan
Chen, Hua
Gao, Ge
Long, Manyuan
Zhang, Yong E.
author_facet Shao, Yi
Chen, Chunyan
Shen, Hao
He, Bin Z.
Yu, Daqi
Jiang, Shuai
Zhao, Shilei
Gao, Zhiqiang
Zhu, Zhenglin
Chen, Xi
Fu, Yan
Chen, Hua
Gao, Ge
Long, Manyuan
Zhang, Yong E.
author_sort Shao, Yi
collection PubMed
description The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age inferences from three major methods together with functional genomic data for new genes. Genome-wide comparison of the age inference methods revealed that the synteny-based pipeline (SBP) is most suited for recently duplicated genes, whereas the protein-family–based methods are useful for ancient genes. For SBP-dated primate-specific protein-coding genes (PSGs), we performed manual evaluation based on published PSG lists and showed that SBP generated a conservative data set of PSGs by masking less reliable syntenic regions. After assessing the coding potential based on evolutionary constraint and peptide evidence from proteomic data, we curated a list of 254 PSGs with different levels of protein evidence. This list also includes 41 candidate misannotated pseudogenes that encode primate-specific short proteins. Coexpression analysis showed that PSGs are preferentially recruited into organs with rapidly evolving pathways such as spermatogenesis, immune response, mother–fetus interaction, and brain development. For brain development, primate-specific KRAB zinc-finger proteins (KZNFs) are specifically up-regulated in the mid-fetal stage, which may have contributed to the evolution of this critical stage. Altogether, hundreds of PSGs are either recruited to processes under strong selection pressure or to processes supporting an evolving novel organ.
format Online
Article
Text
id pubmed-6442393
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-64423932019-04-17 GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes Shao, Yi Chen, Chunyan Shen, Hao He, Bin Z. Yu, Daqi Jiang, Shuai Zhao, Shilei Gao, Zhiqiang Zhu, Zhenglin Chen, Xi Fu, Yan Chen, Hua Gao, Ge Long, Manyuan Zhang, Yong E. Genome Res Resource The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age inferences from three major methods together with functional genomic data for new genes. Genome-wide comparison of the age inference methods revealed that the synteny-based pipeline (SBP) is most suited for recently duplicated genes, whereas the protein-family–based methods are useful for ancient genes. For SBP-dated primate-specific protein-coding genes (PSGs), we performed manual evaluation based on published PSG lists and showed that SBP generated a conservative data set of PSGs by masking less reliable syntenic regions. After assessing the coding potential based on evolutionary constraint and peptide evidence from proteomic data, we curated a list of 254 PSGs with different levels of protein evidence. This list also includes 41 candidate misannotated pseudogenes that encode primate-specific short proteins. Coexpression analysis showed that PSGs are preferentially recruited into organs with rapidly evolving pathways such as spermatogenesis, immune response, mother–fetus interaction, and brain development. For brain development, primate-specific KRAB zinc-finger proteins (KZNFs) are specifically up-regulated in the mid-fetal stage, which may have contributed to the evolution of this critical stage. Altogether, hundreds of PSGs are either recruited to processes under strong selection pressure or to processes supporting an evolving novel organ. Cold Spring Harbor Laboratory Press 2019-04 /pmc/articles/PMC6442393/ /pubmed/30862647 http://dx.doi.org/10.1101/gr.238733.118 Text en © 2019 Shao et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Shao, Yi
Chen, Chunyan
Shen, Hao
He, Bin Z.
Yu, Daqi
Jiang, Shuai
Zhao, Shilei
Gao, Zhiqiang
Zhu, Zhenglin
Chen, Xi
Fu, Yan
Chen, Hua
Gao, Ge
Long, Manyuan
Zhang, Yong E.
GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title_full GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title_fullStr GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title_full_unstemmed GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title_short GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
title_sort gentree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6442393/
https://www.ncbi.nlm.nih.gov/pubmed/30862647
http://dx.doi.org/10.1101/gr.238733.118
work_keys_str_mv AT shaoyi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT chenchunyan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT shenhao gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT hebinz gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT yudaqi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT jiangshuai gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT zhaoshilei gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT gaozhiqiang gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT zhuzhenglin gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT chenxi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT fuyan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT chenhua gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT gaoge gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT longmanyuan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes
AT zhangyonge gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes