Cargando…
GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes
The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6442393/ https://www.ncbi.nlm.nih.gov/pubmed/30862647 http://dx.doi.org/10.1101/gr.238733.118 |
_version_ | 1783407700457029632 |
---|---|
author | Shao, Yi Chen, Chunyan Shen, Hao He, Bin Z. Yu, Daqi Jiang, Shuai Zhao, Shilei Gao, Zhiqiang Zhu, Zhenglin Chen, Xi Fu, Yan Chen, Hua Gao, Ge Long, Manyuan Zhang, Yong E. |
author_facet | Shao, Yi Chen, Chunyan Shen, Hao He, Bin Z. Yu, Daqi Jiang, Shuai Zhao, Shilei Gao, Zhiqiang Zhu, Zhenglin Chen, Xi Fu, Yan Chen, Hua Gao, Ge Long, Manyuan Zhang, Yong E. |
author_sort | Shao, Yi |
collection | PubMed |
description | The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age inferences from three major methods together with functional genomic data for new genes. Genome-wide comparison of the age inference methods revealed that the synteny-based pipeline (SBP) is most suited for recently duplicated genes, whereas the protein-family–based methods are useful for ancient genes. For SBP-dated primate-specific protein-coding genes (PSGs), we performed manual evaluation based on published PSG lists and showed that SBP generated a conservative data set of PSGs by masking less reliable syntenic regions. After assessing the coding potential based on evolutionary constraint and peptide evidence from proteomic data, we curated a list of 254 PSGs with different levels of protein evidence. This list also includes 41 candidate misannotated pseudogenes that encode primate-specific short proteins. Coexpression analysis showed that PSGs are preferentially recruited into organs with rapidly evolving pathways such as spermatogenesis, immune response, mother–fetus interaction, and brain development. For brain development, primate-specific KRAB zinc-finger proteins (KZNFs) are specifically up-regulated in the mid-fetal stage, which may have contributed to the evolution of this critical stage. Altogether, hundreds of PSGs are either recruited to processes under strong selection pressure or to processes supporting an evolving novel organ. |
format | Online Article Text |
id | pubmed-6442393 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-64423932019-04-17 GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes Shao, Yi Chen, Chunyan Shen, Hao He, Bin Z. Yu, Daqi Jiang, Shuai Zhao, Shilei Gao, Zhiqiang Zhu, Zhenglin Chen, Xi Fu, Yan Chen, Hua Gao, Ge Long, Manyuan Zhang, Yong E. Genome Res Resource The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age inferences from three major methods together with functional genomic data for new genes. Genome-wide comparison of the age inference methods revealed that the synteny-based pipeline (SBP) is most suited for recently duplicated genes, whereas the protein-family–based methods are useful for ancient genes. For SBP-dated primate-specific protein-coding genes (PSGs), we performed manual evaluation based on published PSG lists and showed that SBP generated a conservative data set of PSGs by masking less reliable syntenic regions. After assessing the coding potential based on evolutionary constraint and peptide evidence from proteomic data, we curated a list of 254 PSGs with different levels of protein evidence. This list also includes 41 candidate misannotated pseudogenes that encode primate-specific short proteins. Coexpression analysis showed that PSGs are preferentially recruited into organs with rapidly evolving pathways such as spermatogenesis, immune response, mother–fetus interaction, and brain development. For brain development, primate-specific KRAB zinc-finger proteins (KZNFs) are specifically up-regulated in the mid-fetal stage, which may have contributed to the evolution of this critical stage. Altogether, hundreds of PSGs are either recruited to processes under strong selection pressure or to processes supporting an evolving novel organ. Cold Spring Harbor Laboratory Press 2019-04 /pmc/articles/PMC6442393/ /pubmed/30862647 http://dx.doi.org/10.1101/gr.238733.118 Text en © 2019 Shao et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Resource Shao, Yi Chen, Chunyan Shen, Hao He, Bin Z. Yu, Daqi Jiang, Shuai Zhao, Shilei Gao, Zhiqiang Zhu, Zhenglin Chen, Xi Fu, Yan Chen, Hua Gao, Ge Long, Manyuan Zhang, Yong E. GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title | GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title_full | GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title_fullStr | GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title_full_unstemmed | GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title_short | GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
title_sort | gentree, an integrated resource for analyzing the evolution and function of primate-specific coding genes |
topic | Resource |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6442393/ https://www.ncbi.nlm.nih.gov/pubmed/30862647 http://dx.doi.org/10.1101/gr.238733.118 |
work_keys_str_mv | AT shaoyi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT chenchunyan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT shenhao gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT hebinz gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT yudaqi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT jiangshuai gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT zhaoshilei gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT gaozhiqiang gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT zhuzhenglin gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT chenxi gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT fuyan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT chenhua gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT gaoge gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT longmanyuan gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes AT zhangyonge gentreeanintegratedresourceforanalyzingtheevolutionandfunctionofprimatespecificcodinggenes |