Cargando…

Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation

A lack of the complete pig proteome has left a gap in our knowledge of the pig genome and has restricted the feasibility of using pigs as a biomedical model. In this study, we developed a tissue-based proteome map using 34 major normal pig tissues. A total of 5841 unknown protein isoforms were ident...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Pengju, Zheng, Xianrui, Yu, Ying, Hou, Zhuocheng, Diao, Chenguang, Wang, Haifei, Kang, Huimin, Ning, Chao, Li, Junhui, Feng, Wen, Wang, Wen, Liu, George E., Li, Bugao, Smith, Jacqueline, Chamba, Yangzom, Liu, Jian-Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170766/
https://www.ncbi.nlm.nih.gov/pubmed/33631433
http://dx.doi.org/10.1016/j.gpb.2021.02.002
_version_ 1784721506914271232
author Zhao, Pengju
Zheng, Xianrui
Yu, Ying
Hou, Zhuocheng
Diao, Chenguang
Wang, Haifei
Kang, Huimin
Ning, Chao
Li, Junhui
Feng, Wen
Wang, Wen
Liu, George E.
Li, Bugao
Smith, Jacqueline
Chamba, Yangzom
Liu, Jian-Feng
author_facet Zhao, Pengju
Zheng, Xianrui
Yu, Ying
Hou, Zhuocheng
Diao, Chenguang
Wang, Haifei
Kang, Huimin
Ning, Chao
Li, Junhui
Feng, Wen
Wang, Wen
Liu, George E.
Li, Bugao
Smith, Jacqueline
Chamba, Yangzom
Liu, Jian-Feng
author_sort Zhao, Pengju
collection PubMed
description A lack of the complete pig proteome has left a gap in our knowledge of the pig genome and has restricted the feasibility of using pigs as a biomedical model. In this study, we developed a tissue-based proteome map using 34 major normal pig tissues. A total of 5841 unknown protein isoforms were identified and systematically characterized, including 2225 novel protein isoforms, 669 protein isoforms from 460 genes symbolized beginning with LOC, and 2947 protein isoforms without clear NCBI annotation in the current pig reference genome. These newly identified protein isoforms were functionally annotated through profiling the pig transcriptome with high-throughput RNA sequencing of the same pig tissues, further improving the genome annotation of the corresponding protein-coding genes. Combining the well-annotated genes that have parallel expression pattern and subcellular witness, we predicted the tissue-related subcellularlocations and potential functions for these unknown proteins. Finally, we mined 3081 orthologous genes for 52.7% of unknown protein isoforms across multiple species, referring to 68 KEGG pathways as well as 23 disease signaling pathways. These findings provide valuable insights and a rich resource for enhancing studies of pig genomics and biology, as well as biomedical model application to human medicine.
format Online
Article
Text
id pubmed-9170766
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91707662022-06-08 Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation Zhao, Pengju Zheng, Xianrui Yu, Ying Hou, Zhuocheng Diao, Chenguang Wang, Haifei Kang, Huimin Ning, Chao Li, Junhui Feng, Wen Wang, Wen Liu, George E. Li, Bugao Smith, Jacqueline Chamba, Yangzom Liu, Jian-Feng Genomics Proteomics Bioinformatics Original Research A lack of the complete pig proteome has left a gap in our knowledge of the pig genome and has restricted the feasibility of using pigs as a biomedical model. In this study, we developed a tissue-based proteome map using 34 major normal pig tissues. A total of 5841 unknown protein isoforms were identified and systematically characterized, including 2225 novel protein isoforms, 669 protein isoforms from 460 genes symbolized beginning with LOC, and 2947 protein isoforms without clear NCBI annotation in the current pig reference genome. These newly identified protein isoforms were functionally annotated through profiling the pig transcriptome with high-throughput RNA sequencing of the same pig tissues, further improving the genome annotation of the corresponding protein-coding genes. Combining the well-annotated genes that have parallel expression pattern and subcellular witness, we predicted the tissue-related subcellularlocations and potential functions for these unknown proteins. Finally, we mined 3081 orthologous genes for 52.7% of unknown protein isoforms across multiple species, referring to 68 KEGG pathways as well as 23 disease signaling pathways. These findings provide valuable insights and a rich resource for enhancing studies of pig genomics and biology, as well as biomedical model application to human medicine. Elsevier 2021-10 2021-02-23 /pmc/articles/PMC9170766/ /pubmed/33631433 http://dx.doi.org/10.1016/j.gpb.2021.02.002 Text en © 2021 The Authors. Published by Elsevier B.V. and Science Press on behalf of Beijing Institute of Genomics, Chinese Academy of Sciences / China National Center for Bioinformation and Genetics Society of China. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Zhao, Pengju
Zheng, Xianrui
Yu, Ying
Hou, Zhuocheng
Diao, Chenguang
Wang, Haifei
Kang, Huimin
Ning, Chao
Li, Junhui
Feng, Wen
Wang, Wen
Liu, George E.
Li, Bugao
Smith, Jacqueline
Chamba, Yangzom
Liu, Jian-Feng
Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title_full Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title_fullStr Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title_full_unstemmed Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title_short Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation
title_sort mining unknown porcine protein isoforms by tissue-based map of proteome enhances pig genome annotation
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170766/
https://www.ncbi.nlm.nih.gov/pubmed/33631433
http://dx.doi.org/10.1016/j.gpb.2021.02.002
work_keys_str_mv AT zhaopengju miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT zhengxianrui miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT yuying miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT houzhuocheng miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT diaochenguang miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT wanghaifei miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT kanghuimin miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT ningchao miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT lijunhui miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT fengwen miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT wangwen miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT liugeorgee miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT libugao miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT smithjacqueline miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT chambayangzom miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation
AT liujianfeng miningunknownporcineproteinisoformsbytissuebasedmapofproteomeenhancespiggenomeannotation