Cargando…

Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy

Diatoms are unicellular eukaryotic phytoplankton that account for approximately 20% of global carbon fixation and 40% of marine primary productivity; thus, they are essential for global carbon biogeochemical cycling and climate. The availability of ten diatom genome sequences has facilitated evoluti...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xiao-Huang, Yang, Ming-Kun, Li, Yuan-Yuan, Xie, Zhang-Xian, Zhang, Shu-Feng, Töpel, Mats, Amin, Shady A., Lin, Lin, Ge, Feng, Wang, Da-Zhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077189/
https://www.ncbi.nlm.nih.gov/pubmed/37073328
http://dx.doi.org/10.1007/s42995-022-00161-y
_version_ 1785020265575481344
author Chen, Xiao-Huang
Yang, Ming-Kun
Li, Yuan-Yuan
Xie, Zhang-Xian
Zhang, Shu-Feng
Töpel, Mats
Amin, Shady A.
Lin, Lin
Ge, Feng
Wang, Da-Zhi
author_facet Chen, Xiao-Huang
Yang, Ming-Kun
Li, Yuan-Yuan
Xie, Zhang-Xian
Zhang, Shu-Feng
Töpel, Mats
Amin, Shady A.
Lin, Lin
Ge, Feng
Wang, Da-Zhi
author_sort Chen, Xiao-Huang
collection PubMed
description Diatoms are unicellular eukaryotic phytoplankton that account for approximately 20% of global carbon fixation and 40% of marine primary productivity; thus, they are essential for global carbon biogeochemical cycling and climate. The availability of ten diatom genome sequences has facilitated evolutionary, biological and ecological research over the past decade; however, a complimentary map of the diatom proteome with direct measurements of proteins and peptides is still lacking. Here, we present a proteome map of the model marine diatom Thalassiosira pseudonana using high-resolution mass spectrometry combined with a proteogenomic strategy. In-depth proteomic profiling of three different growth phases and three nutrient-deficient samples identified 9526 proteins, accounting for ~ 81% of the predicted protein-coding genes. Proteogenomic analysis identified 1235 novel genes, 975 revised genes, 104 splice variants and 234 single amino acid variants. Furthermore, our quantitative proteomic analysis experimentally demonstrated that a considerable number of novel genes were differentially translated under different nutrient conditions. These findings substantially improve the genome annotation of T. pseudonana and provide insights into new biological functions of diatoms. This relatively comprehensive diatom proteome catalog will complement available diatom genome and transcriptome data to advance biological and ecological research of marine diatoms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00161-y.
format Online
Article
Text
id pubmed-10077189
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-100771892023-04-17 Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy Chen, Xiao-Huang Yang, Ming-Kun Li, Yuan-Yuan Xie, Zhang-Xian Zhang, Shu-Feng Töpel, Mats Amin, Shady A. Lin, Lin Ge, Feng Wang, Da-Zhi Mar Life Sci Technol Research Paper Diatoms are unicellular eukaryotic phytoplankton that account for approximately 20% of global carbon fixation and 40% of marine primary productivity; thus, they are essential for global carbon biogeochemical cycling and climate. The availability of ten diatom genome sequences has facilitated evolutionary, biological and ecological research over the past decade; however, a complimentary map of the diatom proteome with direct measurements of proteins and peptides is still lacking. Here, we present a proteome map of the model marine diatom Thalassiosira pseudonana using high-resolution mass spectrometry combined with a proteogenomic strategy. In-depth proteomic profiling of three different growth phases and three nutrient-deficient samples identified 9526 proteins, accounting for ~ 81% of the predicted protein-coding genes. Proteogenomic analysis identified 1235 novel genes, 975 revised genes, 104 splice variants and 234 single amino acid variants. Furthermore, our quantitative proteomic analysis experimentally demonstrated that a considerable number of novel genes were differentially translated under different nutrient conditions. These findings substantially improve the genome annotation of T. pseudonana and provide insights into new biological functions of diatoms. This relatively comprehensive diatom proteome catalog will complement available diatom genome and transcriptome data to advance biological and ecological research of marine diatoms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00161-y. Springer Nature Singapore 2023-02-03 /pmc/articles/PMC10077189/ /pubmed/37073328 http://dx.doi.org/10.1007/s42995-022-00161-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Paper
Chen, Xiao-Huang
Yang, Ming-Kun
Li, Yuan-Yuan
Xie, Zhang-Xian
Zhang, Shu-Feng
Töpel, Mats
Amin, Shady A.
Lin, Lin
Ge, Feng
Wang, Da-Zhi
Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title_full Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title_fullStr Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title_full_unstemmed Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title_short Improving the genome and proteome annotations of the marine model diatom Thalassiosira pseudonana using a proteogenomics strategy
title_sort improving the genome and proteome annotations of the marine model diatom thalassiosira pseudonana using a proteogenomics strategy
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077189/
https://www.ncbi.nlm.nih.gov/pubmed/37073328
http://dx.doi.org/10.1007/s42995-022-00161-y
work_keys_str_mv AT chenxiaohuang improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT yangmingkun improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT liyuanyuan improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT xiezhangxian improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT zhangshufeng improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT topelmats improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT aminshadya improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT linlin improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT gefeng improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy
AT wangdazhi improvingthegenomeandproteomeannotationsofthemarinemodeldiatomthalassiosirapseudonanausingaproteogenomicsstrategy