Cargando…

Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species

The native subcellular location (also referred to as localization or cellular compartment) of a protein is the one in which it acts most frequently; it is one aspect of protein function. Do ten eukaryotic model organisms differ in their location spectrum, i.e., the fraction of its proteome in each o...

Descripción completa

Detalles Bibliográficos
Autores principales: Marot-Lassauzaie, Valérie, Goldberg, Tatyana, Armenteros, Jose Juan Almagro, Nielsen, Henrik, Rost, Burkhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379119/
https://www.ncbi.nlm.nih.gov/pubmed/34328525
http://dx.doi.org/10.1007/s00239-021-10022-4
_version_ 1783740944286220288
author Marot-Lassauzaie, Valérie
Goldberg, Tatyana
Armenteros, Jose Juan Almagro
Nielsen, Henrik
Rost, Burkhard
author_facet Marot-Lassauzaie, Valérie
Goldberg, Tatyana
Armenteros, Jose Juan Almagro
Nielsen, Henrik
Rost, Burkhard
author_sort Marot-Lassauzaie, Valérie
collection PubMed
description The native subcellular location (also referred to as localization or cellular compartment) of a protein is the one in which it acts most frequently; it is one aspect of protein function. Do ten eukaryotic model organisms differ in their location spectrum, i.e., the fraction of its proteome in each of seven major cellular compartments? As experimental annotations of locations remain biased and incomplete, we need prediction methods to answer this question. After systematic bias corrections, the complete but faulty prediction methods appeared to be more appropriate to compare location spectra between species than the incomplete more accurate experimental data. This work compared the location spectra for ten eukaryotes: Homo sapiens (human), Gorilla gorilla (gorilla), Pan troglodytes (chimpanzee), Mus musculus (mouse), Rattus norvegicus (rat), Drosophila melanogaster (fruit/vinegar fly), Anopheles gambiae (African malaria mosquito), Caenorhabitis elegans (nematode), Saccharomyces cerevisiae (baker’s yeast), and Schizosaccharomyces pombe (fission yeast). The two largest classes were predicted to be the nucleus and the cytoplasm together accounting for 47–62% of all proteins, while 7–21% of the proteins were predicted in the plasma membrane and 4–15% to be secreted. Overall, the predicted location spectra were largely similar. However, in detail, the differences sufficed to plot trees (UPGMA) and 2D (PCA) maps relating the ten organisms using a simple Euclidean distance in seven states (location classes). The relations based on the simple predicted location spectra captured aspects of cross-species comparisons usually revealed only by much more detailed evolutionary comparisons. Most interestingly, known phylogenetic relations were reproduced better by paralog-only than by ortholog-only trees. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00239-021-10022-4.
format Online
Article
Text
id pubmed-8379119
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-83791192021-09-02 Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species Marot-Lassauzaie, Valérie Goldberg, Tatyana Armenteros, Jose Juan Almagro Nielsen, Henrik Rost, Burkhard J Mol Evol Original Article The native subcellular location (also referred to as localization or cellular compartment) of a protein is the one in which it acts most frequently; it is one aspect of protein function. Do ten eukaryotic model organisms differ in their location spectrum, i.e., the fraction of its proteome in each of seven major cellular compartments? As experimental annotations of locations remain biased and incomplete, we need prediction methods to answer this question. After systematic bias corrections, the complete but faulty prediction methods appeared to be more appropriate to compare location spectra between species than the incomplete more accurate experimental data. This work compared the location spectra for ten eukaryotes: Homo sapiens (human), Gorilla gorilla (gorilla), Pan troglodytes (chimpanzee), Mus musculus (mouse), Rattus norvegicus (rat), Drosophila melanogaster (fruit/vinegar fly), Anopheles gambiae (African malaria mosquito), Caenorhabitis elegans (nematode), Saccharomyces cerevisiae (baker’s yeast), and Schizosaccharomyces pombe (fission yeast). The two largest classes were predicted to be the nucleus and the cytoplasm together accounting for 47–62% of all proteins, while 7–21% of the proteins were predicted in the plasma membrane and 4–15% to be secreted. Overall, the predicted location spectra were largely similar. However, in detail, the differences sufficed to plot trees (UPGMA) and 2D (PCA) maps relating the ten organisms using a simple Euclidean distance in seven states (location classes). The relations based on the simple predicted location spectra captured aspects of cross-species comparisons usually revealed only by much more detailed evolutionary comparisons. Most interestingly, known phylogenetic relations were reproduced better by paralog-only than by ortholog-only trees. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00239-021-10022-4. Springer US 2021-07-30 2021 /pmc/articles/PMC8379119/ /pubmed/34328525 http://dx.doi.org/10.1007/s00239-021-10022-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Marot-Lassauzaie, Valérie
Goldberg, Tatyana
Armenteros, Jose Juan Almagro
Nielsen, Henrik
Rost, Burkhard
Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title_full Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title_fullStr Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title_full_unstemmed Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title_short Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species
title_sort spectrum of protein location in proteomes captures evolutionary relationship between species
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379119/
https://www.ncbi.nlm.nih.gov/pubmed/34328525
http://dx.doi.org/10.1007/s00239-021-10022-4
work_keys_str_mv AT marotlassauzaievalerie spectrumofproteinlocationinproteomescapturesevolutionaryrelationshipbetweenspecies
AT goldbergtatyana spectrumofproteinlocationinproteomescapturesevolutionaryrelationshipbetweenspecies
AT armenterosjosejuanalmagro spectrumofproteinlocationinproteomescapturesevolutionaryrelationshipbetweenspecies
AT nielsenhenrik spectrumofproteinlocationinproteomescapturesevolutionaryrelationshipbetweenspecies
AT rostburkhard spectrumofproteinlocationinproteomescapturesevolutionaryrelationshipbetweenspecies