Cargando…

Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment

BACKGROUND: The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from “cross-species proteomic effects”: the loss of peptide and protein identifications...

Descripción completa

Detalles Bibliográficos
Autor principal:	Welker, F.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819086/ https://www.ncbi.nlm.nih.gov/pubmed/29463217 http://dx.doi.org/10.1186/s12862-018-1141-1

_version_	1783301136299589632
author	Welker, F.
author_facet	Welker, F.
author_sort	Welker, F.
collection	PubMed
description	BACKGROUND: The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from “cross-species proteomic effects”: the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. RESULTS: Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. CONCLUSIONS: The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-018-1141-1) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5819086
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-58190862018-02-21 Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment Welker, F. BMC Evol Biol Research Article BACKGROUND: The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from “cross-species proteomic effects”: the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. RESULTS: Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. CONCLUSIONS: The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-018-1141-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-20 /pmc/articles/PMC5819086/ /pubmed/29463217 http://dx.doi.org/10.1186/s12862-018-1141-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Welker, F. Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title	Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title_full	Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title_fullStr	Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title_full_unstemmed	Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title_short	Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
title_sort	elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819086/ https://www.ncbi.nlm.nih.gov/pubmed/29463217 http://dx.doi.org/10.1186/s12862-018-1141-1
work_keys_str_mv	AT welkerf elucidationofcrossspeciesproteomiceffectsinhumanandhomininboneproteomeidentificationthroughabioinformaticsexperiment

Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment

Ejemplares similares