Cargando…

A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry

Proteogenomics is an increasingly common method for species identification as it allows for rapid and inexpensive interrogation of an unknown organism’s proteome—even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectab...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Heyi, Butler, Erin R., Monier, Samantha A., Teubl, Jennifer, Fenyö, David, Ueberheide, Beatrix, Siegel, Donald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149876/
https://www.ncbi.nlm.nih.gov/pubmed/34035355
http://dx.doi.org/10.1038/s41598-021-90231-5
_version_ 1783698041139625984
author Yang, Heyi
Butler, Erin R.
Monier, Samantha A.
Teubl, Jennifer
Fenyö, David
Ueberheide, Beatrix
Siegel, Donald
author_facet Yang, Heyi
Butler, Erin R.
Monier, Samantha A.
Teubl, Jennifer
Fenyö, David
Ueberheide, Beatrix
Siegel, Donald
author_sort Yang, Heyi
collection PubMed
description Proteogenomics is an increasingly common method for species identification as it allows for rapid and inexpensive interrogation of an unknown organism’s proteome—even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectable in a sample that frequently contains hundreds or thousands of proteins. Species identification is based on detection of a small numbers of species-specific peptides. Genetic analysis of proteins by mass spectrometry, however, is a developing field, and the bone proteome, typically consisting of only two proteins, pushes the limits of this technology. Nearly 20% of highly confident spectra from modern human bone samples identify non-human species when searched against a vertebrate database—as would be necessary with a fragment of unknown bone. These non-human peptides are often the result of current limitations in mass spectrometry or algorithm interpretation errors. Consequently, it is difficult to know if a “species-specific” peptide used to identify a sample is actually present in that sample. Here we evaluate the causes of peptide sequence errors and propose an unbiased, probabilistic approach to determine the likelihood that a species is correctly identified from bone without relying on species-specific peptides.
format Online
Article
Text
id pubmed-8149876
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81498762021-05-26 A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry Yang, Heyi Butler, Erin R. Monier, Samantha A. Teubl, Jennifer Fenyö, David Ueberheide, Beatrix Siegel, Donald Sci Rep Article Proteogenomics is an increasingly common method for species identification as it allows for rapid and inexpensive interrogation of an unknown organism’s proteome—even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectable in a sample that frequently contains hundreds or thousands of proteins. Species identification is based on detection of a small numbers of species-specific peptides. Genetic analysis of proteins by mass spectrometry, however, is a developing field, and the bone proteome, typically consisting of only two proteins, pushes the limits of this technology. Nearly 20% of highly confident spectra from modern human bone samples identify non-human species when searched against a vertebrate database—as would be necessary with a fragment of unknown bone. These non-human peptides are often the result of current limitations in mass spectrometry or algorithm interpretation errors. Consequently, it is difficult to know if a “species-specific” peptide used to identify a sample is actually present in that sample. Here we evaluate the causes of peptide sequence errors and propose an unbiased, probabilistic approach to determine the likelihood that a species is correctly identified from bone without relying on species-specific peptides. Nature Publishing Group UK 2021-05-25 /pmc/articles/PMC8149876/ /pubmed/34035355 http://dx.doi.org/10.1038/s41598-021-90231-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yang, Heyi
Butler, Erin R.
Monier, Samantha A.
Teubl, Jennifer
Fenyö, David
Ueberheide, Beatrix
Siegel, Donald
A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title_full A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title_fullStr A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title_full_unstemmed A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title_short A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
title_sort predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149876/
https://www.ncbi.nlm.nih.gov/pubmed/34035355
http://dx.doi.org/10.1038/s41598-021-90231-5
work_keys_str_mv AT yangheyi apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT butlererinr apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT moniersamanthaa apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT teubljennifer apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT fenyodavid apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT ueberheidebeatrix apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT siegeldonald apredictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT yangheyi predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT butlererinr predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT moniersamanthaa predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT teubljennifer predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT fenyodavid predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT ueberheidebeatrix predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry
AT siegeldonald predictivemodelforvertebrateboneidentificationfromcollagenusingproteomicmassspectrometry