
Protein abundances can distinguish between naturally-occurring and laboratory strains of Yersinia pestis, the causative agent of plague

The rapid pace of bacterial evolution enables organisms to adapt to the laboratory environment with repeated passage and thus diverge from naturally-occurring environmental (“wild”) strains. Distinguishing wild and laboratory strains is clearly important for biodefense and bioforensics; however, DNA...


Bibliographic Details
Main Authors: Merkley, Eric D., Sego, Landon H., Lin, Andy, Leiser, Owen P., Kaiser, Brooke L. Deatherage, Adkins, Joshua N., Keim, Paul S., Wagner, David M., Kreuzer, Helen W.
Format: Online Article Text
Language: English
Published: Public Library of Science 2017
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5576697/
https://www.ncbi.nlm.nih.gov/pubmed/28854255
http://dx.doi.org/10.1371/journal.pone.0183478
author Merkley, Eric D.
Sego, Landon H.
Lin, Andy
Leiser, Owen P.
Kaiser, Brooke L. Deatherage
Adkins, Joshua N.
Keim, Paul S.
Wagner, David M.
Kreuzer, Helen W.
collection PubMed
description The rapid pace of bacterial evolution enables organisms to adapt to the laboratory environment with repeated passage and thus diverge from naturally-occurring environmental (“wild”) strains. Distinguishing wild and laboratory strains is clearly important for biodefense and bioforensics; however, DNA sequence data alone has thus far not provided a clear signature, perhaps due to lack of understanding of how diverse genome changes lead to convergent phenotypes, difficulty in detecting certain types of mutations, or perhaps because some adaptive modifications are epigenetic. Monitoring protein abundance, a molecular measure of phenotype, can overcome some of these difficulties. We have assembled a collection of Yersinia pestis proteomics datasets from our own published and unpublished work, and from a proteomics data archive, and demonstrated that protein abundance data can clearly distinguish laboratory-adapted from wild. We developed a lasso logistic regression classifier that uses binary (presence/absence) or quantitative protein abundance measures to predict whether a sample is laboratory-adapted or wild that proved to be ~98% accurate, as judged by replicated 10-fold cross-validation. Protein features selected by the classifier accord well with our previous study of laboratory adaptation in Y. pestis. The input data was derived from a variety of unrelated experiments and contained significant confounding variables. We show that the classifier is robust with respect to these variables. The methodology is able to discover signatures for laboratory facility and culture medium that are largely independent of the signature of laboratory adaptation. Going beyond our previous laboratory evolution study, this work suggests that proteomic differences between laboratory-adapted and wild Y. pestis are general, potentially pointing to a process that could apply to other species as well. 
Additionally, we show that proteomics datasets (even archived data collected for different purposes) contain the information necessary to distinguish wild and laboratory samples. This work has clear applications in biomarker detection as well as biodefense.
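The classifier named in the abstract (lasso logistic regression on presence/absence protein features, judged by replicated 10-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' code or data: the toy presence/absence matrix, the penalty and step size, and every function name below are assumptions made for illustration, and the fit uses plain proximal gradient descent, which may differ from the solver used in the study.

```python
import math
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (proximal step for an L1 penalty)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_lasso_logistic(X, y, penalty=0.05, step=0.5, iterations=200):
    """Fit L1-penalised logistic regression by proximal gradient descent.

    penalty, step and iterations are illustrative defaults, not tuned values.
    """
    n, p = len(X), len(X[0])
    weights, bias = [0.0] * p, 0.0
    for _ in range(iterations):
        # Residuals (predicted probability minus label) at the current estimate.
        errors = [sigmoid(bias + sum(w * x for w, x in zip(weights, row))) - label
                  for row, label in zip(X, y)]
        bias -= step * sum(errors) / n  # the intercept is not penalised
        for j in range(p):
            grad = sum(e * row[j] for e, row in zip(errors, X)) / n
            weights[j] = soft_threshold(weights[j] - step * grad, step * penalty)
    return weights, bias


def predict(weights, bias, row):
    """Return 1 (e.g. laboratory-adapted) or 0 (e.g. wild) for one sample."""
    score = bias + sum(w * x for w, x in zip(weights, row))
    return 1 if sigmoid(score) >= 0.5 else 0


def replicated_kfold_accuracy(X, y, folds=10, replicates=3, seed=0):
    """Average accuracy over several reshuffled k-fold cross-validations."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(replicates):
        order = list(range(len(X)))
        rng.shuffle(order)  # a new random partition for each replicate
        correct = 0
        for k in range(folds):
            test_idx = set(order[k::folds])
            train = [i for i in order if i not in test_idx]
            w, b = fit_lasso_logistic([X[i] for i in train], [y[i] for i in train])
            correct += sum(predict(w, b, X[i]) == y[i] for i in test_idx)
        accuracies.append(correct / len(X))
    return sum(accuracies) / len(accuracies)


def make_toy_data(n=60, p=8, seed=1):
    """Synthetic presence/absence data: features 0 and 1 carry signal, the rest are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [1.0 if rng.random() < 0.5 else 0.0 for _ in range(p)]
        row[0] = 1.0 if rng.random() < (0.9 if label else 0.1) else 0.0
        row[1] = 1.0 if rng.random() < (0.1 if label else 0.9) else 0.0
        X.append(row)
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    weights, bias = fit_lasso_logistic(X, y)
    print("weights:", [round(w, 3) for w in weights])
    print("replicated 10-fold accuracy:", replicated_kfold_accuracy(X, y))
```

The L1 penalty is what makes the model select features: the soft-threshold step sets a coefficient exactly to zero when its gradient signal is weaker than the penalty, so the proteins left with non-zero weights form a candidate signature. Reshuffling the folds in each replicate gives an accuracy estimate that depends less on any single partition of the samples.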
format Online
Article
Text
id pubmed-5576697
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5576697 2017-09-15 PLoS One Research Article Public Library of Science 2017-08-30 /pmc/articles/PMC5576697/ /pubmed/28854255 http://dx.doi.org/10.1371/journal.pone.0183478 Text en © 2017 Merkley et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Protein abundances can distinguish between naturally-occurring and laboratory strains of Yersinia pestis, the causative agent of plague
topic Research Article