Cargando…

Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections

Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Rahmatbakhsh, Matineh, Gagarinova, Alla, Babu, Mohan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8283032/
https://www.ncbi.nlm.nih.gov/pubmed/34276775
http://dx.doi.org/10.3389/fgene.2021.667936
_version_ 1783723116517654528
author Rahmatbakhsh, Matineh
Gagarinova, Alla
Babu, Mohan
author_facet Rahmatbakhsh, Matineh
Gagarinova, Alla
Babu, Mohan
author_sort Rahmatbakhsh, Matineh
collection PubMed
description Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. The result is complex multidimensional data that pose numerous analytic challenges for downstream interpretation. However, a systematic review of approaches for the downstream analysis of such data has been lacking in the field. In this review, we detail the steps of a typical temporal and spatial analysis, including data pre-processing steps (i.e., quality control, data normalization, the imputation of missing values, and dimensionality reduction), different statistical and machine learning approaches, validation, interpretation, and the extraction of biological information from mass spectrometry data. We also discuss current best practices for these steps based on a collection of independent studies to guide users in selecting the most suitable strategies for their dataset and analysis objectives. Moreover, we also compiled the list of commonly used R software packages for each step of the analysis. These could be easily integrated into one’s analysis pipeline. Furthermore, we guide readers through various analysis steps by applying these workflows to mock and host-pathogen interaction data from public datasets. The workflows presented in this review will serve as an introduction for data analysis novices, while also helping established users update their data analysis pipelines. We conclude the review by discussing future directions and developments in temporal and spatial proteomics and data analysis approaches. Data analysis codes, prepared for this review are available from https://github.com/BabuLab-UofR/TempSpac, where guidelines and sample datasets are also offered for testing purposes.
format Online
Article
Text
id pubmed-8283032
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82830322021-07-17 Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections Rahmatbakhsh, Matineh Gagarinova, Alla Babu, Mohan Front Genet Genetics Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. The result is complex multidimensional data that pose numerous analytic challenges for downstream interpretation. However, a systematic review of approaches for the downstream analysis of such data has been lacking in the field. In this review, we detail the steps of a typical temporal and spatial analysis, including data pre-processing steps (i.e., quality control, data normalization, the imputation of missing values, and dimensionality reduction), different statistical and machine learning approaches, validation, interpretation, and the extraction of biological information from mass spectrometry data. We also discuss current best practices for these steps based on a collection of independent studies to guide users in selecting the most suitable strategies for their dataset and analysis objectives. Moreover, we also compiled the list of commonly used R software packages for each step of the analysis. These could be easily integrated into one’s analysis pipeline. Furthermore, we guide readers through various analysis steps by applying these workflows to mock and host-pathogen interaction data from public datasets. The workflows presented in this review will serve as an introduction for data analysis novices, while also helping established users update their data analysis pipelines. We conclude the review by discussing future directions and developments in temporal and spatial proteomics and data analysis approaches. Data analysis codes, prepared for this review are available from https://github.com/BabuLab-UofR/TempSpac, where guidelines and sample datasets are also offered for testing purposes. Frontiers Media S.A. 2021-07-02 /pmc/articles/PMC8283032/ /pubmed/34276775 http://dx.doi.org/10.3389/fgene.2021.667936 Text en Copyright © 2021 Rahmatbakhsh, Gagarinova and Babu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Rahmatbakhsh, Matineh
Gagarinova, Alla
Babu, Mohan
Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title_full Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title_fullStr Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title_full_unstemmed Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title_short Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
title_sort bioinformatic analysis of temporal and spatial proteome alternations during infections
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8283032/
https://www.ncbi.nlm.nih.gov/pubmed/34276775
http://dx.doi.org/10.3389/fgene.2021.667936
work_keys_str_mv AT rahmatbakhshmatineh bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections
AT gagarinovaalla bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections
AT babumohan bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections