Cargando…
Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections
Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. Th...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8283032/ https://www.ncbi.nlm.nih.gov/pubmed/34276775 http://dx.doi.org/10.3389/fgene.2021.667936 |
_version_ | 1783723116517654528 |
---|---|
author | Rahmatbakhsh, Matineh Gagarinova, Alla Babu, Mohan |
author_facet | Rahmatbakhsh, Matineh Gagarinova, Alla Babu, Mohan |
author_sort | Rahmatbakhsh, Matineh |
collection | PubMed |
description | Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. The result is complex multidimensional data that pose numerous analytic challenges for downstream interpretation. However, a systematic review of approaches for the downstream analysis of such data has been lacking in the field. In this review, we detail the steps of a typical temporal and spatial analysis, including data pre-processing steps (i.e., quality control, data normalization, the imputation of missing values, and dimensionality reduction), different statistical and machine learning approaches, validation, interpretation, and the extraction of biological information from mass spectrometry data. We also discuss current best practices for these steps based on a collection of independent studies to guide users in selecting the most suitable strategies for their dataset and analysis objectives. Moreover, we also compiled the list of commonly used R software packages for each step of the analysis. These could be easily integrated into one’s analysis pipeline. Furthermore, we guide readers through various analysis steps by applying these workflows to mock and host-pathogen interaction data from public datasets. The workflows presented in this review will serve as an introduction for data analysis novices, while also helping established users update their data analysis pipelines. We conclude the review by discussing future directions and developments in temporal and spatial proteomics and data analysis approaches. Data analysis codes, prepared for this review are available from https://github.com/BabuLab-UofR/TempSpac, where guidelines and sample datasets are also offered for testing purposes. |
format | Online Article Text |
id | pubmed-8283032 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82830322021-07-17 Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections Rahmatbakhsh, Matineh Gagarinova, Alla Babu, Mohan Front Genet Genetics Microbial pathogens have evolved numerous mechanisms to hijack host’s systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. The result is complex multidimensional data that pose numerous analytic challenges for downstream interpretation. However, a systematic review of approaches for the downstream analysis of such data has been lacking in the field. In this review, we detail the steps of a typical temporal and spatial analysis, including data pre-processing steps (i.e., quality control, data normalization, the imputation of missing values, and dimensionality reduction), different statistical and machine learning approaches, validation, interpretation, and the extraction of biological information from mass spectrometry data. We also discuss current best practices for these steps based on a collection of independent studies to guide users in selecting the most suitable strategies for their dataset and analysis objectives. Moreover, we also compiled the list of commonly used R software packages for each step of the analysis. These could be easily integrated into one’s analysis pipeline. Furthermore, we guide readers through various analysis steps by applying these workflows to mock and host-pathogen interaction data from public datasets. The workflows presented in this review will serve as an introduction for data analysis novices, while also helping established users update their data analysis pipelines. We conclude the review by discussing future directions and developments in temporal and spatial proteomics and data analysis approaches. Data analysis codes, prepared for this review are available from https://github.com/BabuLab-UofR/TempSpac, where guidelines and sample datasets are also offered for testing purposes. Frontiers Media S.A. 2021-07-02 /pmc/articles/PMC8283032/ /pubmed/34276775 http://dx.doi.org/10.3389/fgene.2021.667936 Text en Copyright © 2021 Rahmatbakhsh, Gagarinova and Babu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Rahmatbakhsh, Matineh Gagarinova, Alla Babu, Mohan Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title | Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title_full | Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title_fullStr | Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title_full_unstemmed | Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title_short | Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections |
title_sort | bioinformatic analysis of temporal and spatial proteome alternations during infections |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8283032/ https://www.ncbi.nlm.nih.gov/pubmed/34276775 http://dx.doi.org/10.3389/fgene.2021.667936 |
work_keys_str_mv | AT rahmatbakhshmatineh bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections AT gagarinovaalla bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections AT babumohan bioinformaticanalysisoftemporalandspatialproteomealternationsduringinfections |