Cargando…

Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data

Malaria symptoms are caused by the development of the parasites within the blood of an infected host. Bulk RNA sequencing (RNA-seq) of infected blood can reveal interactions between parasites and the host immune system during an infection, but because multiple developmental stages with distinct tran...

Descripción completa

Detalles Bibliográficos
Autores principales: Tebben, Kieran, Dia, Aliou, Serre, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426464/
https://www.ncbi.nlm.nih.gov/pubmed/35862820
http://dx.doi.org/10.1128/msystems.00258-22
_version_ 1784778685673373696
author Tebben, Kieran
Dia, Aliou
Serre, David
author_facet Tebben, Kieran
Dia, Aliou
Serre, David
author_sort Tebben, Kieran
collection PubMed
description Malaria symptoms are caused by the development of the parasites within the blood of an infected host. Bulk RNA sequencing (RNA-seq) of infected blood can reveal interactions between parasites and the host immune system during an infection, but because multiple developmental stages with distinct transcriptional profiles are concurrently present in infected blood, it is necessary to correct such analyses for differences in cell composition among samples. Gene expression deconvolution is a statistical approach that has been developed for inferring the cell composition of complex tissues characterized by bulk RNA-seq using gene expression profiles from reference cell types. Here, we describe the evaluation of a species-agnostic reference data set that can be used for efficient and accurate gene expression deconvolution of bulk RNA-seq data generated from any Plasmodium species and for correct gene expression analyses for biases caused by differences in stage composition among samples. IMPORTANCE Differences in cell type proportions among samples can introduce artifacts in gene expression analyses and mask genuine differences in gene regulation. Gene expression deconvolution allows estimation of the proportion of each cell type present in one sample directly from bulk RNA sequencing data, but this approach requires a reference data set with the signature profile of each cell type. Here, we evaluate the suitability of a rodent malaria parasite gene expression data set for estimating the proportions of each parasite developmental stage present in bulk RNA sequencing data generated from blood-stage infections with the human parasites Plasmodium falciparum and Plasmodium vivax. These analyses provide a species-agnostic approach for reliably estimating stage proportions in infected human blood and correcting subsequent gene expression analyses for these variations.
format Online
Article
Text
id pubmed-9426464
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-94264642022-08-31 Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data Tebben, Kieran Dia, Aliou Serre, David mSystems Methods and Protocols Malaria symptoms are caused by the development of the parasites within the blood of an infected host. Bulk RNA sequencing (RNA-seq) of infected blood can reveal interactions between parasites and the host immune system during an infection, but because multiple developmental stages with distinct transcriptional profiles are concurrently present in infected blood, it is necessary to correct such analyses for differences in cell composition among samples. Gene expression deconvolution is a statistical approach that has been developed for inferring the cell composition of complex tissues characterized by bulk RNA-seq using gene expression profiles from reference cell types. Here, we describe the evaluation of a species-agnostic reference data set that can be used for efficient and accurate gene expression deconvolution of bulk RNA-seq data generated from any Plasmodium species and for correct gene expression analyses for biases caused by differences in stage composition among samples. IMPORTANCE Differences in cell type proportions among samples can introduce artifacts in gene expression analyses and mask genuine differences in gene regulation. Gene expression deconvolution allows estimation of the proportion of each cell type present in one sample directly from bulk RNA sequencing data, but this approach requires a reference data set with the signature profile of each cell type. Here, we evaluate the suitability of a rodent malaria parasite gene expression data set for estimating the proportions of each parasite developmental stage present in bulk RNA sequencing data generated from blood-stage infections with the human parasites Plasmodium falciparum and Plasmodium vivax. These analyses provide a species-agnostic approach for reliably estimating stage proportions in infected human blood and correcting subsequent gene expression analyses for these variations. American Society for Microbiology 2022-07-05 /pmc/articles/PMC9426464/ /pubmed/35862820 http://dx.doi.org/10.1128/msystems.00258-22 Text en Copyright © 2022 Tebben et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods and Protocols
Tebben, Kieran
Dia, Aliou
Serre, David
Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title_full Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title_fullStr Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title_full_unstemmed Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title_short Determination of the Stage Composition of Plasmodium Infections from Bulk Gene Expression Data
title_sort determination of the stage composition of plasmodium infections from bulk gene expression data
topic Methods and Protocols
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426464/
https://www.ncbi.nlm.nih.gov/pubmed/35862820
http://dx.doi.org/10.1128/msystems.00258-22
work_keys_str_mv AT tebbenkieran determinationofthestagecompositionofplasmodiuminfectionsfrombulkgeneexpressiondata
AT diaaliou determinationofthestagecompositionofplasmodiuminfectionsfrombulkgeneexpressiondata
AT serredavid determinationofthestagecompositionofplasmodiuminfectionsfrombulkgeneexpressiondata