Cargando…

Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes

MOTIVATION: Single-cell sequencing now enables the recovery of full-length immune receptor repertoires [B cell receptor (BCR) and T cell receptor (TCR) repertoires], in addition to gene expression information. The feature-rich datasets produced from such experiments require extensive and diverse com...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Jiami, Masserey, Solène, Shlesinger, Danielle, Kuhn, Raphael, Papadopoulou, Chrysa, Agrafiotis, Andreas, Kreiner, Victor, Dizerens, Raphael, Hong, Kai-Lin, Weber, Cédric, Greiff, Victor, Oxenius, Annette, Reddy, Sai T, Yermanos, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710610/
https://www.ncbi.nlm.nih.gov/pubmed/36699357
http://dx.doi.org/10.1093/bioadv/vbac062
_version_ 1784841403596013568
author Han, Jiami
Masserey, Solène
Shlesinger, Danielle
Kuhn, Raphael
Papadopoulou, Chrysa
Agrafiotis, Andreas
Kreiner, Victor
Dizerens, Raphael
Hong, Kai-Lin
Weber, Cédric
Greiff, Victor
Oxenius, Annette
Reddy, Sai T
Yermanos, Alexander
author_facet Han, Jiami
Masserey, Solène
Shlesinger, Danielle
Kuhn, Raphael
Papadopoulou, Chrysa
Agrafiotis, Andreas
Kreiner, Victor
Dizerens, Raphael
Hong, Kai-Lin
Weber, Cédric
Greiff, Victor
Oxenius, Annette
Reddy, Sai T
Yermanos, Alexander
author_sort Han, Jiami
collection PubMed
description MOTIVATION: Single-cell sequencing now enables the recovery of full-length immune receptor repertoires [B cell receptor (BCR) and T cell receptor (TCR) repertoires], in addition to gene expression information. The feature-rich datasets produced from such experiments require extensive and diverse computational analyses, each of which can significantly influence the downstream immunological interpretations, such as clonal selection and expansion. Simulations produce validated standard datasets, where the underlying generative model can be precisely defined and furthermore perturbed to investigate specific questions of interest. Currently, there is no tool that can be used to simulate single-cell datasets incorporating immune receptor repertoires and gene expression. RESULTS: We developed Echidna, an R package that simulates immune receptors and transcriptomes at single-cell resolution with user-tunable parameters controlling a wide range of features such as clonal expansion, germline gene usage, somatic hypermutation, transcriptional phenotypes and spatial location. Echidna can additionally simulate time-resolved B cell evolution, producing mutational networks with complex selection histories incorporating class-switching and B cell subtype information. We demonstrated the benchmarking potential of Echidna by simulating clonal lineages and comparing the known simulated networks with those inferred from only the BCR sequences as input. Finally, we simulated immune repertoire information onto existing spatial transcriptomic experiments, thereby generating novel datasets that could be used to develop and integrate methods to profile clonal selection in a spatially resolved manner. Together, Echidna provides a framework that can incorporate experimental data to simulate single-cell immune repertoires to aid software development and bioinformatic benchmarking of clonotyping, phylogenetics, transcriptomics and machine learning strategies. AVAILABILITY AND IMPLEMENTATION: The R package and code used in this manuscript can be found at github.com/alexyermanos/echidna and also in the R package Platypus (Yermanos et al., 2021). Installation instructions and the vignette for Echidna is described in the Platypus Computational Ecosystem (https://alexyermanos.github.io/Platypus/index.html). Publicly available data and corresponding sample accession numbers can be found in Supplementary Tables S2 and S3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9710610
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106102023-01-24 Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes Han, Jiami Masserey, Solène Shlesinger, Danielle Kuhn, Raphael Papadopoulou, Chrysa Agrafiotis, Andreas Kreiner, Victor Dizerens, Raphael Hong, Kai-Lin Weber, Cédric Greiff, Victor Oxenius, Annette Reddy, Sai T Yermanos, Alexander Bioinform Adv Original Paper MOTIVATION: Single-cell sequencing now enables the recovery of full-length immune receptor repertoires [B cell receptor (BCR) and T cell receptor (TCR) repertoires], in addition to gene expression information. The feature-rich datasets produced from such experiments require extensive and diverse computational analyses, each of which can significantly influence the downstream immunological interpretations, such as clonal selection and expansion. Simulations produce validated standard datasets, where the underlying generative model can be precisely defined and furthermore perturbed to investigate specific questions of interest. Currently, there is no tool that can be used to simulate single-cell datasets incorporating immune receptor repertoires and gene expression. RESULTS: We developed Echidna, an R package that simulates immune receptors and transcriptomes at single-cell resolution with user-tunable parameters controlling a wide range of features such as clonal expansion, germline gene usage, somatic hypermutation, transcriptional phenotypes and spatial location. Echidna can additionally simulate time-resolved B cell evolution, producing mutational networks with complex selection histories incorporating class-switching and B cell subtype information. We demonstrated the benchmarking potential of Echidna by simulating clonal lineages and comparing the known simulated networks with those inferred from only the BCR sequences as input. Finally, we simulated immune repertoire information onto existing spatial transcriptomic experiments, thereby generating novel datasets that could be used to develop and integrate methods to profile clonal selection in a spatially resolved manner. Together, Echidna provides a framework that can incorporate experimental data to simulate single-cell immune repertoires to aid software development and bioinformatic benchmarking of clonotyping, phylogenetics, transcriptomics and machine learning strategies. AVAILABILITY AND IMPLEMENTATION: The R package and code used in this manuscript can be found at github.com/alexyermanos/echidna and also in the R package Platypus (Yermanos et al., 2021). Installation instructions and the vignette for Echidna is described in the Platypus Computational Ecosystem (https://alexyermanos.github.io/Platypus/index.html). Publicly available data and corresponding sample accession numbers can be found in Supplementary Tables S2 and S3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2022-09-02 /pmc/articles/PMC9710610/ /pubmed/36699357 http://dx.doi.org/10.1093/bioadv/vbac062 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Han, Jiami
Masserey, Solène
Shlesinger, Danielle
Kuhn, Raphael
Papadopoulou, Chrysa
Agrafiotis, Andreas
Kreiner, Victor
Dizerens, Raphael
Hong, Kai-Lin
Weber, Cédric
Greiff, Victor
Oxenius, Annette
Reddy, Sai T
Yermanos, Alexander
Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title_full Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title_fullStr Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title_full_unstemmed Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title_short Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
title_sort echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710610/
https://www.ncbi.nlm.nih.gov/pubmed/36699357
http://dx.doi.org/10.1093/bioadv/vbac062
work_keys_str_mv AT hanjiami echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT massereysolene echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT shlesingerdanielle echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT kuhnraphael echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT papadopoulouchrysa echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT agrafiotisandreas echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT kreinervictor echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT dizerensraphael echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT hongkailin echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT webercedric echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT greiffvictor echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT oxeniusannette echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT reddysait echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes
AT yermanosalexander echidnaintegratedsimulationsofsinglecellimmunereceptorrepertoiresandtranscriptomes