
immuneSIM: tunable multi-feature simulation of B- and T-cell receptor repertoires for immunoinformatics benchmarking

SUMMARY: B- and T-cell receptor repertoires of the adaptive immune system have become a key target for diagnostics and therapeutics research. Consequently, there is a rapidly growing number of bioinformatics tools for immune repertoire analysis. Benchmarking of such tools is crucial for ensuring rep...


Bibliographic Details
Main authors: Weber, Cédric R; Akbar, Rahmad; Yermanos, Alexander; Pavlović, Milena; Snapkov, Igor; Sandve, Geir K; Reddy, Sai T; Greiff, Victor
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334888/
https://www.ncbi.nlm.nih.gov/pubmed/32154832
http://dx.doi.org/10.1093/bioinformatics/btaa158
author Weber, Cédric R
Akbar, Rahmad
Yermanos, Alexander
Pavlović, Milena
Snapkov, Igor
Sandve, Geir K
Reddy, Sai T
Greiff, Victor
collection PubMed
description SUMMARY: B- and T-cell receptor repertoires of the adaptive immune system have become a key target for diagnostics and therapeutics research. Consequently, there is a rapidly growing number of bioinformatics tools for immune repertoire analysis. Benchmarking of such tools is crucial for ensuring reproducible and generalizable computational analyses. Currently, however, it remains challenging to create standardized ground truth immune receptor repertoires for immunoinformatics tool benchmarking. Therefore, we developed immuneSIM, an R package that allows the simulation of native-like and aberrant synthetic full-length variable region immune receptor sequences by tuning the following immune receptor features: (i) species and chain type (BCR, TCR, single and paired), (ii) germline gene usage, (iii) occurrence of insertions and deletions, (iv) clonal abundance, (v) somatic hypermutation and (vi) sequence motifs. Each simulated sequence is annotated by the complete set of simulation events that contributed to its in silico generation. immuneSIM permits the benchmarking of key computational tools for immune receptor analysis, such as germline gene annotation, diversity and overlap estimation, sequence similarity, network architecture, clustering analysis and machine learning methods for motif detection. AVAILABILITY AND IMPLEMENTATION: The package is available via https://github.com/GreiffLab/immuneSIM and on CRAN at https://cran.r-project.org/web/packages/immuneSIM. The documentation is hosted at https://immuneSIM.readthedocs.io. CONTACT: sai.reddy@ethz.ch or victor.greiff@medisin.uio.no SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7334888
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-7334888 2020-07-13 immuneSIM: tunable multi-feature simulation of B- and T-cell receptor repertoires for immunoinformatics benchmarking Weber, Cédric R Akbar, Rahmad Yermanos, Alexander Pavlović, Milena Snapkov, Igor Sandve, Geir K Reddy, Sai T Greiff, Victor Bioinformatics Applications Notes Oxford University Press 2020-06 2020-04-14 /pmc/articles/PMC7334888/ /pubmed/32154832 http://dx.doi.org/10.1093/bioinformatics/btaa158 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title immuneSIM: tunable multi-feature simulation of B- and T-cell receptor repertoires for immunoinformatics benchmarking
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334888/
https://www.ncbi.nlm.nih.gov/pubmed/32154832
http://dx.doi.org/10.1093/bioinformatics/btaa158