Cargando…

Practical guidelines for B-cell receptor repertoire sequencing analysis

High-throughput sequencing of B-cell immunoglobulin repertoires is increasingly being applied to gain insights into the adaptive immune response in healthy individuals and in those with a wide range of diseases. Recent applications include the study of autoimmunity, infection, allergy, cancer and ag...

Descripción completa

Detalles Bibliográficos
Autores principales: Yaari, Gur, Kleinstein, Steven H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4654805/
https://www.ncbi.nlm.nih.gov/pubmed/26589402
http://dx.doi.org/10.1186/s13073-015-0243-2
_version_ 1782402102985752576
author Yaari, Gur
Kleinstein, Steven H.
author_facet Yaari, Gur
Kleinstein, Steven H.
author_sort Yaari, Gur
collection PubMed
description High-throughput sequencing of B-cell immunoglobulin repertoires is increasingly being applied to gain insights into the adaptive immune response in healthy individuals and in those with a wide range of diseases. Recent applications include the study of autoimmunity, infection, allergy, cancer and aging. As sequencing technologies continue to improve, these repertoire sequencing experiments are producing ever larger datasets, with tens- to hundreds-of-millions of sequences. These data require specialized bioinformatics pipelines to be analyzed effectively. Numerous methods and tools have been developed to handle different steps of the analysis, and integrated software suites have recently been made available. However, the field has yet to converge on a standard pipeline for data processing and analysis. Common file formats for data sharing are also lacking. Here we provide a set of practical guidelines for B-cell receptor repertoire sequencing analysis, starting from raw sequencing reads and proceeding through pre-processing, determination of population structure, and analysis of repertoire properties. These include methods for unique molecular identifiers and sequencing error correction, V(D)J assignment and detection of novel alleles, clonal assignment, lineage tree construction, somatic hypermutation modeling, selection analysis, and analysis of stereotyped or convergent responses. The guidelines presented here highlight the major steps involved in the analysis of B-cell repertoire sequencing data, along with recommendations on how to avoid common pitfalls.
format Online
Article
Text
id pubmed-4654805
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46548052015-11-22 Practical guidelines for B-cell receptor repertoire sequencing analysis Yaari, Gur Kleinstein, Steven H. Genome Med Review High-throughput sequencing of B-cell immunoglobulin repertoires is increasingly being applied to gain insights into the adaptive immune response in healthy individuals and in those with a wide range of diseases. Recent applications include the study of autoimmunity, infection, allergy, cancer and aging. As sequencing technologies continue to improve, these repertoire sequencing experiments are producing ever larger datasets, with tens- to hundreds-of-millions of sequences. These data require specialized bioinformatics pipelines to be analyzed effectively. Numerous methods and tools have been developed to handle different steps of the analysis, and integrated software suites have recently been made available. However, the field has yet to converge on a standard pipeline for data processing and analysis. Common file formats for data sharing are also lacking. Here we provide a set of practical guidelines for B-cell receptor repertoire sequencing analysis, starting from raw sequencing reads and proceeding through pre-processing, determination of population structure, and analysis of repertoire properties. These include methods for unique molecular identifiers and sequencing error correction, V(D)J assignment and detection of novel alleles, clonal assignment, lineage tree construction, somatic hypermutation modeling, selection analysis, and analysis of stereotyped or convergent responses. The guidelines presented here highlight the major steps involved in the analysis of B-cell repertoire sequencing data, along with recommendations on how to avoid common pitfalls. BioMed Central 2015-11-20 /pmc/articles/PMC4654805/ /pubmed/26589402 http://dx.doi.org/10.1186/s13073-015-0243-2 Text en © Yaari and Kleinstein. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Review
Yaari, Gur
Kleinstein, Steven H.
Practical guidelines for B-cell receptor repertoire sequencing analysis
title Practical guidelines for B-cell receptor repertoire sequencing analysis
title_full Practical guidelines for B-cell receptor repertoire sequencing analysis
title_fullStr Practical guidelines for B-cell receptor repertoire sequencing analysis
title_full_unstemmed Practical guidelines for B-cell receptor repertoire sequencing analysis
title_short Practical guidelines for B-cell receptor repertoire sequencing analysis
title_sort practical guidelines for b-cell receptor repertoire sequencing analysis
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4654805/
https://www.ncbi.nlm.nih.gov/pubmed/26589402
http://dx.doi.org/10.1186/s13073-015-0243-2
work_keys_str_mv AT yaarigur practicalguidelinesforbcellreceptorrepertoiresequencinganalysis
AT kleinsteinstevenh practicalguidelinesforbcellreceptorrepertoiresequencinganalysis