Cargando…

High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities

T-cell specificity is determined by the T-cell receptor, a heterodimeric protein coded for by an extremely diverse set of genes produced by imprecise somatic gene recombination. Massively parallel high-throughput sequencing allows millions of different T-cell receptor genes to be characterized from...

Descripción completa

Detalles Bibliográficos
Autores principales: Heather, James M, Ismail, Mazlina, Oakes, Theres, Chain, Benny
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6054146/
https://www.ncbi.nlm.nih.gov/pubmed/28077404
http://dx.doi.org/10.1093/bib/bbw138
_version_ 1783340960884719616
author Heather, James M
Ismail, Mazlina
Oakes, Theres
Chain, Benny
author_facet Heather, James M
Ismail, Mazlina
Oakes, Theres
Chain, Benny
author_sort Heather, James M
collection PubMed
description T-cell specificity is determined by the T-cell receptor, a heterodimeric protein coded for by an extremely diverse set of genes produced by imprecise somatic gene recombination. Massively parallel high-throughput sequencing allows millions of different T-cell receptor genes to be characterized from a single sample of blood or tissue. However, the extraordinary heterogeneity of the immune repertoire poses significant challenges for subsequent analysis of the data. We outline the major steps in processing of repertoire data, considering low-level processing of raw sequence files and high-level algorithms, which seek to extract biological or pathological information. The latest generation of bioinformatics tools allows millions of DNA sequences to be accurately and rapidly assigned to their respective variable V and J gene segments, and to reconstruct an almost error-free representation of the non-templated additions and deletions that occur. High-level processing can measure the diversity of the repertoire in different samples, quantify V and J usage and identify private and public T-cell receptors. Finally, we discuss the major challenge of linking T-cell receptor sequence to function, and specifically to antigen recognition. Sophisticated machine learning algorithms are being developed that can combine the paradoxical degeneracy and cross-reactivity of individual T-cell receptors with the specificity of the overall T-cell immune response. Computational analysis will provide the key to unlock the potential of the T-cell receptor repertoire to give insight into the fundamental biology of the adaptive immune system and to provide powerful biomarkers of disease.
format Online
Article
Text
id pubmed-6054146
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60541462018-07-25 High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities Heather, James M Ismail, Mazlina Oakes, Theres Chain, Benny Brief Bioinform Paper T-cell specificity is determined by the T-cell receptor, a heterodimeric protein coded for by an extremely diverse set of genes produced by imprecise somatic gene recombination. Massively parallel high-throughput sequencing allows millions of different T-cell receptor genes to be characterized from a single sample of blood or tissue. However, the extraordinary heterogeneity of the immune repertoire poses significant challenges for subsequent analysis of the data. We outline the major steps in processing of repertoire data, considering low-level processing of raw sequence files and high-level algorithms, which seek to extract biological or pathological information. The latest generation of bioinformatics tools allows millions of DNA sequences to be accurately and rapidly assigned to their respective variable V and J gene segments, and to reconstruct an almost error-free representation of the non-templated additions and deletions that occur. High-level processing can measure the diversity of the repertoire in different samples, quantify V and J usage and identify private and public T-cell receptors. Finally, we discuss the major challenge of linking T-cell receptor sequence to function, and specifically to antigen recognition. Sophisticated machine learning algorithms are being developed that can combine the paradoxical degeneracy and cross-reactivity of individual T-cell receptors with the specificity of the overall T-cell immune response. Computational analysis will provide the key to unlock the potential of the T-cell receptor repertoire to give insight into the fundamental biology of the adaptive immune system and to provide powerful biomarkers of disease. Oxford University Press 2017-01-10 /pmc/articles/PMC6054146/ /pubmed/28077404 http://dx.doi.org/10.1093/bib/bbw138 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Paper
Heather, James M
Ismail, Mazlina
Oakes, Theres
Chain, Benny
High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title_full High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title_fullStr High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title_full_unstemmed High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title_short High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities
title_sort high-throughput sequencing of the t-cell receptor repertoire: pitfalls and opportunities
topic Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6054146/
https://www.ncbi.nlm.nih.gov/pubmed/28077404
http://dx.doi.org/10.1093/bib/bbw138
work_keys_str_mv AT heatherjamesm highthroughputsequencingofthetcellreceptorrepertoirepitfallsandopportunities
AT ismailmazlina highthroughputsequencingofthetcellreceptorrepertoirepitfallsandopportunities
AT oakestheres highthroughputsequencingofthetcellreceptorrepertoirepitfallsandopportunities
AT chainbenny highthroughputsequencingofthetcellreceptorrepertoirepitfallsandopportunities