Cargando…

RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy

High-throughput sequencing (HTS) has the potential to decipher the diversity of T cell repertoires and their dynamics during immune responses. Applied to T cell subsets such as T effector and T regulatory cells, it should help identify novel biomarkers of diseases. However, given the extreme diversi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaara, Wahiba, Gonzalez-Tort, Ariadna, Florez, Laura-Maria, Klatzmann, David, Mariotti-Ferrandiz, Encarnita, Six, Adrien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5962720/
https://www.ncbi.nlm.nih.gov/pubmed/29868003
http://dx.doi.org/10.3389/fimmu.2018.01038
_version_ 1783324926269194240
author Chaara, Wahiba
Gonzalez-Tort, Ariadna
Florez, Laura-Maria
Klatzmann, David
Mariotti-Ferrandiz, Encarnita
Six, Adrien
author_facet Chaara, Wahiba
Gonzalez-Tort, Ariadna
Florez, Laura-Maria
Klatzmann, David
Mariotti-Ferrandiz, Encarnita
Six, Adrien
author_sort Chaara, Wahiba
collection PubMed
description High-throughput sequencing (HTS) has the potential to decipher the diversity of T cell repertoires and their dynamics during immune responses. Applied to T cell subsets such as T effector and T regulatory cells, it should help identify novel biomarkers of diseases. However, given the extreme diversity of TCR repertoires, understanding how the sequencing conditions, including cell numbers, biological and technical sampling and sequencing depth, impact the experimental outcome is critical to proper use of these data. Here, we assessed the representativeness and robustness of TCR repertoire diversity assessment according to experimental conditions. By comparative analyses of experimental datasets and computer simulations, we found that (i) for small samples, the number of clonotypes recovered is often higher than the number of cells per sample, even after removing the singletons; (ii) high-sequencing depth for small samples alters the clonotype distributions, which can be corrected by filtering the datasets using Shannon entropy as a threshold; and (iii) a single sequencing run at high depth does not ensure a good coverage of the clonotype richness in highly polyclonal populations, which can be better covered using multiple sequencing. Altogether, our results warrant better understanding and awareness of the limitation of TCR diversity analyses by HTS and justify the development of novel computational tools for improved modeling of the highly complex nature of TCR repertoires.
format Online
Article
Text
id pubmed-5962720
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59627202018-06-04 RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy Chaara, Wahiba Gonzalez-Tort, Ariadna Florez, Laura-Maria Klatzmann, David Mariotti-Ferrandiz, Encarnita Six, Adrien Front Immunol Immunology High-throughput sequencing (HTS) has the potential to decipher the diversity of T cell repertoires and their dynamics during immune responses. Applied to T cell subsets such as T effector and T regulatory cells, it should help identify novel biomarkers of diseases. However, given the extreme diversity of TCR repertoires, understanding how the sequencing conditions, including cell numbers, biological and technical sampling and sequencing depth, impact the experimental outcome is critical to proper use of these data. Here, we assessed the representativeness and robustness of TCR repertoire diversity assessment according to experimental conditions. By comparative analyses of experimental datasets and computer simulations, we found that (i) for small samples, the number of clonotypes recovered is often higher than the number of cells per sample, even after removing the singletons; (ii) high-sequencing depth for small samples alters the clonotype distributions, which can be corrected by filtering the datasets using Shannon entropy as a threshold; and (iii) a single sequencing run at high depth does not ensure a good coverage of the clonotype richness in highly polyclonal populations, which can be better covered using multiple sequencing. Altogether, our results warrant better understanding and awareness of the limitation of TCR diversity analyses by HTS and justify the development of novel computational tools for improved modeling of the highly complex nature of TCR repertoires. Frontiers Media S.A. 2018-05-15 /pmc/articles/PMC5962720/ /pubmed/29868003 http://dx.doi.org/10.3389/fimmu.2018.01038 Text en Copyright © 2018 Chaara, Gonzalez-Tort, Florez, Klatzmann, Mariotti-Ferrandiz and Six. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Chaara, Wahiba
Gonzalez-Tort, Ariadna
Florez, Laura-Maria
Klatzmann, David
Mariotti-Ferrandiz, Encarnita
Six, Adrien
RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title_full RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title_fullStr RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title_full_unstemmed RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title_short RepSeq Data Representativeness and Robustness Assessment by Shannon Entropy
title_sort repseq data representativeness and robustness assessment by shannon entropy
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5962720/
https://www.ncbi.nlm.nih.gov/pubmed/29868003
http://dx.doi.org/10.3389/fimmu.2018.01038
work_keys_str_mv AT chaarawahiba repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy
AT gonzaleztortariadna repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy
AT florezlauramaria repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy
AT klatzmanndavid repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy
AT mariottiferrandizencarnita repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy
AT sixadrien repseqdatarepresentativenessandrobustnessassessmentbyshannonentropy