Cargando…

IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data

Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characteriz...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wei, Wang, I-Ming, Wang, Changxi, Lin, Liya, Chai, Xianghua, Wu, Jinghua, Bett, Andrew J., Dhanasekaran, Govindarajan, Casimiro, Danilo R., Liu, Xiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095119/
https://www.ncbi.nlm.nih.gov/pubmed/27867380
http://dx.doi.org/10.3389/fimmu.2016.00457
_version_ 1782465245660315648
author Zhang, Wei
Wang, I-Ming
Wang, Changxi
Lin, Liya
Chai, Xianghua
Wu, Jinghua
Bett, Andrew J.
Dhanasekaran, Govindarajan
Casimiro, Danilo R.
Liu, Xiao
author_facet Zhang, Wei
Wang, I-Ming
Wang, Changxi
Lin, Liya
Chai, Xianghua
Wu, Jinghua
Bett, Andrew J.
Dhanasekaran, Govindarajan
Casimiro, Danilo R.
Liu, Xiao
author_sort Zhang, Wei
collection PubMed
description Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characterized in detail to facilitate repertoire analyses. However, most species do not have well-characterized TCR/BCR germline genes because of their high homology. Also, more germline alleles are required for humans and other species, which limits the capacity for studying immune repertoires. Herein, we developed “Immune Germline Prediction” (IMPre), a tool for predicting germline V/J genes and alleles using deep-sequencing data derived from TCR/BCR repertoires. We developed a new algorithm, “Seed_Clust,” for clustering, produced a multiway tree for assembly and optimized the sequence according to the characteristics of rearrangement. We trained IMPre on human samples of T-cell receptor beta (TRB) and immunoglobulin heavy chain and then tested it on additional human samples. Accuracy of 97.7, 100, 92.9, and 100% was obtained for TRBV, TRBJ, IGHV, and IGHJ, respectively. Analyses of subsampling performance for these samples showed IMPre to be robust using different data quantities. Subsequently, IMPre was tested on samples from rhesus monkeys and human long sequences: the highly accurate results demonstrated IMPre to be stable with animal and multiple data types. With rapid accumulation of high-throughput sequence data for TCR and BCR repertoires, IMPre can be applied broadly for obtaining novel genes and a large number of novel alleles. IMPre is available at https://github.com/zhangwei2015/IMPre.
format Online
Article
Text
id pubmed-5095119
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-50951192016-11-18 IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data Zhang, Wei Wang, I-Ming Wang, Changxi Lin, Liya Chai, Xianghua Wu, Jinghua Bett, Andrew J. Dhanasekaran, Govindarajan Casimiro, Danilo R. Liu, Xiao Front Immunol Immunology Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characterized in detail to facilitate repertoire analyses. However, most species do not have well-characterized TCR/BCR germline genes because of their high homology. Also, more germline alleles are required for humans and other species, which limits the capacity for studying immune repertoires. Herein, we developed “Immune Germline Prediction” (IMPre), a tool for predicting germline V/J genes and alleles using deep-sequencing data derived from TCR/BCR repertoires. We developed a new algorithm, “Seed_Clust,” for clustering, produced a multiway tree for assembly and optimized the sequence according to the characteristics of rearrangement. We trained IMPre on human samples of T-cell receptor beta (TRB) and immunoglobulin heavy chain and then tested it on additional human samples. Accuracy of 97.7, 100, 92.9, and 100% was obtained for TRBV, TRBJ, IGHV, and IGHJ, respectively. Analyses of subsampling performance for these samples showed IMPre to be robust using different data quantities. Subsequently, IMPre was tested on samples from rhesus monkeys and human long sequences: the highly accurate results demonstrated IMPre to be stable with animal and multiple data types. With rapid accumulation of high-throughput sequence data for TCR and BCR repertoires, IMPre can be applied broadly for obtaining novel genes and a large number of novel alleles. IMPre is available at https://github.com/zhangwei2015/IMPre. Frontiers Media S.A. 2016-11-04 /pmc/articles/PMC5095119/ /pubmed/27867380 http://dx.doi.org/10.3389/fimmu.2016.00457 Text en Copyright © 2016 Zhang, Wang, Wang, Lin, Chai, Wu, Bett, Dhanasekaran, Casimiro and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Zhang, Wei
Wang, I-Ming
Wang, Changxi
Lin, Liya
Chai, Xianghua
Wu, Jinghua
Bett, Andrew J.
Dhanasekaran, Govindarajan
Casimiro, Danilo R.
Liu, Xiao
IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title_full IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title_fullStr IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title_full_unstemmed IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title_short IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
title_sort impre: an accurate and efficient software for prediction of t- and b-cell receptor germline genes and alleles from rearranged repertoire data
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095119/
https://www.ncbi.nlm.nih.gov/pubmed/27867380
http://dx.doi.org/10.3389/fimmu.2016.00457
work_keys_str_mv AT zhangwei impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT wangiming impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT wangchangxi impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT linliya impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT chaixianghua impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT wujinghua impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT bettandrewj impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT dhanasekarangovindarajan impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT casimirodanilor impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata
AT liuxiao impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata