Cargando…
IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data
Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characteriz...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095119/ https://www.ncbi.nlm.nih.gov/pubmed/27867380 http://dx.doi.org/10.3389/fimmu.2016.00457 |
_version_ | 1782465245660315648 |
---|---|
author | Zhang, Wei Wang, I-Ming Wang, Changxi Lin, Liya Chai, Xianghua Wu, Jinghua Bett, Andrew J. Dhanasekaran, Govindarajan Casimiro, Danilo R. Liu, Xiao |
author_facet | Zhang, Wei Wang, I-Ming Wang, Changxi Lin, Liya Chai, Xianghua Wu, Jinghua Bett, Andrew J. Dhanasekaran, Govindarajan Casimiro, Danilo R. Liu, Xiao |
author_sort | Zhang, Wei |
collection | PubMed |
description | Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characterized in detail to facilitate repertoire analyses. However, most species do not have well-characterized TCR/BCR germline genes because of their high homology. Also, more germline alleles are required for humans and other species, which limits the capacity for studying immune repertoires. Herein, we developed “Immune Germline Prediction” (IMPre), a tool for predicting germline V/J genes and alleles using deep-sequencing data derived from TCR/BCR repertoires. We developed a new algorithm, “Seed_Clust,” for clustering, produced a multiway tree for assembly and optimized the sequence according to the characteristics of rearrangement. We trained IMPre on human samples of T-cell receptor beta (TRB) and immunoglobulin heavy chain and then tested it on additional human samples. Accuracy of 97.7, 100, 92.9, and 100% was obtained for TRBV, TRBJ, IGHV, and IGHJ, respectively. Analyses of subsampling performance for these samples showed IMPre to be robust using different data quantities. Subsequently, IMPre was tested on samples from rhesus monkeys and human long sequences: the highly accurate results demonstrated IMPre to be stable with animal and multiple data types. With rapid accumulation of high-throughput sequence data for TCR and BCR repertoires, IMPre can be applied broadly for obtaining novel genes and a large number of novel alleles. IMPre is available at https://github.com/zhangwei2015/IMPre. |
format | Online Article Text |
id | pubmed-5095119 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50951192016-11-18 IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data Zhang, Wei Wang, I-Ming Wang, Changxi Lin, Liya Chai, Xianghua Wu, Jinghua Bett, Andrew J. Dhanasekaran, Govindarajan Casimiro, Danilo R. Liu, Xiao Front Immunol Immunology Large-scale study of the properties of T-cell receptor (TCR) and B-cell receptor (BCR) repertoires through next-generation sequencing is providing excellent insights into the understanding of adaptive immune responses. Variable(Diversity)Joining [V(D)J] germline genes and alleles must be characterized in detail to facilitate repertoire analyses. However, most species do not have well-characterized TCR/BCR germline genes because of their high homology. Also, more germline alleles are required for humans and other species, which limits the capacity for studying immune repertoires. Herein, we developed “Immune Germline Prediction” (IMPre), a tool for predicting germline V/J genes and alleles using deep-sequencing data derived from TCR/BCR repertoires. We developed a new algorithm, “Seed_Clust,” for clustering, produced a multiway tree for assembly and optimized the sequence according to the characteristics of rearrangement. We trained IMPre on human samples of T-cell receptor beta (TRB) and immunoglobulin heavy chain and then tested it on additional human samples. Accuracy of 97.7, 100, 92.9, and 100% was obtained for TRBV, TRBJ, IGHV, and IGHJ, respectively. Analyses of subsampling performance for these samples showed IMPre to be robust using different data quantities. Subsequently, IMPre was tested on samples from rhesus monkeys and human long sequences: the highly accurate results demonstrated IMPre to be stable with animal and multiple data types. With rapid accumulation of high-throughput sequence data for TCR and BCR repertoires, IMPre can be applied broadly for obtaining novel genes and a large number of novel alleles. IMPre is available at https://github.com/zhangwei2015/IMPre. Frontiers Media S.A. 2016-11-04 /pmc/articles/PMC5095119/ /pubmed/27867380 http://dx.doi.org/10.3389/fimmu.2016.00457 Text en Copyright © 2016 Zhang, Wang, Wang, Lin, Chai, Wu, Bett, Dhanasekaran, Casimiro and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Immunology Zhang, Wei Wang, I-Ming Wang, Changxi Lin, Liya Chai, Xianghua Wu, Jinghua Bett, Andrew J. Dhanasekaran, Govindarajan Casimiro, Danilo R. Liu, Xiao IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title | IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title_full | IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title_fullStr | IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title_full_unstemmed | IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title_short | IMPre: An Accurate and Efficient Software for Prediction of T- and B-Cell Receptor Germline Genes and Alleles from Rearranged Repertoire Data |
title_sort | impre: an accurate and efficient software for prediction of t- and b-cell receptor germline genes and alleles from rearranged repertoire data |
topic | Immunology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095119/ https://www.ncbi.nlm.nih.gov/pubmed/27867380 http://dx.doi.org/10.3389/fimmu.2016.00457 |
work_keys_str_mv | AT zhangwei impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT wangiming impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT wangchangxi impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT linliya impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT chaixianghua impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT wujinghua impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT bettandrewj impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT dhanasekarangovindarajan impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT casimirodanilor impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata AT liuxiao impreanaccurateandefficientsoftwareforpredictionoftandbcellreceptorgermlinegenesandallelesfromrearrangedrepertoiredata |