Cargando…
Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.)
Next-generation sequences (NGS) dataset of nanobody (Nb) clones in a phage display library (PDL) is of immense value as it serves in many different ways, such as: i). estimating the library size, ii). improving selection and identification of Nbs, iii). informing about frequency of V gene families,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7770539/ https://www.ncbi.nlm.nih.gov/pubmed/33385028 http://dx.doi.org/10.1016/j.dib.2020.106663 |
_version_ | 1783629530696515584 |
---|---|
author | Banerjee, Somesh Singh, Ajit Rawat, Jagveer Bansal, Nitish Maan, Sushila |
author_facet | Banerjee, Somesh Singh, Ajit Rawat, Jagveer Bansal, Nitish Maan, Sushila |
author_sort | Banerjee, Somesh |
collection | PubMed |
description | Next-generation sequences (NGS) dataset of nanobody (Nb) clones in a phage display library (PDL) is of immense value as it serves in many different ways, such as: i). estimating the library size, ii). improving selection and identification of Nbs, iii). informing about frequency of V gene families, diversity and length of CDRs, iv). high resolution analysis of natural and synthetic libraries, etc. [1], [2], [3]. We used a fraction of our previously constructed PDL of Nbs derived from an E. coli lipopolysaccharide-immunized Indian desert camel in order to obtain the dataset of NGS reads of Nbs. The cryo-preserved transformants library was revived to extract the Nb-encoding VHH (inserts)-pHEN4 (vector) DNA pool. The DNA sample was used for amplifying VHH pool by PCR [6]. The VHH amplicons band was gel-purified and subjected to NGS using Illumina MiSeq(TM) platform. ‘Nextra XT micro V2 Index’ kit was used for the Nb library DNA sample sequencing, with the adaptors: ‘i7’ (N706: TAGGCATG) and ‘i5’ (S517: GCGTAAGA). The raw data comprised of a total read count of 182146 (matched= 179591; unmatched=2555), with average read length of 130.33 bases and a total of 23.74 Mb. Of 179591 matched reads, 142004 were paired reads and 37587 broken paired reads. The raw data of NGS reads was submitted to NCBI Sequence Reads Archive accessible at URL: https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA516512 (dataset ref. [7]), and after analysis deposited in Mendeley Datasets repository, which is accessible at URL: [https://data.mendeley.com/datasets/4rsz3snvk5/3] (dataset ref. [8]). The sequence reads were analyzed by bioinformatics tools [9], [10], [11], [12]. The assembled consensus contigs revealed Nb orthologs of diverse Ag-specificities, including those isolated by conventional panning and Sanger-sequenced functional Nbs. Contig 1 CDR1-3 matched to those of anti-Trypanosoma evansi RoTat1.2 variant surface glycoprotein (VSG), while Contig 2 CDR1-3 matched to those of anti-LPS Nb clones isolated from the library. Contig 3 was however incomplete and lacked CDR3. Despite lacking the depth, the NGS data is a useful guide for selection of antigen-specific Nbs from the library, as demonstrated by anti-T. evansi VSG Nbs, and provides templates for Nb-based diagnostic reagents and therapeutic agents. |
format | Online Article Text |
id | pubmed-7770539 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-77705392020-12-30 Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) Banerjee, Somesh Singh, Ajit Rawat, Jagveer Bansal, Nitish Maan, Sushila Data Brief Data Article Next-generation sequences (NGS) dataset of nanobody (Nb) clones in a phage display library (PDL) is of immense value as it serves in many different ways, such as: i). estimating the library size, ii). improving selection and identification of Nbs, iii). informing about frequency of V gene families, diversity and length of CDRs, iv). high resolution analysis of natural and synthetic libraries, etc. [1], [2], [3]. We used a fraction of our previously constructed PDL of Nbs derived from an E. coli lipopolysaccharide-immunized Indian desert camel in order to obtain the dataset of NGS reads of Nbs. The cryo-preserved transformants library was revived to extract the Nb-encoding VHH (inserts)-pHEN4 (vector) DNA pool. The DNA sample was used for amplifying VHH pool by PCR [6]. The VHH amplicons band was gel-purified and subjected to NGS using Illumina MiSeq(TM) platform. ‘Nextra XT micro V2 Index’ kit was used for the Nb library DNA sample sequencing, with the adaptors: ‘i7’ (N706: TAGGCATG) and ‘i5’ (S517: GCGTAAGA). The raw data comprised of a total read count of 182146 (matched= 179591; unmatched=2555), with average read length of 130.33 bases and a total of 23.74 Mb. Of 179591 matched reads, 142004 were paired reads and 37587 broken paired reads. The raw data of NGS reads was submitted to NCBI Sequence Reads Archive accessible at URL: https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA516512 (dataset ref. [7]), and after analysis deposited in Mendeley Datasets repository, which is accessible at URL: [https://data.mendeley.com/datasets/4rsz3snvk5/3] (dataset ref. [8]). The sequence reads were analyzed by bioinformatics tools [9], [10], [11], [12]. The assembled consensus contigs revealed Nb orthologs of diverse Ag-specificities, including those isolated by conventional panning and Sanger-sequenced functional Nbs. Contig 1 CDR1-3 matched to those of anti-Trypanosoma evansi RoTat1.2 variant surface glycoprotein (VSG), while Contig 2 CDR1-3 matched to those of anti-LPS Nb clones isolated from the library. Contig 3 was however incomplete and lacked CDR3. Despite lacking the depth, the NGS data is a useful guide for selection of antigen-specific Nbs from the library, as demonstrated by anti-T. evansi VSG Nbs, and provides templates for Nb-based diagnostic reagents and therapeutic agents. Elsevier 2020-12-16 /pmc/articles/PMC7770539/ /pubmed/33385028 http://dx.doi.org/10.1016/j.dib.2020.106663 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Banerjee, Somesh Singh, Ajit Rawat, Jagveer Bansal, Nitish Maan, Sushila Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title | Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title_full | Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title_fullStr | Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title_full_unstemmed | Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title_short | Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.) |
title_sort | dataset of next-generation sequence reads of nanobody clones in phage display library derived from indian desert camel (camelus dromedarius l.) |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7770539/ https://www.ncbi.nlm.nih.gov/pubmed/33385028 http://dx.doi.org/10.1016/j.dib.2020.106663 |
work_keys_str_mv | AT banerjeesomesh datasetofnextgenerationsequencereadsofnanobodyclonesinphagedisplaylibraryderivedfromindiandesertcamelcamelusdromedariusl AT singhajit datasetofnextgenerationsequencereadsofnanobodyclonesinphagedisplaylibraryderivedfromindiandesertcamelcamelusdromedariusl AT rawatjagveer datasetofnextgenerationsequencereadsofnanobodyclonesinphagedisplaylibraryderivedfromindiandesertcamelcamelusdromedariusl AT bansalnitish datasetofnextgenerationsequencereadsofnanobodyclonesinphagedisplaylibraryderivedfromindiandesertcamelcamelusdromedariusl AT maansushila datasetofnextgenerationsequencereadsofnanobodyclonesinphagedisplaylibraryderivedfromindiandesertcamelcamelusdromedariusl |