A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice
The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB...
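Main authors: | Jackson, Katherine J. L.; Kos, Justin T.; Lees, William; Gibson, William S.; Smith, Melissa Laird; Peres, Ayelet; Yaari, Gur; Corcoran, Martin; Busse, Christian E.; Ohlin, Mats; Watson, Corey T.; Collins, Andrew M. |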
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2022 |
Subjects: | Immunology |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205180/ https://www.ncbi.nlm.nih.gov/pubmed/35720344 http://dx.doi.org/10.3389/fimmu.2022.888555 |
_version_ | 1784729073756405760 |
---|---|
author | Jackson, Katherine J. L. Kos, Justin T. Lees, William Gibson, William S. Smith, Melissa Laird Peres, Ayelet Yaari, Gur Corcoran, Martin Busse, Christian E. Ohlin, Mats Watson, Corey T. Collins, Andrew M. |
author_facet | Jackson, Katherine J. L. Kos, Justin T. Lees, William Gibson, William S. Smith, Melissa Laird Peres, Ayelet Yaari, Gur Corcoran, Martin Busse, Christian E. Ohlin, Mats Watson, Corey T. Collins, Andrew M. |
author_sort | Jackson, Katherine J. L. |
collection | PubMed |
description | The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB/c x C57BL/6) hybrid animals. Strain variations were identified in the Ighm and Ighg2b genes, and analysis of VDJ rearrangements led to the inference of 278 germline IGHV alleles. 169 alleles are not present in the C57BL/6 genome reference sequence. To establish a set of expressed BALB/c IGHV germline gene sequences, we computationally retrieved IGHV haplotypes from the IgM dataset. Haplotyping led to the confirmation of 162 BALB/c IGHV gene sequences. A musIGHV398 pseudogene variant also appears to be present in the BALB/cByJ substrain, while a functional musIGHV398 gene is highly expressed in the BALB/cJ substrain. Only four of the BALB/c alleles were also observed in the C57BL/6 haplotype. The full set of inferred BALB/c sequences has been used to establish a BALB/c IGHV reference set, hosted at https://ogrdb.airr-community.org. We assessed whether assemblies from the Mouse Genome Project (MGP) are suitable for the determination of the genes of the IGH loci. Only 37 (43.5%) of the 85 confirmed IMGT-named BALB/c IGHV and 33 (42.9%) of the 77 confirmed non-IMGT IGHV were found in a search of the MGP BALB/cJ genome assembly. This suggests that current MGP assemblies are unsuitable for the comprehensive documentation of germline IGHVs and more efforts will be needed to establish strain-specific reference sets. |
format | Online Article Text |
id | pubmed-9205180 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-9205180 2022-06-18 A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice Jackson, Katherine J. L. Kos, Justin T. Lees, William Gibson, William S. Smith, Melissa Laird Peres, Ayelet Yaari, Gur Corcoran, Martin Busse, Christian E. Ohlin, Mats Watson, Corey T. Collins, Andrew M. Front Immunol Immunology The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB/c x C57BL/6) hybrid animals. Strain variations were identified in the Ighm and Ighg2b genes, and analysis of VDJ rearrangements led to the inference of 278 germline IGHV alleles. 169 alleles are not present in the C57BL/6 genome reference sequence. To establish a set of expressed BALB/c IGHV germline gene sequences, we computationally retrieved IGHV haplotypes from the IgM dataset. Haplotyping led to the confirmation of 162 BALB/c IGHV gene sequences. A musIGHV398 pseudogene variant also appears to be present in the BALB/cByJ substrain, while a functional musIGHV398 gene is highly expressed in the BALB/cJ substrain. Only four of the BALB/c alleles were also observed in the C57BL/6 haplotype. The full set of inferred BALB/c sequences has been used to establish a BALB/c IGHV reference set, hosted at https://ogrdb.airr-community.org. We assessed whether assemblies from the Mouse Genome Project (MGP) are suitable for the determination of the genes of the IGH loci. Only 37 (43.5%) of the 85 confirmed IMGT-named BALB/c IGHV and 33 (42.9%) of the 77 confirmed non-IMGT IGHV were found in a search of the MGP BALB/cJ genome assembly. This suggests that current MGP assemblies are unsuitable for the comprehensive documentation of germline IGHVs and more efforts will be needed to establish strain-specific reference sets. Frontiers Media S.A. 2022-06-03 /pmc/articles/PMC9205180/ /pubmed/35720344 http://dx.doi.org/10.3389/fimmu.2022.888555 Text en Copyright © 2022 Jackson, Kos, Lees, Gibson, Smith, Peres, Yaari, Corcoran, Busse, Ohlin, Watson and Collins https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Immunology Jackson, Katherine J. L. Kos, Justin T. Lees, William Gibson, William S. Smith, Melissa Laird Peres, Ayelet Yaari, Gur Corcoran, Martin Busse, Christian E. Ohlin, Mats Watson, Corey T. Collins, Andrew M. A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title | A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title_full | A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title_fullStr | A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title_full_unstemmed | A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title_short | A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice |
title_sort | balb/c ighv reference set, defined by haplotype analysis of long-read vdj-c sequences from f1 (balb/c x c57bl/6) mice |
topic | Immunology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205180/ https://www.ncbi.nlm.nih.gov/pubmed/35720344 http://dx.doi.org/10.3389/fimmu.2022.888555 |