A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice

The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB...


Bibliographic Details
Main authors: Jackson, Katherine J. L., Kos, Justin T., Lees, William, Gibson, William S., Smith, Melissa Laird, Peres, Ayelet, Yaari, Gur, Corcoran, Martin, Busse, Christian E., Ohlin, Mats, Watson, Corey T., Collins, Andrew M.
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects: Immunology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205180/
https://www.ncbi.nlm.nih.gov/pubmed/35720344
http://dx.doi.org/10.3389/fimmu.2022.888555
author Jackson, Katherine J. L.
Kos, Justin T.
Lees, William
Gibson, William S.
Smith, Melissa Laird
Peres, Ayelet
Yaari, Gur
Corcoran, Martin
Busse, Christian E.
Ohlin, Mats
Watson, Corey T.
Collins, Andrew M.
collection PubMed
description The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB/c x C57BL/6) hybrid animals. Strain variations were identified in the Ighm and Ighg2b genes, and analysis of VDJ rearrangements led to the inference of 278 germline IGHV alleles. 169 alleles are not present in the C57BL/6 genome reference sequence. To establish a set of expressed BALB/c IGHV germline gene sequences, we computationally retrieved IGHV haplotypes from the IgM dataset. Haplotyping led to the confirmation of 162 BALB/c IGHV gene sequences. A musIGHV398 pseudogene variant also appears to be present in the BALB/cByJ substrain, while a functional musIGHV398 gene is highly expressed in the BALB/cJ substrain. Only four of the BALB/c alleles were also observed in the C57BL/6 haplotype. The full set of inferred BALB/c sequences has been used to establish a BALB/c IGHV reference set, hosted at https://ogrdb.airr-community.org. We assessed whether assemblies from the Mouse Genome Project (MGP) are suitable for the determination of the genes of the IGH loci. Only 37 (43.5%) of the 85 confirmed IMGT-named BALB/c IGHV and 33 (42.9%) of the 77 confirmed non-IMGT IGHV were found in a search of the MGP BALB/cJ genome assembly. This suggests that current MGP assemblies are unsuitable for the comprehensive documentation of germline IGHVs and more efforts will be needed to establish strain-specific reference sets.
format Online
Article
Text
id pubmed-9205180
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-9205180 2022-06-18 Front Immunol Immunology Frontiers Media S.A. 2022-06-03 /pmc/articles/PMC9205180/ /pubmed/35720344 http://dx.doi.org/10.3389/fimmu.2022.888555 Text en Copyright © 2022 Jackson, Kos, Lees, Gibson, Smith, Peres, Yaari, Corcoran, Busse, Ohlin, Watson and Collins https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205180/
https://www.ncbi.nlm.nih.gov/pubmed/35720344
http://dx.doi.org/10.3389/fimmu.2022.888555