Cargando…

Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads

One of the remaining challenges to describing an individual's genetic variation lies in the highly heterogeneous and complex genomic regions that impede the use of classical reference-guided mapping and assembly approaches. Once such region is the Immunoglobulin heavy chain locus (IGH), which i...

Descripción completa

Detalles Bibliográficos
Autores principales: Ford, Michael, Haghshenas, Ehsan, Watson, Corey T., Sahinalp, S. Cenk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7044747/
https://www.ncbi.nlm.nih.gov/pubmed/32109676
http://dx.doi.org/10.1016/j.isci.2020.100883
_version_ 1783501636861165568
author Ford, Michael
Haghshenas, Ehsan
Watson, Corey T.
Sahinalp, S. Cenk
author_facet Ford, Michael
Haghshenas, Ehsan
Watson, Corey T.
Sahinalp, S. Cenk
author_sort Ford, Michael
collection PubMed
description One of the remaining challenges to describing an individual's genetic variation lies in the highly heterogeneous and complex genomic regions that impede the use of classical reference-guided mapping and assembly approaches. Once such region is the Immunoglobulin heavy chain locus (IGH), which is critical for the development of antibodies and the adaptive immune system. We describe ImmunoTyper, the first PacBio-based genotyping and copy number calling tool specifically designed for IGH V genes (IGHV). We demonstrate that ImmunoTyper's multi-stage clustering and combinatorial optimization approach represents the most comprehensive IGHV genotyping approach published to date, through validation using gold-standard IGH reference sequence. This preliminary work establishes the feasibility of fine-grained genotype and copy number analysis using error-prone long reads in complex multi-gene loci and opens the door for in-depth investigation into IGHV heterogeneity using accessible and increasingly common whole-genome sequence.
format Online
Article
Text
id pubmed-7044747
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-70447472020-03-05 Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads Ford, Michael Haghshenas, Ehsan Watson, Corey T. Sahinalp, S. Cenk iScience Article One of the remaining challenges to describing an individual's genetic variation lies in the highly heterogeneous and complex genomic regions that impede the use of classical reference-guided mapping and assembly approaches. Once such region is the Immunoglobulin heavy chain locus (IGH), which is critical for the development of antibodies and the adaptive immune system. We describe ImmunoTyper, the first PacBio-based genotyping and copy number calling tool specifically designed for IGH V genes (IGHV). We demonstrate that ImmunoTyper's multi-stage clustering and combinatorial optimization approach represents the most comprehensive IGHV genotyping approach published to date, through validation using gold-standard IGH reference sequence. This preliminary work establishes the feasibility of fine-grained genotype and copy number analysis using error-prone long reads in complex multi-gene loci and opens the door for in-depth investigation into IGHV heterogeneity using accessible and increasingly common whole-genome sequence. Elsevier 2020-02-04 /pmc/articles/PMC7044747/ /pubmed/32109676 http://dx.doi.org/10.1016/j.isci.2020.100883 Text en http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Ford, Michael
Haghshenas, Ehsan
Watson, Corey T.
Sahinalp, S. Cenk
Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title_full Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title_fullStr Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title_full_unstemmed Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title_short Genotyping and Copy Number Analysis of Immunoglobin Heavy Chain Variable Genes Using Long Reads
title_sort genotyping and copy number analysis of immunoglobin heavy chain variable genes using long reads
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7044747/
https://www.ncbi.nlm.nih.gov/pubmed/32109676
http://dx.doi.org/10.1016/j.isci.2020.100883
work_keys_str_mv AT fordmichael genotypingandcopynumberanalysisofimmunoglobinheavychainvariablegenesusinglongreads
AT haghshenasehsan genotypingandcopynumberanalysisofimmunoglobinheavychainvariablegenesusinglongreads
AT watsoncoreyt genotypingandcopynumberanalysisofimmunoglobinheavychainvariablegenesusinglongreads
AT sahinalpscenk genotypingandcopynumberanalysisofimmunoglobinheavychainvariablegenesusinglongreads