Cargando…

Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes

Advancements in sequencing technologies and assembly methods enable the regular production of high-quality genome assemblies characterizing complex regions. However, challenges remain in efficiently interpreting variation at various scales, from smaller tandem repeats to megabase rearrangements, acr...

Descripción completa

Detalles Bibliográficos
Autores principales: Chin, Chen-Shan, Behera, Sairam, Khalak, Asif, Sedlazeck, Fritz J., Sudmant, Peter H., Wagner, Justin, Zook, Justin M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406601/
https://www.ncbi.nlm.nih.gov/pubmed/37365340
http://dx.doi.org/10.1038/s41592-023-01914-y
_version_ 1785085778026561536
author Chin, Chen-Shan
Behera, Sairam
Khalak, Asif
Sedlazeck, Fritz J.
Sudmant, Peter H.
Wagner, Justin
Zook, Justin M.
author_facet Chin, Chen-Shan
Behera, Sairam
Khalak, Asif
Sedlazeck, Fritz J.
Sudmant, Peter H.
Wagner, Justin
Zook, Justin M.
author_sort Chin, Chen-Shan
collection PubMed
description Advancements in sequencing technologies and assembly methods enable the regular production of high-quality genome assemblies characterizing complex regions. However, challenges remain in efficiently interpreting variation at various scales, from smaller tandem repeats to megabase rearrangements, across many human genomes. We present a PanGenome Research Tool Kit (PGR-TK) enabling analyses of complex pangenome structural and haplotype variation at multiple scales. We apply the graph decomposition methods in PGR-TK to the class II major histocompatibility complex demonstrating the importance of the human pangenome for analyzing complicated regions. Moreover, we investigate the Y-chromosome genes, DAZ1/DAZ2/DAZ3/DAZ4, of which structural variants have been linked to male infertility, and X-chromosome genes OPN1LW and OPN1MW linked to eye disorders. We further showcase PGR-TK across 395 complex repetitive medically important genes. This highlights the power of PGR-TK to resolve complex variation in regions of the genome that were previously too complex to analyze.
format Online
Article
Text
id pubmed-10406601
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-104066012023-08-09 Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes Chin, Chen-Shan Behera, Sairam Khalak, Asif Sedlazeck, Fritz J. Sudmant, Peter H. Wagner, Justin Zook, Justin M. Nat Methods Article Advancements in sequencing technologies and assembly methods enable the regular production of high-quality genome assemblies characterizing complex regions. However, challenges remain in efficiently interpreting variation at various scales, from smaller tandem repeats to megabase rearrangements, across many human genomes. We present a PanGenome Research Tool Kit (PGR-TK) enabling analyses of complex pangenome structural and haplotype variation at multiple scales. We apply the graph decomposition methods in PGR-TK to the class II major histocompatibility complex demonstrating the importance of the human pangenome for analyzing complicated regions. Moreover, we investigate the Y-chromosome genes, DAZ1/DAZ2/DAZ3/DAZ4, of which structural variants have been linked to male infertility, and X-chromosome genes OPN1LW and OPN1MW linked to eye disorders. We further showcase PGR-TK across 395 complex repetitive medically important genes. This highlights the power of PGR-TK to resolve complex variation in regions of the genome that were previously too complex to analyze. Nature Publishing Group US 2023-06-26 2023 /pmc/articles/PMC10406601/ /pubmed/37365340 http://dx.doi.org/10.1038/s41592-023-01914-y Text en © The Author(s), under exclusive licence to Springer Nature America, Inc. 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chin, Chen-Shan
Behera, Sairam
Khalak, Asif
Sedlazeck, Fritz J.
Sudmant, Peter H.
Wagner, Justin
Zook, Justin M.
Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title_full Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title_fullStr Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title_full_unstemmed Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title_short Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
title_sort multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406601/
https://www.ncbi.nlm.nih.gov/pubmed/37365340
http://dx.doi.org/10.1038/s41592-023-01914-y
work_keys_str_mv AT chinchenshan multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT beherasairam multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT khalakasif multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT sedlazeckfritzj multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT sudmantpeterh multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT wagnerjustin multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes
AT zookjustinm multiscaleanalysisofpangenomesenablesimprovedrepresentationofgenomicdiversityforrepetitiveandclinicallyrelevantgenes