Cargando…

A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV

Hepatitis B virus (HBV) remains a global public health concern, with over 250 million individuals living with chronic HBV infection (CHB) and no curative therapy currently available. Viral diversity is associated with CHB pathogenesis and immunological control of infection. Improved methods to chara...

Descripción completa

Detalles Bibliográficos
Autores principales: Duchen, Dylan, Clipman, Steven, Vergara, Candelaria, Thio, Chloe L., Thomas, David L., Duggal, Priya, Wojcik, Genevieve L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882026/
https://www.ncbi.nlm.nih.gov/pubmed/36711598
http://dx.doi.org/10.1101/2023.01.11.523611
_version_ 1784879226995867648
author Duchen, Dylan
Clipman, Steven
Vergara, Candelaria
Thio, Chloe L.
Thomas, David L.
Duggal, Priya
Wojcik, Genevieve L.
author_facet Duchen, Dylan
Clipman, Steven
Vergara, Candelaria
Thio, Chloe L.
Thomas, David L.
Duggal, Priya
Wojcik, Genevieve L.
author_sort Duchen, Dylan
collection PubMed
description Hepatitis B virus (HBV) remains a global public health concern, with over 250 million individuals living with chronic HBV infection (CHB) and no curative therapy currently available. Viral diversity is associated with CHB pathogenesis and immunological control of infection. Improved methods to characterize the viral genome at both the population and intra-host level could aid drug development efforts. Conventionally, HBV sequencing data are aligned to a linear reference genome and only sequences capable of aligning to the reference are captured for analysis. Reference selection has additional consequences, including sample-specific ‘consensus’ sequence construction. It remains unclear how to select a reference from available sequences and whether a single reference is sufficient for genetic analyses. Using simulated short-read sequencing data generated from full-length publicly available HBV genome sequences and HBV sequencing data from a longitudinally sampled individual with CHB, we investigate alternative graph-based alignment approaches. We demonstrate that using a phylogenetically representative ‘genome graph’ for alignment, rather than linear reference sequences, avoids issues of reference ambiguity, improves alignment, and facilitates the construction of sample-specific consensus sequences genetically similar to an individual’s infection. Graph-based methods can therefore improve efforts to characterize the genetics of viral pathogens, including HBV, and may have broad implications in host pathogen research.
format Online
Article
Text
id pubmed-9882026
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98820262023-01-28 A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV Duchen, Dylan Clipman, Steven Vergara, Candelaria Thio, Chloe L. Thomas, David L. Duggal, Priya Wojcik, Genevieve L. bioRxiv Article Hepatitis B virus (HBV) remains a global public health concern, with over 250 million individuals living with chronic HBV infection (CHB) and no curative therapy currently available. Viral diversity is associated with CHB pathogenesis and immunological control of infection. Improved methods to characterize the viral genome at both the population and intra-host level could aid drug development efforts. Conventionally, HBV sequencing data are aligned to a linear reference genome and only sequences capable of aligning to the reference are captured for analysis. Reference selection has additional consequences, including sample-specific ‘consensus’ sequence construction. It remains unclear how to select a reference from available sequences and whether a single reference is sufficient for genetic analyses. Using simulated short-read sequencing data generated from full-length publicly available HBV genome sequences and HBV sequencing data from a longitudinally sampled individual with CHB, we investigate alternative graph-based alignment approaches. We demonstrate that using a phylogenetically representative ‘genome graph’ for alignment, rather than linear reference sequences, avoids issues of reference ambiguity, improves alignment, and facilitates the construction of sample-specific consensus sequences genetically similar to an individual’s infection. Graph-based methods can therefore improve efforts to characterize the genetics of viral pathogens, including HBV, and may have broad implications in host pathogen research. Cold Spring Harbor Laboratory 2023-01-12 /pmc/articles/PMC9882026/ /pubmed/36711598 http://dx.doi.org/10.1101/2023.01.11.523611 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Duchen, Dylan
Clipman, Steven
Vergara, Candelaria
Thio, Chloe L.
Thomas, David L.
Duggal, Priya
Wojcik, Genevieve L.
A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title_full A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title_fullStr A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title_full_unstemmed A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title_short A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV
title_sort hepatitis b virus (hbv) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of hbv
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882026/
https://www.ncbi.nlm.nih.gov/pubmed/36711598
http://dx.doi.org/10.1101/2023.01.11.523611
work_keys_str_mv AT duchendylan ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT clipmansteven ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT vergaracandelaria ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT thiochloel ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT thomasdavidl ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT duggalpriya ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT wojcikgenevievel ahepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT duchendylan hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT clipmansteven hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT vergaracandelaria hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT thiochloel hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT thomasdavidl hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT duggalpriya hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv
AT wojcikgenevievel hepatitisbvirushbvsequencevariationgraphimprovessequencealignmentandsamplespecificconsensussequenceconstructionforgeneticanalysisofhbv