Cargando…

Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus

BACKGROUND: Hepatitis B virus (HBV) quasispecies are crucial in the pathogenesis of chronic liver disease. Next-generation sequencing (NGS) is powerful for identifying viral quasispecies. To improve mapping quality and single nucleotide variant (SNV) calling accuracy in the NGS analysis of HBV, we c...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Wen-Chun, Lin, Chih-Peng, Cheng, Chun-Pei, Ho, Cheng-Hsun, Lan, Kuo-Lun, Cheng, Ji-Hong, Yen, Chia-Jui, Cheng, Pin-Nan, Wu, I-Chin, Li, I-Chen, Chang, Bill Chia-Han, Tseng, Vincent S., Chiu, Yen-Cheng, Chang, Ting-Tsung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer India 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4722079/
https://www.ncbi.nlm.nih.gov/pubmed/26208819
http://dx.doi.org/10.1007/s12072-015-9645-x
_version_ 1782411316287242240
author Liu, Wen-Chun
Lin, Chih-Peng
Cheng, Chun-Pei
Ho, Cheng-Hsun
Lan, Kuo-Lun
Cheng, Ji-Hong
Yen, Chia-Jui
Cheng, Pin-Nan
Wu, I-Chin
Li, I-Chen
Chang, Bill Chia-Han
Tseng, Vincent S.
Chiu, Yen-Cheng
Chang, Ting-Tsung
author_facet Liu, Wen-Chun
Lin, Chih-Peng
Cheng, Chun-Pei
Ho, Cheng-Hsun
Lan, Kuo-Lun
Cheng, Ji-Hong
Yen, Chia-Jui
Cheng, Pin-Nan
Wu, I-Chin
Li, I-Chen
Chang, Bill Chia-Han
Tseng, Vincent S.
Chiu, Yen-Cheng
Chang, Ting-Tsung
author_sort Liu, Wen-Chun
collection PubMed
description BACKGROUND: Hepatitis B virus (HBV) quasispecies are crucial in the pathogenesis of chronic liver disease. Next-generation sequencing (NGS) is powerful for identifying viral quasispecies. To improve mapping quality and single nucleotide variant (SNV) calling accuracy in the NGS analysis of HBV, we compared different mapping references, including the sample-specific reference sequence, same genotype sequences and different genotype sequences, according to the sample. METHODS: Real Illumina HBV datasets from 86 patients, and simulated datasets from 158 HBV strains in the GenBank database, were used to assess mapping quality. SNV calling accuracy was evaluated using different mapping references to align Real Illumina datasets from a single HBV clone. RESULTS: Using the sample-specific reference sequence as a mapping reference produced the largest number of mappable reads and coverages. With a different genotype mapping reference, the consensus sequence derived from the Real Illumina datasets of the single HBV clone showed 21 false SNV callings in polymerase and surface genes, the regions most divergent between the mapping reference and this HBV clone. A ~6 % coverage of most of these false SNVs was yielded even with a same genotype mapping reference, but none with the sample-specific reference sequence. CONCLUSIONS: Using sample-specific reference sequences as a mapping reference in NGS analysis optimized mapping quality and the SNV calling accuracy for HBV quasispecies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12072-015-9645-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4722079
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer India
record_format MEDLINE/PubMed
spelling pubmed-47220792016-02-01 Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus Liu, Wen-Chun Lin, Chih-Peng Cheng, Chun-Pei Ho, Cheng-Hsun Lan, Kuo-Lun Cheng, Ji-Hong Yen, Chia-Jui Cheng, Pin-Nan Wu, I-Chin Li, I-Chen Chang, Bill Chia-Han Tseng, Vincent S. Chiu, Yen-Cheng Chang, Ting-Tsung Hepatol Int Original Article BACKGROUND: Hepatitis B virus (HBV) quasispecies are crucial in the pathogenesis of chronic liver disease. Next-generation sequencing (NGS) is powerful for identifying viral quasispecies. To improve mapping quality and single nucleotide variant (SNV) calling accuracy in the NGS analysis of HBV, we compared different mapping references, including the sample-specific reference sequence, same genotype sequences and different genotype sequences, according to the sample. METHODS: Real Illumina HBV datasets from 86 patients, and simulated datasets from 158 HBV strains in the GenBank database, were used to assess mapping quality. SNV calling accuracy was evaluated using different mapping references to align Real Illumina datasets from a single HBV clone. RESULTS: Using the sample-specific reference sequence as a mapping reference produced the largest number of mappable reads and coverages. With a different genotype mapping reference, the consensus sequence derived from the Real Illumina datasets of the single HBV clone showed 21 false SNV callings in polymerase and surface genes, the regions most divergent between the mapping reference and this HBV clone. A ~6 % coverage of most of these false SNVs was yielded even with a same genotype mapping reference, but none with the sample-specific reference sequence. CONCLUSIONS: Using sample-specific reference sequences as a mapping reference in NGS analysis optimized mapping quality and the SNV calling accuracy for HBV quasispecies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12072-015-9645-x) contains supplementary material, which is available to authorized users. Springer India 2015-07-25 /pmc/articles/PMC4722079/ /pubmed/26208819 http://dx.doi.org/10.1007/s12072-015-9645-x Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Liu, Wen-Chun
Lin, Chih-Peng
Cheng, Chun-Pei
Ho, Cheng-Hsun
Lan, Kuo-Lun
Cheng, Ji-Hong
Yen, Chia-Jui
Cheng, Pin-Nan
Wu, I-Chin
Li, I-Chen
Chang, Bill Chia-Han
Tseng, Vincent S.
Chiu, Yen-Cheng
Chang, Ting-Tsung
Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title_full Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title_fullStr Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title_full_unstemmed Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title_short Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus
title_sort aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis b virus
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4722079/
https://www.ncbi.nlm.nih.gov/pubmed/26208819
http://dx.doi.org/10.1007/s12072-015-9645-x
work_keys_str_mv AT liuwenchun aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT linchihpeng aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT chengchunpei aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT hochenghsun aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT lankuolun aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT chengjihong aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT yenchiajui aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT chengpinnan aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT wuichin aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT liichen aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT changbillchiahan aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT tsengvincents aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT chiuyencheng aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus
AT changtingtsung aligningtothesamplespecificreferencesequencetooptimizetheaccuracyofnextgenerationsequencinganalysisforhepatitisbvirus