Cargando…

Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing

BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two ben...

Descripción completa

Detalles Bibliográficos
Autores principales: Van den Hoecke, Silvie, Verhelst, Judith, Vuylsteke, Marnik, Saelens, Xavier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4342091/
https://www.ncbi.nlm.nih.gov/pubmed/25758772
http://dx.doi.org/10.1186/s12864-015-1284-z
_version_ 1782359236835016704
author Van den Hoecke, Silvie
Verhelst, Judith
Vuylsteke, Marnik
Saelens, Xavier
author_facet Van den Hoecke, Silvie
Verhelst, Judith
Vuylsteke, Marnik
Saelens, Xavier
author_sort Van den Hoecke, Silvie
collection PubMed
description BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two benchtop next-generation sequencers for whole genome influenza A quasispecies analysis: the Illumina MiSeq sequencing-by-synthesis and the Ion Torrent PGM semiconductor sequencing technique. RESULTS: We first compared the accuracy and sensitivity of both sequencers using plasmid DNA and different ratios of wild type and mutant plasmid. Illumina MiSeq sequencing reads were one and a half times more accurate than those of the Ion Torrent PGM. The majority of sequencing errors were substitutions on the Illumina MiSeq and insertions and deletions, mostly in homopolymer regions, on the Ion Torrent PGM. To evaluate the suitability of the two techniques for determining the genome diversity of influenza A virus, we generated plasmid-derived PR8 virus and grew this virus in vitro. We also optimized an RT-PCR protocol to obtain uniform coverage of all eight genomic RNA segments. The sequencing reads obtained with both sequencers could successfully be assembled de novo into the segmented influenza virus genome. After mapping of the reads to the reference genome, we found that the detection limit for reliable recognition of variants in the viral genome required a frequency of 0.5% or higher. This threshold exceeds the background error rate resulting from the RT-PCR reaction and the sequencing method. Most of the variants in the PR8 virus genome were present in hemagglutinin, and these mutations were detected by both sequencers. CONCLUSIONS: Our approach underlines the power and limitations of two commonly used next-generation sequencers for the analysis of influenza virus gene diversity. We conclude that the Illumina MiSeq platform is better suited for detecting variant sequences whereas the Ion Torrent PGM platform has a shorter turnaround time. The data analysis pipeline that we propose here will also help to standardize variant calling in small RNA genomes based on next-generation sequencing data.
format Online
Article
Text
id pubmed-4342091
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43420912015-02-27 Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing Van den Hoecke, Silvie Verhelst, Judith Vuylsteke, Marnik Saelens, Xavier BMC Genomics Research Article BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two benchtop next-generation sequencers for whole genome influenza A quasispecies analysis: the Illumina MiSeq sequencing-by-synthesis and the Ion Torrent PGM semiconductor sequencing technique. RESULTS: We first compared the accuracy and sensitivity of both sequencers using plasmid DNA and different ratios of wild type and mutant plasmid. Illumina MiSeq sequencing reads were one and a half times more accurate than those of the Ion Torrent PGM. The majority of sequencing errors were substitutions on the Illumina MiSeq and insertions and deletions, mostly in homopolymer regions, on the Ion Torrent PGM. To evaluate the suitability of the two techniques for determining the genome diversity of influenza A virus, we generated plasmid-derived PR8 virus and grew this virus in vitro. We also optimized an RT-PCR protocol to obtain uniform coverage of all eight genomic RNA segments. The sequencing reads obtained with both sequencers could successfully be assembled de novo into the segmented influenza virus genome. After mapping of the reads to the reference genome, we found that the detection limit for reliable recognition of variants in the viral genome required a frequency of 0.5% or higher. This threshold exceeds the background error rate resulting from the RT-PCR reaction and the sequencing method. Most of the variants in the PR8 virus genome were present in hemagglutinin, and these mutations were detected by both sequencers. CONCLUSIONS: Our approach underlines the power and limitations of two commonly used next-generation sequencers for the analysis of influenza virus gene diversity. We conclude that the Illumina MiSeq platform is better suited for detecting variant sequences whereas the Ion Torrent PGM platform has a shorter turnaround time. The data analysis pipeline that we propose here will also help to standardize variant calling in small RNA genomes based on next-generation sequencing data. BioMed Central 2015-02-14 /pmc/articles/PMC4342091/ /pubmed/25758772 http://dx.doi.org/10.1186/s12864-015-1284-z Text en © Van den Hoecke et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Van den Hoecke, Silvie
Verhelst, Judith
Vuylsteke, Marnik
Saelens, Xavier
Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title_full Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title_fullStr Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title_full_unstemmed Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title_short Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
title_sort analysis of the genetic diversity of influenza a viruses using next-generation dna sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4342091/
https://www.ncbi.nlm.nih.gov/pubmed/25758772
http://dx.doi.org/10.1186/s12864-015-1284-z
work_keys_str_mv AT vandenhoeckesilvie analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing
AT verhelstjudith analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing
AT vuylstekemarnik analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing
AT saelensxavier analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing