Cargando…
Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing
BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two ben...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4342091/ https://www.ncbi.nlm.nih.gov/pubmed/25758772 http://dx.doi.org/10.1186/s12864-015-1284-z |
_version_ | 1782359236835016704 |
---|---|
author | Van den Hoecke, Silvie Verhelst, Judith Vuylsteke, Marnik Saelens, Xavier |
author_facet | Van den Hoecke, Silvie Verhelst, Judith Vuylsteke, Marnik Saelens, Xavier |
author_sort | Van den Hoecke, Silvie |
collection | PubMed |
description | BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two benchtop next-generation sequencers for whole genome influenza A quasispecies analysis: the Illumina MiSeq sequencing-by-synthesis and the Ion Torrent PGM semiconductor sequencing technique. RESULTS: We first compared the accuracy and sensitivity of both sequencers using plasmid DNA and different ratios of wild type and mutant plasmid. Illumina MiSeq sequencing reads were one and a half times more accurate than those of the Ion Torrent PGM. The majority of sequencing errors were substitutions on the Illumina MiSeq and insertions and deletions, mostly in homopolymer regions, on the Ion Torrent PGM. To evaluate the suitability of the two techniques for determining the genome diversity of influenza A virus, we generated plasmid-derived PR8 virus and grew this virus in vitro. We also optimized an RT-PCR protocol to obtain uniform coverage of all eight genomic RNA segments. The sequencing reads obtained with both sequencers could successfully be assembled de novo into the segmented influenza virus genome. After mapping of the reads to the reference genome, we found that the detection limit for reliable recognition of variants in the viral genome required a frequency of 0.5% or higher. This threshold exceeds the background error rate resulting from the RT-PCR reaction and the sequencing method. Most of the variants in the PR8 virus genome were present in hemagglutinin, and these mutations were detected by both sequencers. CONCLUSIONS: Our approach underlines the power and limitations of two commonly used next-generation sequencers for the analysis of influenza virus gene diversity. We conclude that the Illumina MiSeq platform is better suited for detecting variant sequences whereas the Ion Torrent PGM platform has a shorter turnaround time. The data analysis pipeline that we propose here will also help to standardize variant calling in small RNA genomes based on next-generation sequencing data. |
format | Online Article Text |
id | pubmed-4342091 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43420912015-02-27 Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing Van den Hoecke, Silvie Verhelst, Judith Vuylsteke, Marnik Saelens, Xavier BMC Genomics Research Article BACKGROUND: Influenza viruses exist as a large group of closely related viral genomes, also called quasispecies. The composition of this influenza viral quasispecies can be determined by an accurate and sensitive sequencing technique and data analysis pipeline. We compared the suitability of two benchtop next-generation sequencers for whole genome influenza A quasispecies analysis: the Illumina MiSeq sequencing-by-synthesis and the Ion Torrent PGM semiconductor sequencing technique. RESULTS: We first compared the accuracy and sensitivity of both sequencers using plasmid DNA and different ratios of wild type and mutant plasmid. Illumina MiSeq sequencing reads were one and a half times more accurate than those of the Ion Torrent PGM. The majority of sequencing errors were substitutions on the Illumina MiSeq and insertions and deletions, mostly in homopolymer regions, on the Ion Torrent PGM. To evaluate the suitability of the two techniques for determining the genome diversity of influenza A virus, we generated plasmid-derived PR8 virus and grew this virus in vitro. We also optimized an RT-PCR protocol to obtain uniform coverage of all eight genomic RNA segments. The sequencing reads obtained with both sequencers could successfully be assembled de novo into the segmented influenza virus genome. After mapping of the reads to the reference genome, we found that the detection limit for reliable recognition of variants in the viral genome required a frequency of 0.5% or higher. This threshold exceeds the background error rate resulting from the RT-PCR reaction and the sequencing method. Most of the variants in the PR8 virus genome were present in hemagglutinin, and these mutations were detected by both sequencers. CONCLUSIONS: Our approach underlines the power and limitations of two commonly used next-generation sequencers for the analysis of influenza virus gene diversity. We conclude that the Illumina MiSeq platform is better suited for detecting variant sequences whereas the Ion Torrent PGM platform has a shorter turnaround time. The data analysis pipeline that we propose here will also help to standardize variant calling in small RNA genomes based on next-generation sequencing data. BioMed Central 2015-02-14 /pmc/articles/PMC4342091/ /pubmed/25758772 http://dx.doi.org/10.1186/s12864-015-1284-z Text en © Van den Hoecke et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Van den Hoecke, Silvie Verhelst, Judith Vuylsteke, Marnik Saelens, Xavier Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title | Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title_full | Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title_fullStr | Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title_full_unstemmed | Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title_short | Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing |
title_sort | analysis of the genetic diversity of influenza a viruses using next-generation dna sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4342091/ https://www.ncbi.nlm.nih.gov/pubmed/25758772 http://dx.doi.org/10.1186/s12864-015-1284-z |
work_keys_str_mv | AT vandenhoeckesilvie analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing AT verhelstjudith analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing AT vuylstekemarnik analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing AT saelensxavier analysisofthegeneticdiversityofinfluenzaavirusesusingnextgenerationdnasequencing |