Cargando…

Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples

BACKGROUND: Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Jake, Kramna, Lenka, Autio, Reija, Hyöty, Heikki, Nykter, Matti, Cinek, Ondrej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5430618/
https://www.ncbi.nlm.nih.gov/pubmed/28506246
http://dx.doi.org/10.1186/s12864-017-3721-7
_version_ 1783236253549854720
author Lin, Jake
Kramna, Lenka
Autio, Reija
Hyöty, Heikki
Nykter, Matti
Cinek, Ondrej
author_facet Lin, Jake
Kramna, Lenka
Autio, Reija
Hyöty, Heikki
Nykter, Matti
Cinek, Ondrej
author_sort Lin, Jake
collection PubMed
description BACKGROUND: Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical methods: this is especially important in studies of multifactorial diseases, or in parallel comparison of laboratory protocols. RESULTS: We have developed a web-based application allowing direct upload of sequences from multiple virome samples using custom parameters. The samples are then processed in parallel using an identical protocol, and can be easily reanalyzed. The pipeline performs de-novo assembly, taxonomic classification of viruses as well as sample analyses based on user-defined grouping categories. Tables of virus abundance are produced from cross-validation by remapping the sequencing reads to a union of all observed reference viruses. In addition, read sets and reports are created after processing unmapped reads against known human and bacterial ribosome references. Secured interactive results are dynamically plotted with population and diversity charts, clustered heatmaps and a sortable and searchable abundance table. CONCLUSIONS: The Vipie web application is a unique tool for multi-sample metagenomic analysis of viral data, producing searchable hits tables, interactive population maps, alpha diversity measures and clustered heatmaps that are grouped in applicable custom sample categories. Known references such as human genome and bacterial ribosomal genes are optionally removed from unmapped (‘dark matter’) reads. Secured results are accessible and shareable on modern browsers. Vipie is a freely available web-based tool whose code is open source. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3721-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5430618
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54306182017-05-17 Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples Lin, Jake Kramna, Lenka Autio, Reija Hyöty, Heikki Nykter, Matti Cinek, Ondrej BMC Genomics Software BACKGROUND: Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical methods: this is especially important in studies of multifactorial diseases, or in parallel comparison of laboratory protocols. RESULTS: We have developed a web-based application allowing direct upload of sequences from multiple virome samples using custom parameters. The samples are then processed in parallel using an identical protocol, and can be easily reanalyzed. The pipeline performs de-novo assembly, taxonomic classification of viruses as well as sample analyses based on user-defined grouping categories. Tables of virus abundance are produced from cross-validation by remapping the sequencing reads to a union of all observed reference viruses. In addition, read sets and reports are created after processing unmapped reads against known human and bacterial ribosome references. Secured interactive results are dynamically plotted with population and diversity charts, clustered heatmaps and a sortable and searchable abundance table. CONCLUSIONS: The Vipie web application is a unique tool for multi-sample metagenomic analysis of viral data, producing searchable hits tables, interactive population maps, alpha diversity measures and clustered heatmaps that are grouped in applicable custom sample categories. Known references such as human genome and bacterial ribosomal genes are optionally removed from unmapped (‘dark matter’) reads. Secured results are accessible and shareable on modern browsers. Vipie is a freely available web-based tool whose code is open source. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3721-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-15 /pmc/articles/PMC5430618/ /pubmed/28506246 http://dx.doi.org/10.1186/s12864-017-3721-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Lin, Jake
Kramna, Lenka
Autio, Reija
Hyöty, Heikki
Nykter, Matti
Cinek, Ondrej
Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title_full Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title_fullStr Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title_full_unstemmed Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title_short Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples
title_sort vipie: web pipeline for parallel characterization of viral populations from multiple ngs samples
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5430618/
https://www.ncbi.nlm.nih.gov/pubmed/28506246
http://dx.doi.org/10.1186/s12864-017-3721-7
work_keys_str_mv AT linjake vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples
AT kramnalenka vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples
AT autioreija vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples
AT hyotyheikki vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples
AT nyktermatti vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples
AT cinekondrej vipiewebpipelineforparallelcharacterizationofviralpopulationsfrommultiplengssamples