Cargando…

Benchmarking different approaches for Norovirus genome assembly in metagenome samples

BACKGROUND: Genome assembly of viruses with high mutation rates, such as Norovirus and other RNA viruses, or from metagenome samples, poses a challenge for the scientific community due to the coexistence of several viral quasispecies and strains. Furthermore, there is no standard method for obtainin...

Descripción completa

Detalles Bibliográficos
Autores principales: Fuentes-Trillo, Azahara, Monzó, Carolina, Manzano, Iris, Santiso-Bellón, Cristina, Andrade, Juliana da Silva Ribeiro de, Gozalbo-Rovira, Roberto, García-García, Ana-Bárbara, Rodríguez-Díaz, Jesús, Chaves, Felipe Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8611953/
https://www.ncbi.nlm.nih.gov/pubmed/34819031
http://dx.doi.org/10.1186/s12864-021-08067-2
Descripción
Sumario:BACKGROUND: Genome assembly of viruses with high mutation rates, such as Norovirus and other RNA viruses, or from metagenome samples, poses a challenge for the scientific community due to the coexistence of several viral quasispecies and strains. Furthermore, there is no standard method for obtaining whole-genome sequences in non-related patients. After polyA RNA isolation and sequencing in eight patients with acute gastroenteritis, we evaluated two de Bruijn graph assemblers (SPAdes and MEGAHIT), combined with four different and common pre-assembly strategies, and compared those yielding whole genome Norovirus contigs. RESULTS: Reference-genome guided strategies with both host and target virus did not present any advantages compared to the assembly of non-filtered data in the case of SPAdes, and in the case of MEGAHIT, only host genome filtering presented improvements. MEGAHIT performed better than SPAdes in most samples, reaching complete genome sequences in most of them for all the strategies employed. Read binning with CD-HIT improved assembly when paired with different analysis strategies, and more notably in the case of SPAdes. CONCLUSIONS: Not all metagenome assemblies are equal and the choice in the workflow depends on the species studied and the prior steps to analysis. We may need different approaches even for samples treated equally due to the presence of high intra host variability. We tested and compared different workflows for the accurate assembly of Norovirus genomes and established their assembly capacities for this purpose. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-08067-2.