An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes

In numerous instances, tracking the biological significance of a nucleic acid sequence can be augmented through the identification of environmental niches in which the sequence of interest is present. Many metagenomic data sets are now available, with deep sequencing of samples from diverse biologic...

Bibliographic Details
Main authors: Wahba, Lamia; Jain, Nimit; Fire, Andrew Z.; Shoura, Massa J.; Artiles, Karen L.; McCoy, Matthew J.; Jeong, Dae-Eun
Format: Online Article Text
Language: English
Published: American Society for Microbiology 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7203451/
https://www.ncbi.nlm.nih.gov/pubmed/32376697
http://dx.doi.org/10.1128/mSphere.00160-20
author Wahba, Lamia
Jain, Nimit
Fire, Andrew Z.
Shoura, Massa J.
Artiles, Karen L.
McCoy, Matthew J.
Jeong, Dae-Eun
author_facet Wahba, Lamia
Jain, Nimit
Fire, Andrew Z.
Shoura, Massa J.
Artiles, Karen L.
McCoy, Matthew J.
Jeong, Dae-Eun
author_sort Wahba, Lamia
collection PubMed
description In numerous instances, tracking the biological significance of a nucleic acid sequence can be augmented through the identification of environmental niches in which the sequence of interest is present. Many metagenomic data sets are now available, with deep sequencing of samples from diverse biological niches. While any individual metagenomic data set can be readily queried using web-based tools, meta-searches through all such data sets are less accessible. In this brief communication, we demonstrate such a meta-metagenomic approach, examining close matches to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in all high-throughput sequencing data sets in the NCBI Sequence Read Archive accessible with the “virome” keyword. In addition to the homology to bat coronaviruses observed in descriptions of the SARS-CoV-2 sequence (F. Wu, S. Zhao, B. Yu, Y. M. Chen, et al., Nature 579:265–269, 2020, https://doi.org/10.1038/s41586-020-2008-3; P. Zhou, X. L. Yang, X. G. Wang, B. Hu, et al., Nature 579:270–273, 2020, https://doi.org/10.1038/s41586-020-2012-7), we note a strong homology to numerous sequence reads in metavirome data sets generated from the lungs of deceased pangolins reported by Liu et al. (P. Liu, W. Chen, and J. P. Chen, Viruses 11:979, 2019, https://doi.org/10.3390/v11110979). While analysis of these reads indicates the presence of a similar viral sequence in pangolin lung, the similarity is not sufficient to either confirm or rule out a role for pangolins as an intermediate host in the recent emergence of SARS-CoV-2. In addition to the implications for SARS-CoV-2 emergence, this study illustrates the utility and limitations of meta-metagenomic search tools in effective and rapid characterization of potentially significant nucleic acid sequences. IMPORTANCE Meta-metagenomic searches allow for high-speed, low-cost identification of potentially significant biological niches for sequences of interest.
format Online
Article
Text
id pubmed-7203451
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-72034512020-05-20 An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes Wahba, Lamia Jain, Nimit Fire, Andrew Z. Shoura, Massa J. Artiles, Karen L. McCoy, Matthew J. Jeong, Dae-Eun mSphere Observation In numerous instances, tracking the biological significance of a nucleic acid sequence can be augmented through the identification of environmental niches in which the sequence of interest is present. Many metagenomic data sets are now available, with deep sequencing of samples from diverse biological niches. While any individual metagenomic data set can be readily queried using web-based tools, meta-searches through all such data sets are less accessible. In this brief communication, we demonstrate such a meta-metagenomic approach, examining close matches to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in all high-throughput sequencing data sets in the NCBI Sequence Read Archive accessible with the “virome” keyword. In addition to the homology to bat coronaviruses observed in descriptions of the SARS-CoV-2 sequence (F. Wu, S. Zhao, B. Yu, Y. M. Chen, et al., Nature 579:265–269, 2020, https://doi.org/10.1038/s41586-020-2008-3; P. Zhou, X. L. Yang, X. G. Wang, B. Hu, et al., Nature 579:270–273, 2020, https://doi.org/10.1038/s41586-020-2012-7), we note a strong homology to numerous sequence reads in metavirome data sets generated from the lungs of deceased pangolins reported by Liu et al. (P. Liu, W. Chen, and J. P. Chen, Viruses 11:979, 2019, https://doi.org/10.3390/v11110979). While analysis of these reads indicates the presence of a similar viral sequence in pangolin lung, the similarity is not sufficient to either confirm or rule out a role for pangolins as an intermediate host in the recent emergence of SARS-CoV-2. 
In addition to the implications for SARS-CoV-2 emergence, this study illustrates the utility and limitations of meta-metagenomic search tools in effective and rapid characterization of potentially significant nucleic acid sequences. IMPORTANCE Meta-metagenomic searches allow for high-speed, low-cost identification of potentially significant biological niches for sequences of interest. American Society for Microbiology 2020-05-06 /pmc/articles/PMC7203451/ /pubmed/32376697 http://dx.doi.org/10.1128/mSphere.00160-20 Text en Copyright © 2020 Wahba et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Observation
Wahba, Lamia
Jain, Nimit
Fire, Andrew Z.
Shoura, Massa J.
Artiles, Karen L.
McCoy, Matthew J.
Jeong, Dae-Eun
An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title_full An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title_fullStr An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title_full_unstemmed An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title_short An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes
title_sort extensive meta-metagenomic search identifies sars-cov-2-homologous sequences in pangolin lung viromes
topic Observation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7203451/
https://www.ncbi.nlm.nih.gov/pubmed/32376697
http://dx.doi.org/10.1128/mSphere.00160-20