Cargando…

Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual

Pathogenic mechanisms underlying severe SARS-CoV2 infection remain largely unelucidated. High throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathog...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Bingyu, Chakravorty, Srishti, Mirabelli, Carmen, Wang, Luopin, Trujillo-Ochoa, Jorge L., Chauss, Daniel, Kumar, Dhaneshwar, Lionakis, Michail S., Olson, Matthew R, Wobus, Christiane E., Afzali, Behdad, Kazemian, Majid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7899447/
https://www.ncbi.nlm.nih.gov/pubmed/33619483
http://dx.doi.org/10.1101/2021.02.17.431704
_version_ 1783654044635496448
author Yan, Bingyu
Chakravorty, Srishti
Mirabelli, Carmen
Wang, Luopin
Trujillo-Ochoa, Jorge L.
Chauss, Daniel
Kumar, Dhaneshwar
Lionakis, Michail S.
Olson, Matthew R
Wobus, Christiane E.
Afzali, Behdad
Kazemian, Majid
author_facet Yan, Bingyu
Chakravorty, Srishti
Mirabelli, Carmen
Wang, Luopin
Trujillo-Ochoa, Jorge L.
Chauss, Daniel
Kumar, Dhaneshwar
Lionakis, Michail S.
Olson, Matthew R
Wobus, Christiane E.
Afzali, Behdad
Kazemian, Majid
author_sort Yan, Bingyu
collection PubMed
description Pathogenic mechanisms underlying severe SARS-CoV2 infection remain largely unelucidated. High throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathogen and host-derived sequences, providing a means of studying host-pathogen interactions. Recent studies have reported the presence of host-virus chimeric (HVC) RNA in RNA-seq data from SARS-CoV2 infected cells and interpreted these findings as evidence of viral integration in the human genome as a potential pathogenic mechanism. Since SARS-CoV2 is a positive sense RNA virus that replicates in the cytoplasm it does not have a nuclear phase in its life cycle, it is biologically unlikely to be in a location where splicing events could result in genome integration. Here, we investigated the biological authenticity of HVC events. In contrast to true biological events such as mRNA splicing and genome rearrangement events, which generate reproducible chimeric sequencing fragments across different biological isolates, we found that HVC events across >100 RNA-seq libraries from patients with COVID-19 and infected cell lines, were highly irreproducible. RNA-seq library preparation is inherently error-prone due to random template switching during reverse transcription of RNA to cDNA. By counting chimeric events observed when constructing an RNA-seq library from human RNA and spike-in RNA from an unrelated species, such as fruit-fly, we estimated that ~1% of RNA-seq reads are artifactually chimeric. In SARS-CoV2 RNA-seq we found that the frequency of HVC events was, in fact, not greater than this background “noise”. Finally, we developed a novel experimental approach to enrich SARS-CoV2 sequences from bulk RNA of infected cells. This method enriched viral sequences but did not enrich for HVC events, suggesting that the majority of HVC events are, in all likelihood, artifacts of library construction. In conclusion, our findings indicate that HVC events observed in RNA-sequencing libraries from SARS-CoV2 infected cells are extremely rare and are likely artifacts arising from either random template switching of reverse-transcriptase and/or sequence alignment errors. Therefore, the observed HVC events do not support SARS-CoV2 fusion to cellular genes and/or integration into human genomes.
format Online
Article
Text
id pubmed-7899447
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-78994472021-02-23 Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual Yan, Bingyu Chakravorty, Srishti Mirabelli, Carmen Wang, Luopin Trujillo-Ochoa, Jorge L. Chauss, Daniel Kumar, Dhaneshwar Lionakis, Michail S. Olson, Matthew R Wobus, Christiane E. Afzali, Behdad Kazemian, Majid bioRxiv Article Pathogenic mechanisms underlying severe SARS-CoV2 infection remain largely unelucidated. High throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathogen and host-derived sequences, providing a means of studying host-pathogen interactions. Recent studies have reported the presence of host-virus chimeric (HVC) RNA in RNA-seq data from SARS-CoV2 infected cells and interpreted these findings as evidence of viral integration in the human genome as a potential pathogenic mechanism. Since SARS-CoV2 is a positive sense RNA virus that replicates in the cytoplasm it does not have a nuclear phase in its life cycle, it is biologically unlikely to be in a location where splicing events could result in genome integration. Here, we investigated the biological authenticity of HVC events. In contrast to true biological events such as mRNA splicing and genome rearrangement events, which generate reproducible chimeric sequencing fragments across different biological isolates, we found that HVC events across >100 RNA-seq libraries from patients with COVID-19 and infected cell lines, were highly irreproducible. RNA-seq library preparation is inherently error-prone due to random template switching during reverse transcription of RNA to cDNA. By counting chimeric events observed when constructing an RNA-seq library from human RNA and spike-in RNA from an unrelated species, such as fruit-fly, we estimated that ~1% of RNA-seq reads are artifactually chimeric. In SARS-CoV2 RNA-seq we found that the frequency of HVC events was, in fact, not greater than this background “noise”. Finally, we developed a novel experimental approach to enrich SARS-CoV2 sequences from bulk RNA of infected cells. This method enriched viral sequences but did not enrich for HVC events, suggesting that the majority of HVC events are, in all likelihood, artifacts of library construction. In conclusion, our findings indicate that HVC events observed in RNA-sequencing libraries from SARS-CoV2 infected cells are extremely rare and are likely artifacts arising from either random template switching of reverse-transcriptase and/or sequence alignment errors. Therefore, the observed HVC events do not support SARS-CoV2 fusion to cellular genes and/or integration into human genomes. Cold Spring Harbor Laboratory 2021-02-17 /pmc/articles/PMC7899447/ /pubmed/33619483 http://dx.doi.org/10.1101/2021.02.17.431704 Text en https://creativecommons.org/publicdomain/zero/1.0/This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license (https://creativecommons.org/publicdomain/zero/1.0/) .
spellingShingle Article
Yan, Bingyu
Chakravorty, Srishti
Mirabelli, Carmen
Wang, Luopin
Trujillo-Ochoa, Jorge L.
Chauss, Daniel
Kumar, Dhaneshwar
Lionakis, Michail S.
Olson, Matthew R
Wobus, Christiane E.
Afzali, Behdad
Kazemian, Majid
Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title_full Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title_fullStr Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title_full_unstemmed Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title_short Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual
title_sort host-virus chimeric events in sars-cov2 infected cells are infrequent and artifactual
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7899447/
https://www.ncbi.nlm.nih.gov/pubmed/33619483
http://dx.doi.org/10.1101/2021.02.17.431704
work_keys_str_mv AT yanbingyu hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT chakravortysrishti hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT mirabellicarmen hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT wangluopin hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT trujilloochoajorgel hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT chaussdaniel hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT kumardhaneshwar hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT lionakismichails hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT olsonmatthewr hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT wobuschristianee hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT afzalibehdad hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual
AT kazemianmajid hostviruschimericeventsinsarscov2infectedcellsareinfrequentandartifactual