Cargando…

Discovery of several thousand highly diverse circular DNA viruses

Although millions of distinct virus species likely exist, only approximately 9000 are catalogued in GenBank's RefSeq database. We selectively enriched for the genomes of circular DNA viruses in over 70 animal samples, ranging from nematodes to human tissue specimens. A bioinformatics pipeline,...

Descripción completa

Detalles Bibliográficos
Autores principales: Tisza, Michael J, Pastrana, Diana V, Welch, Nicole L, Stewart, Brittany, Peretti, Alberto, Starrett, Gabriel J, Pang, Yuk-Ying S, Krishnamurthy, Siddharth R, Pesavento, Patricia A, McDermott, David H, Murphy, Philip M, Whited, Jessica L, Miller, Bess, Brenchley, Jason, Rosshart, Stephan P, Rehermann, Barbara, Doorbar, John, Ta'ala, Blake A, Pletnikova, Olga, Troncoso, Juan C, Resnick, Susan M, Bolduc, Ben, Sullivan, Matthew B, Varsani, Arvind, Segall, Anca M, Buck, Christopher B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7000223/
https://www.ncbi.nlm.nih.gov/pubmed/32014111
http://dx.doi.org/10.7554/eLife.51971
_version_ 1783494001026924544
author Tisza, Michael J
Pastrana, Diana V
Welch, Nicole L
Stewart, Brittany
Peretti, Alberto
Starrett, Gabriel J
Pang, Yuk-Ying S
Krishnamurthy, Siddharth R
Pesavento, Patricia A
McDermott, David H
Murphy, Philip M
Whited, Jessica L
Miller, Bess
Brenchley, Jason
Rosshart, Stephan P
Rehermann, Barbara
Doorbar, John
Ta'ala, Blake A
Pletnikova, Olga
Troncoso, Juan C
Resnick, Susan M
Bolduc, Ben
Sullivan, Matthew B
Varsani, Arvind
Segall, Anca M
Buck, Christopher B
author_facet Tisza, Michael J
Pastrana, Diana V
Welch, Nicole L
Stewart, Brittany
Peretti, Alberto
Starrett, Gabriel J
Pang, Yuk-Ying S
Krishnamurthy, Siddharth R
Pesavento, Patricia A
McDermott, David H
Murphy, Philip M
Whited, Jessica L
Miller, Bess
Brenchley, Jason
Rosshart, Stephan P
Rehermann, Barbara
Doorbar, John
Ta'ala, Blake A
Pletnikova, Olga
Troncoso, Juan C
Resnick, Susan M
Bolduc, Ben
Sullivan, Matthew B
Varsani, Arvind
Segall, Anca M
Buck, Christopher B
author_sort Tisza, Michael J
collection PubMed
description Although millions of distinct virus species likely exist, only approximately 9000 are catalogued in GenBank's RefSeq database. We selectively enriched for the genomes of circular DNA viruses in over 70 animal samples, ranging from nematodes to human tissue specimens. A bioinformatics pipeline, Cenote-Taker, was developed to automatically annotate over 2500 complete genomes in a GenBank-compliant format. The new genomes belong to dozens of established and emerging viral families. Some appear to be the result of previously undescribed recombination events between ssDNA and ssRNA viruses. In addition, hundreds of circular DNA elements that do not encode any discernable similarities to previously characterized sequences were identified. To characterize these ‘dark matter’ sequences, we used an artificial neural network to identify candidate viral capsid proteins, several of which formed virus-like particles when expressed in culture. These data further the understanding of viral sequence diversity and allow for high throughput documentation of the virosphere.
format Online
Article
Text
id pubmed-7000223
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-70002232020-02-06 Discovery of several thousand highly diverse circular DNA viruses Tisza, Michael J Pastrana, Diana V Welch, Nicole L Stewart, Brittany Peretti, Alberto Starrett, Gabriel J Pang, Yuk-Ying S Krishnamurthy, Siddharth R Pesavento, Patricia A McDermott, David H Murphy, Philip M Whited, Jessica L Miller, Bess Brenchley, Jason Rosshart, Stephan P Rehermann, Barbara Doorbar, John Ta'ala, Blake A Pletnikova, Olga Troncoso, Juan C Resnick, Susan M Bolduc, Ben Sullivan, Matthew B Varsani, Arvind Segall, Anca M Buck, Christopher B eLife Evolutionary Biology Although millions of distinct virus species likely exist, only approximately 9000 are catalogued in GenBank's RefSeq database. We selectively enriched for the genomes of circular DNA viruses in over 70 animal samples, ranging from nematodes to human tissue specimens. A bioinformatics pipeline, Cenote-Taker, was developed to automatically annotate over 2500 complete genomes in a GenBank-compliant format. The new genomes belong to dozens of established and emerging viral families. Some appear to be the result of previously undescribed recombination events between ssDNA and ssRNA viruses. In addition, hundreds of circular DNA elements that do not encode any discernable similarities to previously characterized sequences were identified. To characterize these ‘dark matter’ sequences, we used an artificial neural network to identify candidate viral capsid proteins, several of which formed virus-like particles when expressed in culture. These data further the understanding of viral sequence diversity and allow for high throughput documentation of the virosphere. eLife Sciences Publications, Ltd 2020-02-04 /pmc/articles/PMC7000223/ /pubmed/32014111 http://dx.doi.org/10.7554/eLife.51971 Text en http://creativecommons.org/publicdomain/zero/1.0/ http://creativecommons.org/publicdomain/zero/1.0/This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication (http://creativecommons.org/publicdomain/zero/1.0/) .
spellingShingle Evolutionary Biology
Tisza, Michael J
Pastrana, Diana V
Welch, Nicole L
Stewart, Brittany
Peretti, Alberto
Starrett, Gabriel J
Pang, Yuk-Ying S
Krishnamurthy, Siddharth R
Pesavento, Patricia A
McDermott, David H
Murphy, Philip M
Whited, Jessica L
Miller, Bess
Brenchley, Jason
Rosshart, Stephan P
Rehermann, Barbara
Doorbar, John
Ta'ala, Blake A
Pletnikova, Olga
Troncoso, Juan C
Resnick, Susan M
Bolduc, Ben
Sullivan, Matthew B
Varsani, Arvind
Segall, Anca M
Buck, Christopher B
Discovery of several thousand highly diverse circular DNA viruses
title Discovery of several thousand highly diverse circular DNA viruses
title_full Discovery of several thousand highly diverse circular DNA viruses
title_fullStr Discovery of several thousand highly diverse circular DNA viruses
title_full_unstemmed Discovery of several thousand highly diverse circular DNA viruses
title_short Discovery of several thousand highly diverse circular DNA viruses
title_sort discovery of several thousand highly diverse circular dna viruses
topic Evolutionary Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7000223/
https://www.ncbi.nlm.nih.gov/pubmed/32014111
http://dx.doi.org/10.7554/eLife.51971
work_keys_str_mv AT tiszamichaelj discoveryofseveralthousandhighlydiversecirculardnaviruses
AT pastranadianav discoveryofseveralthousandhighlydiversecirculardnaviruses
AT welchnicolel discoveryofseveralthousandhighlydiversecirculardnaviruses
AT stewartbrittany discoveryofseveralthousandhighlydiversecirculardnaviruses
AT perettialberto discoveryofseveralthousandhighlydiversecirculardnaviruses
AT starrettgabrielj discoveryofseveralthousandhighlydiversecirculardnaviruses
AT pangyukyings discoveryofseveralthousandhighlydiversecirculardnaviruses
AT krishnamurthysiddharthr discoveryofseveralthousandhighlydiversecirculardnaviruses
AT pesaventopatriciaa discoveryofseveralthousandhighlydiversecirculardnaviruses
AT mcdermottdavidh discoveryofseveralthousandhighlydiversecirculardnaviruses
AT murphyphilipm discoveryofseveralthousandhighlydiversecirculardnaviruses
AT whitedjessical discoveryofseveralthousandhighlydiversecirculardnaviruses
AT millerbess discoveryofseveralthousandhighlydiversecirculardnaviruses
AT brenchleyjason discoveryofseveralthousandhighlydiversecirculardnaviruses
AT rosshartstephanp discoveryofseveralthousandhighlydiversecirculardnaviruses
AT rehermannbarbara discoveryofseveralthousandhighlydiversecirculardnaviruses
AT doorbarjohn discoveryofseveralthousandhighlydiversecirculardnaviruses
AT taalablakea discoveryofseveralthousandhighlydiversecirculardnaviruses
AT pletnikovaolga discoveryofseveralthousandhighlydiversecirculardnaviruses
AT troncosojuanc discoveryofseveralthousandhighlydiversecirculardnaviruses
AT resnicksusanm discoveryofseveralthousandhighlydiversecirculardnaviruses
AT bolducben discoveryofseveralthousandhighlydiversecirculardnaviruses
AT sullivanmatthewb discoveryofseveralthousandhighlydiversecirculardnaviruses
AT varsaniarvind discoveryofseveralthousandhighlydiversecirculardnaviruses
AT segallancam discoveryofseveralthousandhighlydiversecirculardnaviruses
AT buckchristopherb discoveryofseveralthousandhighlydiversecirculardnaviruses