
Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus


Bibliographic Details
Main Author: Cornman, Robert S.
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2023
Subjects: Agricultural Science
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655722/
https://www.ncbi.nlm.nih.gov/pubmed/38025724
http://dx.doi.org/10.7717/peerj.16455
_version_ 1785147978447585280
author Cornman, Robert S.
collection PubMed
description BACKGROUND: Apis mellifera filamentous virus (AmFV) is a large double-stranded DNA virus of uncertain phylogenetic position that infects honey bees (Apis mellifera). Little is known about AmFV evolution or molecular aspects of infection. Accurate annotation of open-reading frames (ORFs) is challenged by weak homology to other known viruses. This study was undertaken to evaluate ORFs (including coding-frame conservation, codon bias, and purifying selection), quantify genetic variation within AmFV, identify host characteristics that covary with infection rate, and examine viral expression patterns in different tissues. METHODS: Short-read data were accessed from the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI). Sequence reads were downloaded from accessions meeting search criteria and scanned for kmers representative of AmFV genomic sequence. Samples with kmer counts above specified thresholds were downloaded in full for mapping to reference sequences and de novo assembly. RESULTS: At least three distinct evolutionary lineages of AmFV exist. Clade 1 predominates in Europe but in the Americas and Africa it is replaced by the other clades as infection level increases in hosts. Only clade 3 was found at high relative abundance in hosts with African ancestry, whereas all clades achieved high relative abundance in bees of non-African ancestry. In Europe and Africa, clade 2 was generally detected only in low-level infections but was locally dominant in some North American samples. The geographic distribution of clade 3 was consistent with an introduction to the Americas with ‘Africanized’ honey bees in the 1950s. Localized genomic regions of very high nucleotide divergence in individual isolates suggest recombination with additional, as-yet unidentified AmFV lineages. A set of 155 high-confidence ORFs was annotated based on evolutionary conservation in six AmFV genome sequences representative of the three clades. 
Pairwise protein-level identity averaged 94.6% across ORFs (range 77.1–100%), which generally exhibited low evolutionary rates and moderate to strong codon bias. However, no robust example of positive diversifying selection on coding sequence was found in these alignments. Most of the genome was detected in RNA short-read alignments. Transcriptome assembly often yielded contigs in excess of 50 kb and containing ORFs in both orientations, and the termini of long transcripts were associated with tandem repeats. Lower levels of AmFV RNA were detected in brain tissue compared to abdominal tissue, and a distinct set of ORFs had minimal to no detectable expression in brain tissue. A scan of DNA accessions from the parasitic mite Varroa destructor was inconclusive with respect to replication in that species. DISCUSSION: Collectively, these results expand our understanding of this enigmatic virus, revealing transcriptional complexity and co-evolutionary associations with host lineage.
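The k-mer screening step described under METHODS can be illustrated with a minimal Python sketch. This is not the author's pipeline: the record does not state the k-mer length, the thresholds, or the software used, so `k`, `threshold`, and every function name below are hypothetical, and the use of canonical (strand-independent) k-mers is an assumption.

```python
# Hypothetical sketch of k-mer screening: count how many k-mers in a sample's
# reads also occur in a reference sequence, then compare against a threshold.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer: str) -> str:
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmer_set(reference: str, k: int) -> set[str]:
    """Collect the canonical k-mers of a reference, skipping any with non-ACGT bases."""
    reference = reference.upper()
    kmers = set()
    for i in range(len(reference) - k + 1):
        kmer = reference[i:i + k]
        if set(kmer) <= set("ACGT"):
            kmers.add(canonical(kmer))
    return kmers


def count_hits(reads: list[str], ref_kmers: set[str], k: int) -> int:
    """Count k-mer positions across all reads whose canonical form is in the reference set."""
    hits = 0
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            if canonical(read[i:i + k]) in ref_kmers:
                hits += 1
    return hits


def passes_screen(reads: list[str], ref_kmers: set[str], k: int, threshold: int) -> bool:
    """Flag a sample for full download when its hit count reaches the threshold."""
    return count_hits(reads, ref_kmers, k) >= threshold


if __name__ == "__main__":
    # Toy sequences for illustration only; they are not AmFV data.
    ref_kmers = kmer_set("ACGTTGCAAGGCTA", k=5)
    sample_reads = ["TTGCAAGG", "GGGGGGGG"]
    print(passes_screen(sample_reads, ref_kmers, k=5, threshold=3))
```

In practice a screen like this would run on a subset of reads from each accession, and only samples that pass would be downloaded in full for mapping and de novo assembly, as the abstract describes.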
format Online
Article
Text
id pubmed-10655722
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-10655722 2023-11-14 Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus Cornman, Robert S. PeerJ Agricultural Science PeerJ Inc. 2023-11-14 /pmc/articles/PMC10655722/ /pubmed/38025724 http://dx.doi.org/10.7717/peerj.16455 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/). This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
title Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus
topic Agricultural Science