Cargando…
Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus
BACKGROUND: Apis mellifera filamentous virus (AmFV) is a large double-stranded DNA virus of uncertain phylogenetic position that infects honey bees (Apis mellifera). Little is known about AmFV evolution or molecular aspects of infection. Accurate annotation of open-reading frames (ORFs) is challenge...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655722/ https://www.ncbi.nlm.nih.gov/pubmed/38025724 http://dx.doi.org/10.7717/peerj.16455 |
_version_ | 1785147978447585280 |
---|---|
author | Cornman, Robert S. |
author_facet | Cornman, Robert S. |
author_sort | Cornman, Robert S. |
collection | PubMed |
description | BACKGROUND: Apis mellifera filamentous virus (AmFV) is a large double-stranded DNA virus of uncertain phylogenetic position that infects honey bees (Apis mellifera). Little is known about AmFV evolution or molecular aspects of infection. Accurate annotation of open-reading frames (ORFs) is challenged by weak homology to other known viruses. This study was undertaken to evaluate ORFs (including coding-frame conservation, codon bias, and purifying selection), quantify genetic variation within AmFV, identify host characteristics that covary with infection rate, and examine viral expression patterns in different tissues. METHODS: Short-read data were accessed from the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI). Sequence reads were downloaded from accessions meeting search criteria and scanned for kmers representative of AmFV genomic sequence. Samples with kmer counts above specified thresholds were downloaded in full for mapping to reference sequences and de novo assembly. RESULTS: At least three distinct evolutionary lineages of AmFV exist. Clade 1 predominates in Europe but in the Americas and Africa it is replaced by the other clades as infection level increases in hosts. Only clade 3 was found at high relative abundance in hosts with African ancestry, whereas all clades achieved high relative abundance in bees of non-African ancestry. In Europe and Africa, clade 2 was generally detected only in low-level infections but was locally dominant in some North American samples. The geographic distribution of clade 3 was consistent with an introduction to the Americas with ‘Africanized’ honey bees in the 1950s. Localized genomic regions of very high nucleotide divergence in individual isolates suggest recombination with additional, as-yet unidentified AmFV lineages. A set of 155 high-confidence ORFs was annotated based on evolutionary conservation in six AmFV genome sequences representative of the three clades. Pairwise protein-level identity averaged 94.6% across ORFs (range 77.1–100%), which generally exhibited low evolutionary rates and moderate to strong codon bias. However, no robust example of positive diversifying selection on coding sequence was found in these alignments. Most of the genome was detected in RNA short-read alignments. Transcriptome assembly often yielded contigs in excess of 50 kb and containing ORFs in both orientations, and the termini of long transcripts were associated with tandem repeats. Lower levels of AmFV RNA were detected in brain tissue compared to abdominal tissue, and a distinct set of ORFs had minimal to no detectable expression in brain tissue. A scan of DNA accessions from the parasitic mite Varroa destructor was inconclusive with respect to replication in that species. DISCUSSION: Collectively, these results expand our understanding of this enigmatic virus, revealing transcriptional complexity and co-evolutionary associations with host lineage. |
format | Online Article Text |
id | pubmed-10655722 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106557222023-11-14 Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus Cornman, Robert S. PeerJ Agricultural Science BACKGROUND: Apis mellifera filamentous virus (AmFV) is a large double-stranded DNA virus of uncertain phylogenetic position that infects honey bees (Apis mellifera). Little is known about AmFV evolution or molecular aspects of infection. Accurate annotation of open-reading frames (ORFs) is challenged by weak homology to other known viruses. This study was undertaken to evaluate ORFs (including coding-frame conservation, codon bias, and purifying selection), quantify genetic variation within AmFV, identify host characteristics that covary with infection rate, and examine viral expression patterns in different tissues. METHODS: Short-read data were accessed from the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI). Sequence reads were downloaded from accessions meeting search criteria and scanned for kmers representative of AmFV genomic sequence. Samples with kmer counts above specified thresholds were downloaded in full for mapping to reference sequences and de novo assembly. RESULTS: At least three distinct evolutionary lineages of AmFV exist. Clade 1 predominates in Europe but in the Americas and Africa it is replaced by the other clades as infection level increases in hosts. Only clade 3 was found at high relative abundance in hosts with African ancestry, whereas all clades achieved high relative abundance in bees of non-African ancestry. In Europe and Africa, clade 2 was generally detected only in low-level infections but was locally dominant in some North American samples. The geographic distribution of clade 3 was consistent with an introduction to the Americas with ‘Africanized’ honey bees in the 1950s. Localized genomic regions of very high nucleotide divergence in individual isolates suggest recombination with additional, as-yet unidentified AmFV lineages. A set of 155 high-confidence ORFs was annotated based on evolutionary conservation in six AmFV genome sequences representative of the three clades. Pairwise protein-level identity averaged 94.6% across ORFs (range 77.1–100%), which generally exhibited low evolutionary rates and moderate to strong codon bias. However, no robust example of positive diversifying selection on coding sequence was found in these alignments. Most of the genome was detected in RNA short-read alignments. Transcriptome assembly often yielded contigs in excess of 50 kb and containing ORFs in both orientations, and the termini of long transcripts were associated with tandem repeats. Lower levels of AmFV RNA were detected in brain tissue compared to abdominal tissue, and a distinct set of ORFs had minimal to no detectable expression in brain tissue. A scan of DNA accessions from the parasitic mite Varroa destructor was inconclusive with respect to replication in that species. DISCUSSION: Collectively, these results expand our understanding of this enigmatic virus, revealing transcriptional complexity and co-evolutionary associations with host lineage. PeerJ Inc. 2023-11-14 /pmc/articles/PMC10655722/ /pubmed/38025724 http://dx.doi.org/10.7717/peerj.16455 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/) . This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Agricultural Science Cornman, Robert S. Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title | Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title_full | Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title_fullStr | Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title_full_unstemmed | Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title_short | Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus |
title_sort | data mining reveals tissue-specific expression and host lineage-associated forms of apis mellifera filamentous virus |
topic | Agricultural Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655722/ https://www.ncbi.nlm.nih.gov/pubmed/38025724 http://dx.doi.org/10.7717/peerj.16455 |
work_keys_str_mv | AT cornmanroberts dataminingrevealstissuespecificexpressionandhostlineageassociatedformsofapismelliferafilamentousvirus |