Cargando…
The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data
The reliable identification and classification of infectious diseases is critical for understanding their biology and controlling their impact. Recent advances in sequencing technology have allowed insight into the remarkable diversity of the virosphere, of which a large component remains undiscover...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6316005/ https://www.ncbi.nlm.nih.gov/pubmed/30513931 http://dx.doi.org/10.3390/v10120685 |
_version_ | 1783384427732140032 |
---|---|
author | Knox, Matthew A. Gedye, Kristene R. Hayman, David T. S. |
author_facet | Knox, Matthew A. Gedye, Kristene R. Hayman, David T. S. |
author_sort | Knox, Matthew A. |
collection | PubMed |
description | The reliable identification and classification of infectious diseases is critical for understanding their biology and controlling their impact. Recent advances in sequencing technology have allowed insight into the remarkable diversity of the virosphere, of which a large component remains undiscovered. For these emerging or undescribed viruses, the process of classifying unknown sequences is heavily reliant on existing nucleotide sequence information in public databases. However, due to the enormous diversity of viruses, and past focus on the most prevalent and impactful virus types, databases are often incomplete. Picobirnaviridae is a dsRNA virus family with broad host and geographic range, but with relatively little sequence information in public databases. The family contains one genus, Picobirnavirus, which may be associated with gastric illness in humans and animals. Little further information is available due in part to difficulties in identification. Here, we investigate diversity both within the genus Picobirnavirus and among other dsRNA virus types using a combined phylogenetic and functional (protein structure homology-modelling) approach. Our results show that diversity within picobirnavirus exceeds that seen between many other dsRNA genera. Furthermore, we find that commonly used practices employed to classify picobirnavirus, such as analysis of short fragments and trimming of sequences, can influence phylogenetic conclusions. The degree of phylogenetic and functional divergence among picobirnavirus sequences in our study suggests an enormous undiscovered diversity, which contributes to the undescribed “viral dark matter” component of metagenomic studies. |
format | Online Article Text |
id | pubmed-6316005 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-63160052019-01-10 The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data Knox, Matthew A. Gedye, Kristene R. Hayman, David T. S. Viruses Article The reliable identification and classification of infectious diseases is critical for understanding their biology and controlling their impact. Recent advances in sequencing technology have allowed insight into the remarkable diversity of the virosphere, of which a large component remains undiscovered. For these emerging or undescribed viruses, the process of classifying unknown sequences is heavily reliant on existing nucleotide sequence information in public databases. However, due to the enormous diversity of viruses, and past focus on the most prevalent and impactful virus types, databases are often incomplete. Picobirnaviridae is a dsRNA virus family with broad host and geographic range, but with relatively little sequence information in public databases. The family contains one genus, Picobirnavirus, which may be associated with gastric illness in humans and animals. Little further information is available due in part to difficulties in identification. Here, we investigate diversity both within the genus Picobirnavirus and among other dsRNA virus types using a combined phylogenetic and functional (protein structure homology-modelling) approach. Our results show that diversity within picobirnavirus exceeds that seen between many other dsRNA genera. Furthermore, we find that commonly used practices employed to classify picobirnavirus, such as analysis of short fragments and trimming of sequences, can influence phylogenetic conclusions. The degree of phylogenetic and functional divergence among picobirnavirus sequences in our study suggests an enormous undiscovered diversity, which contributes to the undescribed “viral dark matter” component of metagenomic studies. MDPI 2018-12-03 /pmc/articles/PMC6316005/ /pubmed/30513931 http://dx.doi.org/10.3390/v10120685 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Knox, Matthew A. Gedye, Kristene R. Hayman, David T. S. The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title | The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title_full | The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title_fullStr | The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title_full_unstemmed | The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title_short | The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data |
title_sort | challenges of analysing highly diverse picobirnavirus sequence data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6316005/ https://www.ncbi.nlm.nih.gov/pubmed/30513931 http://dx.doi.org/10.3390/v10120685 |
work_keys_str_mv | AT knoxmatthewa thechallengesofanalysinghighlydiversepicobirnavirussequencedata AT gedyekristener thechallengesofanalysinghighlydiversepicobirnavirussequencedata AT haymandavidts thechallengesofanalysinghighlydiversepicobirnavirussequencedata AT knoxmatthewa challengesofanalysinghighlydiversepicobirnavirussequencedata AT gedyekristener challengesofanalysinghighlydiversepicobirnavirussequencedata AT haymandavidts challengesofanalysinghighlydiversepicobirnavirussequencedata |