Cargando…

GLUE: a flexible software system for virus sequence data

BACKGROUND: Virus genome sequences, generated in ever-higher volumes, can provide new scientific insights and inform our responses to epidemics and outbreaks. To facilitate interpretation, such data must be organised and processed within scalable computing resources that encapsulate virology experti...

Descripción completa

Detalles Bibliográficos
Autores principales: Singer, Joshua B., Thomson, Emma C., McLauchlan, John, Hughes, Joseph, Gifford, Robert J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6299651/
https://www.ncbi.nlm.nih.gov/pubmed/30563445
http://dx.doi.org/10.1186/s12859-018-2459-9
_version_ 1783381531599831040
author Singer, Joshua B.
Thomson, Emma C.
McLauchlan, John
Hughes, Joseph
Gifford, Robert J.
author_facet Singer, Joshua B.
Thomson, Emma C.
McLauchlan, John
Hughes, Joseph
Gifford, Robert J.
author_sort Singer, Joshua B.
collection PubMed
description BACKGROUND: Virus genome sequences, generated in ever-higher volumes, can provide new scientific insights and inform our responses to epidemics and outbreaks. To facilitate interpretation, such data must be organised and processed within scalable computing resources that encapsulate virology expertise. GLUE (Genes Linked by Underlying Evolution) is a data-centric bioinformatics environment for building such resources. The GLUE core data schema organises sequence data along evolutionary lines, capturing not only nucleotide data but associated items such as alignments, genotype definitions, genome annotations and motifs. Its flexible design emphasises applicability to different viruses and to diverse needs within research, clinical or public health contexts. RESULTS: HCV-GLUE is a case study GLUE resource for hepatitis C virus (HCV). It includes an interactive public web application providing sequence analysis in the form of a maximum-likelihood-based genotyping method, antiviral resistance detection and graphical sequence visualisation. HCV sequence data from GenBank is categorised and stored in a large-scale sequence alignment which is accessible via web-based queries. Whereas this web resource provides a range of basic functionality, the underlying GLUE project can also be downloaded and extended by bioinformaticians addressing more advanced questions. CONCLUSION: GLUE can be used to rapidly develop virus sequence data resources with public health, research and clinical applications. This streamlined approach, with its focus on reuse, will help realise the full value of virus sequence data.
format Online
Article
Text
id pubmed-6299651
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62996512018-12-20 GLUE: a flexible software system for virus sequence data Singer, Joshua B. Thomson, Emma C. McLauchlan, John Hughes, Joseph Gifford, Robert J. BMC Bioinformatics Software BACKGROUND: Virus genome sequences, generated in ever-higher volumes, can provide new scientific insights and inform our responses to epidemics and outbreaks. To facilitate interpretation, such data must be organised and processed within scalable computing resources that encapsulate virology expertise. GLUE (Genes Linked by Underlying Evolution) is a data-centric bioinformatics environment for building such resources. The GLUE core data schema organises sequence data along evolutionary lines, capturing not only nucleotide data but associated items such as alignments, genotype definitions, genome annotations and motifs. Its flexible design emphasises applicability to different viruses and to diverse needs within research, clinical or public health contexts. RESULTS: HCV-GLUE is a case study GLUE resource for hepatitis C virus (HCV). It includes an interactive public web application providing sequence analysis in the form of a maximum-likelihood-based genotyping method, antiviral resistance detection and graphical sequence visualisation. HCV sequence data from GenBank is categorised and stored in a large-scale sequence alignment which is accessible via web-based queries. Whereas this web resource provides a range of basic functionality, the underlying GLUE project can also be downloaded and extended by bioinformaticians addressing more advanced questions. CONCLUSION: GLUE can be used to rapidly develop virus sequence data resources with public health, research and clinical applications. This streamlined approach, with its focus on reuse, will help realise the full value of virus sequence data. BioMed Central 2018-12-18 /pmc/articles/PMC6299651/ /pubmed/30563445 http://dx.doi.org/10.1186/s12859-018-2459-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Singer, Joshua B.
Thomson, Emma C.
McLauchlan, John
Hughes, Joseph
Gifford, Robert J.
GLUE: a flexible software system for virus sequence data
title GLUE: a flexible software system for virus sequence data
title_full GLUE: a flexible software system for virus sequence data
title_fullStr GLUE: a flexible software system for virus sequence data
title_full_unstemmed GLUE: a flexible software system for virus sequence data
title_short GLUE: a flexible software system for virus sequence data
title_sort glue: a flexible software system for virus sequence data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6299651/
https://www.ncbi.nlm.nih.gov/pubmed/30563445
http://dx.doi.org/10.1186/s12859-018-2459-9
work_keys_str_mv AT singerjoshuab glueaflexiblesoftwaresystemforvirussequencedata
AT thomsonemmac glueaflexiblesoftwaresystemforvirussequencedata
AT mclauchlanjohn glueaflexiblesoftwaresystemforvirussequencedata
AT hughesjoseph glueaflexiblesoftwaresystemforvirussequencedata
AT giffordrobertj glueaflexiblesoftwaresystemforvirussequencedata