Cargando…

A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences

Overlapping genes in viruses maximize the coding capacity of their genomes and allow the generation of new genes without major increases in genome size. Despite their importance, the evolution and function of overlapping genes are often not well understood, in part due to difficulties in their detec...

Descripción completa

Detalles Bibliográficos
Autores principales: Schlub, Timothy E, Buchmann, Jan P, Holmes, Edward C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6188560/
https://www.ncbi.nlm.nih.gov/pubmed/30099499
http://dx.doi.org/10.1093/molbev/msy155
_version_ 1783363202906587136
author Schlub, Timothy E
Buchmann, Jan P
Holmes, Edward C
author_facet Schlub, Timothy E
Buchmann, Jan P
Holmes, Edward C
author_sort Schlub, Timothy E
collection PubMed
description Overlapping genes in viruses maximize the coding capacity of their genomes and allow the generation of new genes without major increases in genome size. Despite their importance, the evolution and function of overlapping genes are often not well understood, in part due to difficulties in their detection. In addition, most bioinformatic approaches for the detection of overlapping genes require the comparison of multiple genome sequences that may not be available in metagenomic surveys of virus biodiversity. We introduce a simple new method for identifying candidate functional overlapping genes using single virus genome sequences. Our method uses randomization tests to estimate the expected length of open reading frames and then identifies overlapping open reading frames that significantly exceed this length and are thus predicted to be functional. We applied this method to 2548 reference RNA virus genomes and find that it has both high sensitivity and low false discovery for genes that overlap by at least 50 nucleotides. Notably, this analysis provided evidence for 29 previously undiscovered functional overlapping genes, some of which are coded in the antisense direction suggesting there are limitations in our current understanding of RNA virus replication.
format Online
Article
Text
id pubmed-6188560
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61885602018-10-22 A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences Schlub, Timothy E Buchmann, Jan P Holmes, Edward C Mol Biol Evol Methods Overlapping genes in viruses maximize the coding capacity of their genomes and allow the generation of new genes without major increases in genome size. Despite their importance, the evolution and function of overlapping genes are often not well understood, in part due to difficulties in their detection. In addition, most bioinformatic approaches for the detection of overlapping genes require the comparison of multiple genome sequences that may not be available in metagenomic surveys of virus biodiversity. We introduce a simple new method for identifying candidate functional overlapping genes using single virus genome sequences. Our method uses randomization tests to estimate the expected length of open reading frames and then identifies overlapping open reading frames that significantly exceed this length and are thus predicted to be functional. We applied this method to 2548 reference RNA virus genomes and find that it has both high sensitivity and low false discovery for genes that overlap by at least 50 nucleotides. Notably, this analysis provided evidence for 29 previously undiscovered functional overlapping genes, some of which are coded in the antisense direction suggesting there are limitations in our current understanding of RNA virus replication. Oxford University Press 2018-10 2018-08-07 /pmc/articles/PMC6188560/ /pubmed/30099499 http://dx.doi.org/10.1093/molbev/msy155 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Schlub, Timothy E
Buchmann, Jan P
Holmes, Edward C
A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title_full A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title_fullStr A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title_full_unstemmed A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title_short A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences
title_sort simple method to detect candidate overlapping genes in viruses using single genome sequences
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6188560/
https://www.ncbi.nlm.nih.gov/pubmed/30099499
http://dx.doi.org/10.1093/molbev/msy155
work_keys_str_mv AT schlubtimothye asimplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences
AT buchmannjanp asimplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences
AT holmesedwardc asimplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences
AT schlubtimothye simplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences
AT buchmannjanp simplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences
AT holmesedwardc simplemethodtodetectcandidateoverlappinggenesinvirusesusingsinglegenomesequences