Cargando…

Detection of atypical genes in virus families using a one-class SVM

BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESUL...

Descripción completa

Detalles Bibliográficos
Autores principales: Metzler, Saskia, Kalinina, Olga V
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210486/
https://www.ncbi.nlm.nih.gov/pubmed/25336138
http://dx.doi.org/10.1186/1471-2164-15-913
_version_ 1782341380392091648
author Metzler, Saskia
Kalinina, Olga V
author_facet Metzler, Saskia
Kalinina, Olga V
author_sort Metzler, Saskia
collection PubMed
description BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESULTS: We explore the possibility for the systematic identification of atypical genes within virus families, including viruses whose genome is not encoded by a double-stranded DNA. Our method is based on gene statistical features that differ in genes that were subject of recent horizontal gene transfer from those of the genome in which they are observed. We employ a one-class SVM approach to detect atypical genes within a virus family basing of their statistical signatures and without explicit knowledge of the source species. The simplicity of the statistical features used makes the method applicable to various viruses irrespective of their genome size or type. CONCLUSIONS: On simulated data, the method can robustly identify alien genes irrespective of the coding nucleic acid found in a virus. It also compares well to results obtained in related studies for double-stranded DNA viruses. Its value in practice is confirmed by the identification of isolated examples of horizontal gene transfer events that have already been described in the literature. A Python package implementing the method and the results for the analyzed virus families are available at http://svm-agp.bioinf.mpi-inf.mpg.de.
format Online
Article
Text
id pubmed-4210486
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42104862014-10-29 Detection of atypical genes in virus families using a one-class SVM Metzler, Saskia Kalinina, Olga V BMC Genomics Methodology Article BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESULTS: We explore the possibility for the systematic identification of atypical genes within virus families, including viruses whose genome is not encoded by a double-stranded DNA. Our method is based on gene statistical features that differ in genes that were subject of recent horizontal gene transfer from those of the genome in which they are observed. We employ a one-class SVM approach to detect atypical genes within a virus family basing of their statistical signatures and without explicit knowledge of the source species. The simplicity of the statistical features used makes the method applicable to various viruses irrespective of their genome size or type. CONCLUSIONS: On simulated data, the method can robustly identify alien genes irrespective of the coding nucleic acid found in a virus. It also compares well to results obtained in related studies for double-stranded DNA viruses. Its value in practice is confirmed by the identification of isolated examples of horizontal gene transfer events that have already been described in the literature. A Python package implementing the method and the results for the analyzed virus families are available at http://svm-agp.bioinf.mpi-inf.mpg.de. BioMed Central 2014-10-20 /pmc/articles/PMC4210486/ /pubmed/25336138 http://dx.doi.org/10.1186/1471-2164-15-913 Text en © Metzler and Kalinina; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Metzler, Saskia
Kalinina, Olga V
Detection of atypical genes in virus families using a one-class SVM
title Detection of atypical genes in virus families using a one-class SVM
title_full Detection of atypical genes in virus families using a one-class SVM
title_fullStr Detection of atypical genes in virus families using a one-class SVM
title_full_unstemmed Detection of atypical genes in virus families using a one-class SVM
title_short Detection of atypical genes in virus families using a one-class SVM
title_sort detection of atypical genes in virus families using a one-class svm
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210486/
https://www.ncbi.nlm.nih.gov/pubmed/25336138
http://dx.doi.org/10.1186/1471-2164-15-913
work_keys_str_mv AT metzlersaskia detectionofatypicalgenesinvirusfamiliesusingaoneclasssvm
AT kalininaolgav detectionofatypicalgenesinvirusfamiliesusingaoneclasssvm