Cargando…
Detection of atypical genes in virus families using a one-class SVM
BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESUL...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210486/ https://www.ncbi.nlm.nih.gov/pubmed/25336138 http://dx.doi.org/10.1186/1471-2164-15-913 |
_version_ | 1782341380392091648 |
---|---|
author | Metzler, Saskia Kalinina, Olga V |
author_facet | Metzler, Saskia Kalinina, Olga V |
author_sort | Metzler, Saskia |
collection | PubMed |
description | BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESULTS: We explore the possibility for the systematic identification of atypical genes within virus families, including viruses whose genome is not encoded by a double-stranded DNA. Our method is based on gene statistical features that differ in genes that were subject of recent horizontal gene transfer from those of the genome in which they are observed. We employ a one-class SVM approach to detect atypical genes within a virus family basing of their statistical signatures and without explicit knowledge of the source species. The simplicity of the statistical features used makes the method applicable to various viruses irrespective of their genome size or type. CONCLUSIONS: On simulated data, the method can robustly identify alien genes irrespective of the coding nucleic acid found in a virus. It also compares well to results obtained in related studies for double-stranded DNA viruses. Its value in practice is confirmed by the identification of isolated examples of horizontal gene transfer events that have already been described in the literature. A Python package implementing the method and the results for the analyzed virus families are available at http://svm-agp.bioinf.mpi-inf.mpg.de. |
format | Online Article Text |
id | pubmed-4210486 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42104862014-10-29 Detection of atypical genes in virus families using a one-class SVM Metzler, Saskia Kalinina, Olga V BMC Genomics Methodology Article BACKGROUND: The diversity of viruses, the absence of universally common genes in them, and their ability to act as carriers of genetic material make assessment of evolutionary paths of viral genes very difficult. One important factor contributing to this complexity is horizontal gene transfer. RESULTS: We explore the possibility for the systematic identification of atypical genes within virus families, including viruses whose genome is not encoded by a double-stranded DNA. Our method is based on gene statistical features that differ in genes that were subject of recent horizontal gene transfer from those of the genome in which they are observed. We employ a one-class SVM approach to detect atypical genes within a virus family basing of their statistical signatures and without explicit knowledge of the source species. The simplicity of the statistical features used makes the method applicable to various viruses irrespective of their genome size or type. CONCLUSIONS: On simulated data, the method can robustly identify alien genes irrespective of the coding nucleic acid found in a virus. It also compares well to results obtained in related studies for double-stranded DNA viruses. Its value in practice is confirmed by the identification of isolated examples of horizontal gene transfer events that have already been described in the literature. A Python package implementing the method and the results for the analyzed virus families are available at http://svm-agp.bioinf.mpi-inf.mpg.de. BioMed Central 2014-10-20 /pmc/articles/PMC4210486/ /pubmed/25336138 http://dx.doi.org/10.1186/1471-2164-15-913 Text en © Metzler and Kalinina; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Metzler, Saskia Kalinina, Olga V Detection of atypical genes in virus families using a one-class SVM |
title | Detection of atypical genes in virus families using a one-class SVM |
title_full | Detection of atypical genes in virus families using a one-class SVM |
title_fullStr | Detection of atypical genes in virus families using a one-class SVM |
title_full_unstemmed | Detection of atypical genes in virus families using a one-class SVM |
title_short | Detection of atypical genes in virus families using a one-class SVM |
title_sort | detection of atypical genes in virus families using a one-class svm |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210486/ https://www.ncbi.nlm.nih.gov/pubmed/25336138 http://dx.doi.org/10.1186/1471-2164-15-913 |
work_keys_str_mv | AT metzlersaskia detectionofatypicalgenesinvirusfamiliesusingaoneclasssvm AT kalininaolgav detectionofatypicalgenesinvirusfamiliesusingaoneclasssvm |