Cargando…

Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses

Despite environmental, social and ecological dependencies, emergence of zoonotic viruses in human populations is clearly also affected by genetic factors which determine cross-species transmission potential. RNA viruses pose an interesting case study given their mutation rates are orders of magnitud...

Descripción completa

Detalles Bibliográficos
Autores principales: Aguas, Ricardo, Ferguson, Neil M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794897/
https://www.ncbi.nlm.nih.gov/pubmed/24130470
http://dx.doi.org/10.1371/journal.pcbi.1003254
_version_ 1782287285896609792
author Aguas, Ricardo
Ferguson, Neil M.
author_facet Aguas, Ricardo
Ferguson, Neil M.
author_sort Aguas, Ricardo
collection PubMed
description Despite environmental, social and ecological dependencies, emergence of zoonotic viruses in human populations is clearly also affected by genetic factors which determine cross-species transmission potential. RNA viruses pose an interesting case study given their mutation rates are orders of magnitude higher than any other pathogen – as reflected by the recent emergence of SARS and Influenza for example. Here, we show how feature selection techniques can be used to reliably classify viral sequences by host species, and to identify the crucial minority of host-specific sites in pathogen genomic data. The variability in alleles at those sites can be translated into prediction probabilities that a particular pathogen isolate is adapted to a given host. We illustrate the power of these methods by: 1) identifying the sites explaining SARS coronavirus differences between human, bat and palm civet samples; 2) showing how cross species jumps of rabies virus among bat populations can be readily identified; and 3) de novo identification of likely functional influenza host discriminant markers.
format Online
Article
Text
id pubmed-3794897
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37948972013-10-15 Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses Aguas, Ricardo Ferguson, Neil M. PLoS Comput Biol Research Article Despite environmental, social and ecological dependencies, emergence of zoonotic viruses in human populations is clearly also affected by genetic factors which determine cross-species transmission potential. RNA viruses pose an interesting case study given their mutation rates are orders of magnitude higher than any other pathogen – as reflected by the recent emergence of SARS and Influenza for example. Here, we show how feature selection techniques can be used to reliably classify viral sequences by host species, and to identify the crucial minority of host-specific sites in pathogen genomic data. The variability in alleles at those sites can be translated into prediction probabilities that a particular pathogen isolate is adapted to a given host. We illustrate the power of these methods by: 1) identifying the sites explaining SARS coronavirus differences between human, bat and palm civet samples; 2) showing how cross species jumps of rabies virus among bat populations can be readily identified; and 3) de novo identification of likely functional influenza host discriminant markers. Public Library of Science 2013-10-10 /pmc/articles/PMC3794897/ /pubmed/24130470 http://dx.doi.org/10.1371/journal.pcbi.1003254 Text en © 2013 Aguas and Ferguson http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Aguas, Ricardo
Ferguson, Neil M.
Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title_full Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title_fullStr Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title_full_unstemmed Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title_short Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses
title_sort feature selection methods for identifying genetic determinants of host species in rna viruses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794897/
https://www.ncbi.nlm.nih.gov/pubmed/24130470
http://dx.doi.org/10.1371/journal.pcbi.1003254
work_keys_str_mv AT aguasricardo featureselectionmethodsforidentifyinggeneticdeterminantsofhostspeciesinrnaviruses
AT fergusonneilm featureselectionmethodsforidentifyinggeneticdeterminantsofhostspeciesinrnaviruses