Cargando…

GiRaF: robust, computational identification of influenza reassortments via graph mining

Reassortments in the influenza virus—a process where strains exchange genetic segments—have been implicated in two out of three pandemics of the 20th century as well as the 2009 H1N1 outbreak. While advances in sequencing have led to an explosion in the number of whole-genome sequences that are avai...

Descripción completa

Detalles Bibliográficos
Autores principales: Nagarajan, Niranjan, Kingsford, Carl
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064795/
https://www.ncbi.nlm.nih.gov/pubmed/21177643
http://dx.doi.org/10.1093/nar/gkq1232
_version_ 1782200921495699456
author Nagarajan, Niranjan
Kingsford, Carl
author_facet Nagarajan, Niranjan
Kingsford, Carl
author_sort Nagarajan, Niranjan
collection PubMed
description Reassortments in the influenza virus—a process where strains exchange genetic segments—have been implicated in two out of three pandemics of the 20th century as well as the 2009 H1N1 outbreak. While advances in sequencing have led to an explosion in the number of whole-genome sequences that are available, an understanding of the rate and distribution of reassortments and their role in viral evolution is still lacking. An important factor in this is the paucity of automated tools for confident identification of reassortments from sequence data due to the challenges of analyzing large, uncertain viral phylogenies. We describe here a novel computational method, called GiRaF (Graph-incompatibility-based Reassortment Finder), that robustly identifies reassortments in a fully automated fashion while accounting for uncertainties in the inferred phylogenies. The algorithms behind GiRaF search large collections of Markov chain Monte Carlo (MCMC)-sampled trees for groups of incompatible splits using a fast biclique enumeration algorithm coupled with several statistical tests to identify sets of taxa with differential phylogenetic placement. GiRaF correctly finds known reassortments in human, avian, and swine influenza populations, including the evolutionary events that led to the recent ‘swine flu’ outbreak. GiRaF also identifies several previously unreported reassortments via whole-genome studies to catalog events in H5N1 and swine influenza isolates.
format Text
id pubmed-3064795
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30647952011-03-28 GiRaF: robust, computational identification of influenza reassortments via graph mining Nagarajan, Niranjan Kingsford, Carl Nucleic Acids Res Methods Online Reassortments in the influenza virus—a process where strains exchange genetic segments—have been implicated in two out of three pandemics of the 20th century as well as the 2009 H1N1 outbreak. While advances in sequencing have led to an explosion in the number of whole-genome sequences that are available, an understanding of the rate and distribution of reassortments and their role in viral evolution is still lacking. An important factor in this is the paucity of automated tools for confident identification of reassortments from sequence data due to the challenges of analyzing large, uncertain viral phylogenies. We describe here a novel computational method, called GiRaF (Graph-incompatibility-based Reassortment Finder), that robustly identifies reassortments in a fully automated fashion while accounting for uncertainties in the inferred phylogenies. The algorithms behind GiRaF search large collections of Markov chain Monte Carlo (MCMC)-sampled trees for groups of incompatible splits using a fast biclique enumeration algorithm coupled with several statistical tests to identify sets of taxa with differential phylogenetic placement. GiRaF correctly finds known reassortments in human, avian, and swine influenza populations, including the evolutionary events that led to the recent ‘swine flu’ outbreak. GiRaF also identifies several previously unreported reassortments via whole-genome studies to catalog events in H5N1 and swine influenza isolates. Oxford University Press 2011-03 2010-12-21 /pmc/articles/PMC3064795/ /pubmed/21177643 http://dx.doi.org/10.1093/nar/gkq1232 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Nagarajan, Niranjan
Kingsford, Carl
GiRaF: robust, computational identification of influenza reassortments via graph mining
title GiRaF: robust, computational identification of influenza reassortments via graph mining
title_full GiRaF: robust, computational identification of influenza reassortments via graph mining
title_fullStr GiRaF: robust, computational identification of influenza reassortments via graph mining
title_full_unstemmed GiRaF: robust, computational identification of influenza reassortments via graph mining
title_short GiRaF: robust, computational identification of influenza reassortments via graph mining
title_sort giraf: robust, computational identification of influenza reassortments via graph mining
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064795/
https://www.ncbi.nlm.nih.gov/pubmed/21177643
http://dx.doi.org/10.1093/nar/gkq1232
work_keys_str_mv AT nagarajanniranjan girafrobustcomputationalidentificationofinfluenzareassortmentsviagraphmining
AT kingsfordcarl girafrobustcomputationalidentificationofinfluenzareassortmentsviagraphmining