Cargando…

Genome-wide epistasis and co-selection study using mutual information

Covariance-based discovery of polymorphisms under co-selective pressure or epistasis has received considerable recent attention in population genomics. Both statistical modeling of the population level covariation of alleles across the chromosome and model-free testing of dependencies between pairs...

Descripción completa

Detalles Bibliográficos
Autores principales: Pensar, Johan, Puranen, Santeri, Arnold, Brian, MacAlasdair, Neil, Kuronen, Juri, Tonkin-Hill, Gerry, Pesonen, Maiju, Xu, Yingying, Sipola, Aleksi, Sánchez-Busó, Leonor, Lees, John A, Chewapreecha, Claire, Bentley, Stephen D, Harris, Simon R, Parkhill, Julian, Croucher, Nicholas J, Corander, Jukka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6765119/
https://www.ncbi.nlm.nih.gov/pubmed/31361894
http://dx.doi.org/10.1093/nar/gkz656
_version_ 1783454505544712192
author Pensar, Johan
Puranen, Santeri
Arnold, Brian
MacAlasdair, Neil
Kuronen, Juri
Tonkin-Hill, Gerry
Pesonen, Maiju
Xu, Yingying
Sipola, Aleksi
Sánchez-Busó, Leonor
Lees, John A
Chewapreecha, Claire
Bentley, Stephen D
Harris, Simon R
Parkhill, Julian
Croucher, Nicholas J
Corander, Jukka
author_facet Pensar, Johan
Puranen, Santeri
Arnold, Brian
MacAlasdair, Neil
Kuronen, Juri
Tonkin-Hill, Gerry
Pesonen, Maiju
Xu, Yingying
Sipola, Aleksi
Sánchez-Busó, Leonor
Lees, John A
Chewapreecha, Claire
Bentley, Stephen D
Harris, Simon R
Parkhill, Julian
Croucher, Nicholas J
Corander, Jukka
author_sort Pensar, Johan
collection PubMed
description Covariance-based discovery of polymorphisms under co-selective pressure or epistasis has received considerable recent attention in population genomics. Both statistical modeling of the population level covariation of alleles across the chromosome and model-free testing of dependencies between pairs of polymorphisms have been shown to successfully uncover patterns of selection in bacterial populations. Here we introduce a model-free method, SpydrPick, whose computational efficiency enables analysis at the scale of pan-genomes of many bacteria. SpydrPick incorporates an efficient correction for population structure, which adjusts for the phylogenetic signal in the data without requiring an explicit phylogenetic tree. We also introduce a new type of visualization of the results similar to the Manhattan plots used in genome-wide association studies, which enables rapid exploration of the identified signals of co-evolution. Simulations demonstrate the usefulness of our method and give some insight to when this type of analysis is most likely to be successful. Application of the method to large population genomic datasets of two major human pathogens, Streptococcus pneumoniae and Neisseria meningitidis, revealed both previously identified and novel putative targets of co-selection related to virulence and antibiotic resistance, highlighting the potential of this approach to drive molecular discoveries, even in the absence of phenotypic data.
format Online
Article
Text
id pubmed-6765119
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67651192019-10-02 Genome-wide epistasis and co-selection study using mutual information Pensar, Johan Puranen, Santeri Arnold, Brian MacAlasdair, Neil Kuronen, Juri Tonkin-Hill, Gerry Pesonen, Maiju Xu, Yingying Sipola, Aleksi Sánchez-Busó, Leonor Lees, John A Chewapreecha, Claire Bentley, Stephen D Harris, Simon R Parkhill, Julian Croucher, Nicholas J Corander, Jukka Nucleic Acids Res Methods Online Covariance-based discovery of polymorphisms under co-selective pressure or epistasis has received considerable recent attention in population genomics. Both statistical modeling of the population level covariation of alleles across the chromosome and model-free testing of dependencies between pairs of polymorphisms have been shown to successfully uncover patterns of selection in bacterial populations. Here we introduce a model-free method, SpydrPick, whose computational efficiency enables analysis at the scale of pan-genomes of many bacteria. SpydrPick incorporates an efficient correction for population structure, which adjusts for the phylogenetic signal in the data without requiring an explicit phylogenetic tree. We also introduce a new type of visualization of the results similar to the Manhattan plots used in genome-wide association studies, which enables rapid exploration of the identified signals of co-evolution. Simulations demonstrate the usefulness of our method and give some insight to when this type of analysis is most likely to be successful. Application of the method to large population genomic datasets of two major human pathogens, Streptococcus pneumoniae and Neisseria meningitidis, revealed both previously identified and novel putative targets of co-selection related to virulence and antibiotic resistance, highlighting the potential of this approach to drive molecular discoveries, even in the absence of phenotypic data. Oxford University Press 2019-10-10 2019-07-30 /pmc/articles/PMC6765119/ /pubmed/31361894 http://dx.doi.org/10.1093/nar/gkz656 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Pensar, Johan
Puranen, Santeri
Arnold, Brian
MacAlasdair, Neil
Kuronen, Juri
Tonkin-Hill, Gerry
Pesonen, Maiju
Xu, Yingying
Sipola, Aleksi
Sánchez-Busó, Leonor
Lees, John A
Chewapreecha, Claire
Bentley, Stephen D
Harris, Simon R
Parkhill, Julian
Croucher, Nicholas J
Corander, Jukka
Genome-wide epistasis and co-selection study using mutual information
title Genome-wide epistasis and co-selection study using mutual information
title_full Genome-wide epistasis and co-selection study using mutual information
title_fullStr Genome-wide epistasis and co-selection study using mutual information
title_full_unstemmed Genome-wide epistasis and co-selection study using mutual information
title_short Genome-wide epistasis and co-selection study using mutual information
title_sort genome-wide epistasis and co-selection study using mutual information
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6765119/
https://www.ncbi.nlm.nih.gov/pubmed/31361894
http://dx.doi.org/10.1093/nar/gkz656
work_keys_str_mv AT pensarjohan genomewideepistasisandcoselectionstudyusingmutualinformation
AT puranensanteri genomewideepistasisandcoselectionstudyusingmutualinformation
AT arnoldbrian genomewideepistasisandcoselectionstudyusingmutualinformation
AT macalasdairneil genomewideepistasisandcoselectionstudyusingmutualinformation
AT kuronenjuri genomewideepistasisandcoselectionstudyusingmutualinformation
AT tonkinhillgerry genomewideepistasisandcoselectionstudyusingmutualinformation
AT pesonenmaiju genomewideepistasisandcoselectionstudyusingmutualinformation
AT xuyingying genomewideepistasisandcoselectionstudyusingmutualinformation
AT sipolaaleksi genomewideepistasisandcoselectionstudyusingmutualinformation
AT sanchezbusoleonor genomewideepistasisandcoselectionstudyusingmutualinformation
AT leesjohna genomewideepistasisandcoselectionstudyusingmutualinformation
AT chewapreechaclaire genomewideepistasisandcoselectionstudyusingmutualinformation
AT bentleystephend genomewideepistasisandcoselectionstudyusingmutualinformation
AT harrissimonr genomewideepistasisandcoselectionstudyusingmutualinformation
AT parkhilljulian genomewideepistasisandcoselectionstudyusingmutualinformation
AT crouchernicholasj genomewideepistasisandcoselectionstudyusingmutualinformation
AT coranderjukka genomewideepistasisandcoselectionstudyusingmutualinformation