Cargando…

A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens

Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies asso...

Descripción completa

Detalles Bibliográficos
Autores principales: Farhat, Maha R, Shapiro, B Jesse, Sheppard, Samuel K, Colijn, Caroline, Murray, Megan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4256898/
https://www.ncbi.nlm.nih.gov/pubmed/25484920
http://dx.doi.org/10.1186/s13073-014-0101-7
_version_ 1782347646049976320
author Farhat, Maha R
Shapiro, B Jesse
Sheppard, Samuel K
Colijn, Caroline
Murray, Megan
author_facet Farhat, Maha R
Shapiro, B Jesse
Sheppard, Samuel K
Colijn, Caroline
Murray, Megan
author_sort Farhat, Maha R
collection PubMed
description Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype and genotype. There are specific challenges when identifying genetic associations in bacteria which often comprise highly structured populations. Here we consider general methodological questions related to sampling and analysis focusing on clonal to moderately recombining pathogens. We propose that a matched sampling scheme constitutes an efficient study design, and provide a power calculator based on phylogenetic convergence. We demonstrate this approach by applying it to genomic datasets for two microbial pathogens: Mycobacterium tuberculosis and Campylobacter species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-014-0101-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4256898
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42568982014-12-06 A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens Farhat, Maha R Shapiro, B Jesse Sheppard, Samuel K Colijn, Caroline Murray, Megan Genome Med Method Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype and genotype. There are specific challenges when identifying genetic associations in bacteria which often comprise highly structured populations. Here we consider general methodological questions related to sampling and analysis focusing on clonal to moderately recombining pathogens. We propose that a matched sampling scheme constitutes an efficient study design, and provide a power calculator based on phylogenetic convergence. We demonstrate this approach by applying it to genomic datasets for two microbial pathogens: Mycobacterium tuberculosis and Campylobacter species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-014-0101-7) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-15 /pmc/articles/PMC4256898/ /pubmed/25484920 http://dx.doi.org/10.1186/s13073-014-0101-7 Text en © Farhat et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Farhat, Maha R
Shapiro, B Jesse
Sheppard, Samuel K
Colijn, Caroline
Murray, Megan
A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title_full A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title_fullStr A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title_full_unstemmed A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title_short A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
title_sort phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4256898/
https://www.ncbi.nlm.nih.gov/pubmed/25484920
http://dx.doi.org/10.1186/s13073-014-0101-7
work_keys_str_mv AT farhatmahar aphylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT shapirobjesse aphylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT sheppardsamuelk aphylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT colijncaroline aphylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT murraymegan aphylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT farhatmahar phylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT shapirobjesse phylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT sheppardsamuelk phylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT colijncaroline phylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens
AT murraymegan phylogenybasedsamplingstrategyandpowercalculatorinformsgenomewideassociationsstudydesignformicrobialpathogens