Cargando…

Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species

Short tandem repeats (STRs) are highly informative genetic markers that have been used extensively in population genetics analysis. They are an important source of genetic diversity and can also have functional impact. Despite the availability of bioinformatic methods that permit large-scale genome-...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Jiru, Munro, Jacob E., Kocoski, Anthony, Barry, Alyssa E., Bahlo, Melanie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8782505/
https://www.ncbi.nlm.nih.gov/pubmed/35007277
http://dx.doi.org/10.1371/journal.pgen.1009604
_version_ 1784638331008581632
author Han, Jiru
Munro, Jacob E.
Kocoski, Anthony
Barry, Alyssa E.
Bahlo, Melanie
author_facet Han, Jiru
Munro, Jacob E.
Kocoski, Anthony
Barry, Alyssa E.
Bahlo, Melanie
author_sort Han, Jiru
collection PubMed
description Short tandem repeats (STRs) are highly informative genetic markers that have been used extensively in population genetics analysis. They are an important source of genetic diversity and can also have functional impact. Despite the availability of bioinformatic methods that permit large-scale genome-wide genotyping of STRs from whole genome sequencing data, they have not previously been applied to sequencing data from large collections of malaria parasite field samples. Here, we have genotyped STRs using HipSTR in more than 3,000 Plasmodium falciparum and 174 Plasmodium vivax published whole-genome sequence data from samples collected across the globe. High levels of noise and variability in the resultant callset necessitated the development of a novel method for quality control of STR genotype calls. A set of high-quality STR loci (6,768 from P. falciparum and 3,496 from P. vivax) were used to study Plasmodium genetic diversity, population structures and genomic signatures of selection and these were compared to genome-wide single nucleotide polymorphism (SNP) genotyping data. In addition, the genome-wide information about genetic variation and other characteristics of STRs in P. falciparum and P. vivax have been available in an interactive web-based R Shiny application PlasmoSTR (https://github.com/bahlolab/PlasmoSTR).
format Online
Article
Text
id pubmed-8782505
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87825052022-01-22 Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species Han, Jiru Munro, Jacob E. Kocoski, Anthony Barry, Alyssa E. Bahlo, Melanie PLoS Genet Research Article Short tandem repeats (STRs) are highly informative genetic markers that have been used extensively in population genetics analysis. They are an important source of genetic diversity and can also have functional impact. Despite the availability of bioinformatic methods that permit large-scale genome-wide genotyping of STRs from whole genome sequencing data, they have not previously been applied to sequencing data from large collections of malaria parasite field samples. Here, we have genotyped STRs using HipSTR in more than 3,000 Plasmodium falciparum and 174 Plasmodium vivax published whole-genome sequence data from samples collected across the globe. High levels of noise and variability in the resultant callset necessitated the development of a novel method for quality control of STR genotype calls. A set of high-quality STR loci (6,768 from P. falciparum and 3,496 from P. vivax) were used to study Plasmodium genetic diversity, population structures and genomic signatures of selection and these were compared to genome-wide single nucleotide polymorphism (SNP) genotyping data. In addition, the genome-wide information about genetic variation and other characteristics of STRs in P. falciparum and P. vivax have been available in an interactive web-based R Shiny application PlasmoSTR (https://github.com/bahlolab/PlasmoSTR). Public Library of Science 2022-01-10 /pmc/articles/PMC8782505/ /pubmed/35007277 http://dx.doi.org/10.1371/journal.pgen.1009604 Text en © 2022 Han et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Han, Jiru
Munro, Jacob E.
Kocoski, Anthony
Barry, Alyssa E.
Bahlo, Melanie
Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title_full Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title_fullStr Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title_full_unstemmed Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title_short Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species
title_sort population-level genome-wide str discovery and validation for population structure and genetic diversity assessment of plasmodium species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8782505/
https://www.ncbi.nlm.nih.gov/pubmed/35007277
http://dx.doi.org/10.1371/journal.pgen.1009604
work_keys_str_mv AT hanjiru populationlevelgenomewidestrdiscoveryandvalidationforpopulationstructureandgeneticdiversityassessmentofplasmodiumspecies
AT munrojacobe populationlevelgenomewidestrdiscoveryandvalidationforpopulationstructureandgeneticdiversityassessmentofplasmodiumspecies
AT kocoskianthony populationlevelgenomewidestrdiscoveryandvalidationforpopulationstructureandgeneticdiversityassessmentofplasmodiumspecies
AT barryalyssae populationlevelgenomewidestrdiscoveryandvalidationforpopulationstructureandgeneticdiversityassessmentofplasmodiumspecies
AT bahlomelanie populationlevelgenomewidestrdiscoveryandvalidationforpopulationstructureandgeneticdiversityassessmentofplasmodiumspecies