Cargando…

Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses

When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power...

Descripción completa

Detalles Bibliográficos
Autores principales: Duckett, Drew J., Calder, Kailee, Sullivan, Jack, Tank, David C., Carstens, Bryan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688678/
https://www.ncbi.nlm.nih.gov/pubmed/38032899
http://dx.doi.org/10.1371/journal.pone.0291941
_version_ 1785152213452062720
author Duckett, Drew J.
Calder, Kailee
Sullivan, Jack
Tank, David C.
Carstens, Bryan C.
author_facet Duckett, Drew J.
Calder, Kailee
Sullivan, Jack
Tank, David C.
Carstens, Bryan C.
author_sort Duckett, Drew J.
collection PubMed
description When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power and accuracy of various analyses are expected to differ depending on both the type of marker and the amount of data collected, an exploration of the variance across methodological results as a function of marker type should provide valuable information to researchers. Here we collect mitochondrial Cytochrome b sequences, whole mitochondrial genomes, single nucleotide polymorphisms (SNP)s isolated using a genotype by sequencing (GBS) protocol, sequences from ultraconserved elements, and low-coverage nuclear genomes from the North American water vole (Microtus richardsoni). We estimate genetic distances, population genetic structure, and historical demography using data from each of these datasets and compare the results across markers. As anticipated, the results exhibit differences across marker types, particularly in terms of the resolution offered by different analyses. A cost-benefit analysis indicates that SNPs collected using a GBS protocol are the most cost-effective molecular marker, with inferences that mirror those collected from the whole genome data at a fraction of the cost per sample.
format Online
Article
Text
id pubmed-10688678
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-106886782023-12-01 Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses Duckett, Drew J. Calder, Kailee Sullivan, Jack Tank, David C. Carstens, Bryan C. PLoS One Research Article When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power and accuracy of various analyses are expected to differ depending on both the type of marker and the amount of data collected, an exploration of the variance across methodological results as a function of marker type should provide valuable information to researchers. Here we collect mitochondrial Cytochrome b sequences, whole mitochondrial genomes, single nucleotide polymorphisms (SNP)s isolated using a genotype by sequencing (GBS) protocol, sequences from ultraconserved elements, and low-coverage nuclear genomes from the North American water vole (Microtus richardsoni). We estimate genetic distances, population genetic structure, and historical demography using data from each of these datasets and compare the results across markers. As anticipated, the results exhibit differences across marker types, particularly in terms of the resolution offered by different analyses. A cost-benefit analysis indicates that SNPs collected using a GBS protocol are the most cost-effective molecular marker, with inferences that mirror those collected from the whole genome data at a fraction of the cost per sample. Public Library of Science 2023-11-30 /pmc/articles/PMC10688678/ /pubmed/38032899 http://dx.doi.org/10.1371/journal.pone.0291941 Text en © 2023 Duckett et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Duckett, Drew J.
Calder, Kailee
Sullivan, Jack
Tank, David C.
Carstens, Bryan C.
Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title_full Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title_fullStr Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title_full_unstemmed Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title_short Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
title_sort reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688678/
https://www.ncbi.nlm.nih.gov/pubmed/38032899
http://dx.doi.org/10.1371/journal.pone.0291941
work_keys_str_mv AT duckettdrewj reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses
AT calderkailee reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses
AT sullivanjack reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses
AT tankdavidc reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses
AT carstensbryanc reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses