Cargando…
Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses
When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688678/ https://www.ncbi.nlm.nih.gov/pubmed/38032899 http://dx.doi.org/10.1371/journal.pone.0291941 |
_version_ | 1785152213452062720 |
---|---|
author | Duckett, Drew J. Calder, Kailee Sullivan, Jack Tank, David C. Carstens, Bryan C. |
author_facet | Duckett, Drew J. Calder, Kailee Sullivan, Jack Tank, David C. Carstens, Bryan C. |
author_sort | Duckett, Drew J. |
collection | PubMed |
description | When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power and accuracy of various analyses are expected to differ depending on both the type of marker and the amount of data collected, an exploration of the variance across methodological results as a function of marker type should provide valuable information to researchers. Here we collect mitochondrial Cytochrome b sequences, whole mitochondrial genomes, single nucleotide polymorphisms (SNP)s isolated using a genotype by sequencing (GBS) protocol, sequences from ultraconserved elements, and low-coverage nuclear genomes from the North American water vole (Microtus richardsoni). We estimate genetic distances, population genetic structure, and historical demography using data from each of these datasets and compare the results across markers. As anticipated, the results exhibit differences across marker types, particularly in terms of the resolution offered by different analyses. A cost-benefit analysis indicates that SNPs collected using a GBS protocol are the most cost-effective molecular marker, with inferences that mirror those collected from the whole genome data at a fraction of the cost per sample. |
format | Online Article Text |
id | pubmed-10688678 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-106886782023-12-01 Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses Duckett, Drew J. Calder, Kailee Sullivan, Jack Tank, David C. Carstens, Bryan C. PLoS One Research Article When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power and accuracy of various analyses are expected to differ depending on both the type of marker and the amount of data collected, an exploration of the variance across methodological results as a function of marker type should provide valuable information to researchers. Here we collect mitochondrial Cytochrome b sequences, whole mitochondrial genomes, single nucleotide polymorphisms (SNP)s isolated using a genotype by sequencing (GBS) protocol, sequences from ultraconserved elements, and low-coverage nuclear genomes from the North American water vole (Microtus richardsoni). We estimate genetic distances, population genetic structure, and historical demography using data from each of these datasets and compare the results across markers. As anticipated, the results exhibit differences across marker types, particularly in terms of the resolution offered by different analyses. A cost-benefit analysis indicates that SNPs collected using a GBS protocol are the most cost-effective molecular marker, with inferences that mirror those collected from the whole genome data at a fraction of the cost per sample. Public Library of Science 2023-11-30 /pmc/articles/PMC10688678/ /pubmed/38032899 http://dx.doi.org/10.1371/journal.pone.0291941 Text en © 2023 Duckett et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Duckett, Drew J. Calder, Kailee Sullivan, Jack Tank, David C. Carstens, Bryan C. Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title | Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title_full | Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title_fullStr | Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title_full_unstemmed | Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title_short | Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
title_sort | reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688678/ https://www.ncbi.nlm.nih.gov/pubmed/38032899 http://dx.doi.org/10.1371/journal.pone.0291941 |
work_keys_str_mv | AT duckettdrewj reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses AT calderkailee reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses AT sullivanjack reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses AT tankdavidc reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses AT carstensbryanc reducedrepresentationapproachesproducesimilarresultstowholegenomesequencingforsomecommonphylogeographicanalyses |