The impact of sequencing depth and relatedness of the reference genome in population genomic studies: A case study with two caddisfly species (Trichoptera, Rhyacophilidae, Himalopsyche)

Whole genome sequencing for generating SNP data is increasingly used in population genetic studies. However, obtaining genomes for massive numbers of samples is still not within the budgets of many researchers. It is thus imperative to select an appropriate reference genome and sequencing depth to e...

Bibliographic Details
Main authors: Deng, Xi‐Ling, Frandsen, Paul B., Dikow, Rebecca B., Favre, Adrien, Shah, Deep Narayan, Shah, Ram Devi Tachamo, Schneider, Julio V., Heckenhauer, Jacqueline, Pauls, Steffen U.
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9745013/
https://www.ncbi.nlm.nih.gov/pubmed/36523526
http://dx.doi.org/10.1002/ece3.9583
_version_ 1784849047311351808
author Deng, Xi‐Ling
Frandsen, Paul B.
Dikow, Rebecca B.
Favre, Adrien
Shah, Deep Narayan
Shah, Ram Devi Tachamo
Schneider, Julio V.
Heckenhauer, Jacqueline
Pauls, Steffen U.
author_facet Deng, Xi‐Ling
Frandsen, Paul B.
Dikow, Rebecca B.
Favre, Adrien
Shah, Deep Narayan
Shah, Ram Devi Tachamo
Schneider, Julio V.
Heckenhauer, Jacqueline
Pauls, Steffen U.
author_sort Deng, Xi‐Ling
collection PubMed
description Whole genome sequencing for generating SNP data is increasingly used in population genetic studies. However, obtaining genomes for massive numbers of samples is still not within the budgets of many researchers. It is thus imperative to select an appropriate reference genome and sequencing depth to ensure the accuracy of the results for a specific research question, while balancing cost and feasibility. To evaluate the effect of the choice of the reference genome and sequencing depth on downstream analyses, we used five confamilial reference genomes of variable relatedness and three levels of sequencing depth (3.5×, 7.5× and 12×) in a population genomic study on two caddisfly species: Himalopsyche digitata and H. tibetana. Using these 30 datasets (five reference genomes × three depths × two target species), we estimated population genetic indices (inbreeding coefficient, nucleotide diversity, pairwise F_ST, and genome‐wide distribution of F_ST) based on variants and population structure (PCA and admixture) based on genotype likelihood estimates. The results showed that both distantly related reference genomes and lower sequencing depth lead to degradation of resolution. In addition, choosing a more closely related reference genome may significantly remedy the defects caused by low depth. Therefore, we conclude that population genetic studies would benefit from closely related reference genomes, especially as the costs of obtaining a high‐quality reference genome continue to decrease. However, to determine a cost‐efficient strategy for a specific population genomic study, a trade‐off between reference genome relatedness and sequencing depth can be considered.
format Online
Article
Text
id pubmed-9745013
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-9745013 2022-12-14 The impact of sequencing depth and relatedness of the reference genome in population genomic studies: A case study with two caddisfly species (Trichoptera, Rhyacophilidae, Himalopsyche) Deng, Xi‐Ling Frandsen, Paul B. Dikow, Rebecca B. Favre, Adrien Shah, Deep Narayan Shah, Ram Devi Tachamo Schneider, Julio V. Heckenhauer, Jacqueline Pauls, Steffen U. Ecol Evol Research Articles Whole genome sequencing for generating SNP data is increasingly used in population genetic studies. However, obtaining genomes for massive numbers of samples is still not within the budgets of many researchers. It is thus imperative to select an appropriate reference genome and sequencing depth to ensure the accuracy of the results for a specific research question, while balancing cost and feasibility. To evaluate the effect of the choice of the reference genome and sequencing depth on downstream analyses, we used five confamilial reference genomes of variable relatedness and three levels of sequencing depth (3.5×, 7.5× and 12×) in a population genomic study on two caddisfly species: Himalopsyche digitata and H. tibetana. Using these 30 datasets (five reference genomes × three depths × two target species), we estimated population genetic indices (inbreeding coefficient, nucleotide diversity, pairwise F_ST, and genome‐wide distribution of F_ST) based on variants and population structure (PCA and admixture) based on genotype likelihood estimates. The results showed that both distantly related reference genomes and lower sequencing depth lead to degradation of resolution. In addition, choosing a more closely related reference genome may significantly remedy the defects caused by low depth. Therefore, we conclude that population genetic studies would benefit from closely related reference genomes, especially as the costs of obtaining a high‐quality reference genome continue to decrease. However, to determine a cost‐efficient strategy for a specific population genomic study, a trade‐off between reference genome relatedness and sequencing depth can be considered. John Wiley and Sons Inc. 2022-12-12 /pmc/articles/PMC9745013/ /pubmed/36523526 http://dx.doi.org/10.1002/ece3.9583 Text en © 2022 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Deng, Xi‐Ling
Frandsen, Paul B.
Dikow, Rebecca B.
Favre, Adrien
Shah, Deep Narayan
Shah, Ram Devi Tachamo
Schneider, Julio V.
Heckenhauer, Jacqueline
Pauls, Steffen U.
The impact of sequencing depth and relatedness of the reference genome in population genomic studies: A case study with two caddisfly species (Trichoptera, Rhyacophilidae, Himalopsyche)
title The impact of sequencing depth and relatedness of the reference genome in population genomic studies: A case study with two caddisfly species (Trichoptera, Rhyacophilidae, Himalopsyche)
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9745013/
https://www.ncbi.nlm.nih.gov/pubmed/36523526
http://dx.doi.org/10.1002/ece3.9583