Cargando…

Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species

The profiling of epigenetic marks like DNA methylation has become a central aspect of studies in evolution and ecology. Bisulphite sequencing is commonly used for assessing genome‐wide DNA methylation at single nucleotide resolution but these data can also provide information on genetic variants lik...

Descripción completa

Detalles Bibliográficos
Autores principales: Lindner, Melanie, Gawehns, Fleur, te Molder, Sebastiaan, Visser, Marcel E., van Oers, Kees, Laine, Veronika N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9290141/
https://www.ncbi.nlm.nih.gov/pubmed/34435438
http://dx.doi.org/10.1111/1755-0998.13493
_version_ 1784748820800733184
author Lindner, Melanie
Gawehns, Fleur
te Molder, Sebastiaan
Visser, Marcel E.
van Oers, Kees
Laine, Veronika N.
author_facet Lindner, Melanie
Gawehns, Fleur
te Molder, Sebastiaan
Visser, Marcel E.
van Oers, Kees
Laine, Veronika N.
author_sort Lindner, Melanie
collection PubMed
description The profiling of epigenetic marks like DNA methylation has become a central aspect of studies in evolution and ecology. Bisulphite sequencing is commonly used for assessing genome‐wide DNA methylation at single nucleotide resolution but these data can also provide information on genetic variants like single nucleotide polymorphisms (SNPs). However, bisulphite conversion causes unmethylated cytosines to appear as thymines, complicating the alignment and subsequent SNP calling. Several tools have been developed to overcome this challenge, but there is no independent evaluation of such tools for non‐model species, which often lack genomic references. Here, we used whole‐genome bisulphite sequencing (WGBS) data from four female great tits (Parus major) to evaluate the performance of seven tools for SNP calling from bisulphite sequencing data. We used SNPs from whole‐genome resequencing data of the same samples as baseline SNPs to assess common performance metrics like sensitivity, precision, and the number of true positive, false positive, and false negative SNPs for the full range of variant and genotype quality values. We found clear differences between the tools in either optimizing precision (bis‐snp), sensitivity (biscuit), or a compromise between both (all other tools). Overall, the choice of SNP caller strongly depends on which performance parameter should be maximized and whether ascertainment bias should be minimized to optimize downstream analysis, highlighting the need for studies that assess such differences.
format Online
Article
Text
id pubmed-9290141
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92901412022-07-20 Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species Lindner, Melanie Gawehns, Fleur te Molder, Sebastiaan Visser, Marcel E. van Oers, Kees Laine, Veronika N. Mol Ecol Resour RESOURCE ARTICLES The profiling of epigenetic marks like DNA methylation has become a central aspect of studies in evolution and ecology. Bisulphite sequencing is commonly used for assessing genome‐wide DNA methylation at single nucleotide resolution but these data can also provide information on genetic variants like single nucleotide polymorphisms (SNPs). However, bisulphite conversion causes unmethylated cytosines to appear as thymines, complicating the alignment and subsequent SNP calling. Several tools have been developed to overcome this challenge, but there is no independent evaluation of such tools for non‐model species, which often lack genomic references. Here, we used whole‐genome bisulphite sequencing (WGBS) data from four female great tits (Parus major) to evaluate the performance of seven tools for SNP calling from bisulphite sequencing data. We used SNPs from whole‐genome resequencing data of the same samples as baseline SNPs to assess common performance metrics like sensitivity, precision, and the number of true positive, false positive, and false negative SNPs for the full range of variant and genotype quality values. We found clear differences between the tools in either optimizing precision (bis‐snp), sensitivity (biscuit), or a compromise between both (all other tools). Overall, the choice of SNP caller strongly depends on which performance parameter should be maximized and whether ascertainment bias should be minimized to optimize downstream analysis, highlighting the need for studies that assess such differences. John Wiley and Sons Inc. 2021-09-06 2022-02 /pmc/articles/PMC9290141/ /pubmed/34435438 http://dx.doi.org/10.1111/1755-0998.13493 Text en © 2021 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle RESOURCE ARTICLES
Lindner, Melanie
Gawehns, Fleur
te Molder, Sebastiaan
Visser, Marcel E.
van Oers, Kees
Laine, Veronika N.
Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title_full Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title_fullStr Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title_full_unstemmed Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title_short Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
title_sort performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9290141/
https://www.ncbi.nlm.nih.gov/pubmed/34435438
http://dx.doi.org/10.1111/1755-0998.13493
work_keys_str_mv AT lindnermelanie performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies
AT gawehnsfleur performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies
AT temoldersebastiaan performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies
AT vissermarcele performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies
AT vanoerskees performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies
AT laineveronikan performanceofmethodstodetectgeneticvariantsfrombisulphitesequencingdatainanonmodelspecies