Cargando…

Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering

Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark–recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering...

Descripción completa

Detalles Bibliográficos
Autores principales: Sethi, Suresh A., Linden, Daniel, Wenburg, John, Lewis, Cara, Lemons, Patrick, Fuller, Angela, Hare, Matthew P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210676/
https://www.ncbi.nlm.nih.gov/pubmed/28083094
http://dx.doi.org/10.1098/rsos.160457
_version_ 1782490933481177088
author Sethi, Suresh A.
Linden, Daniel
Wenburg, John
Lewis, Cara
Lemons, Patrick
Fuller, Angela
Hare, Matthew P.
author_facet Sethi, Suresh A.
Linden, Daniel
Wenburg, John
Lewis, Cara
Lemons, Patrick
Fuller, Angela
Hare, Matthew P.
author_sort Sethi, Suresh A.
collection PubMed
description Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark–recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering algorithms to group samples into sets of recaptures based upon pairwise match calls, these tools can be used to reconstruct accurate capture histories for mark–recapture modelling. Here, we assess the performance of a recently introduced error-tolerant likelihood-based match-calling model and sample clustering algorithm for genetic mark–recapture studies. We assessed both biallelic (i.e. single nucleotide polymorphisms; SNP) and multiallelic (i.e. microsatellite; MSAT) markers using a combination of simulation analyses and case study data on Pacific walrus (Odobenus rosmarus divergens) and fishers (Pekania pennanti). A novel two-stage clustering approach is demonstrated for genetic mark–recapture applications. First, repeat captures within a sampling occasion are identified. Subsequently, recaptures across sampling occasions are identified. The likelihood-based matching protocol performed well in simulation trials, demonstrating utility for use in a wide range of genetic mark–recapture studies. Moderately sized SNP (64+) and MSAT (10–15) panels produced accurate match calls for recaptures and accurate non-match calls for samples from closely related individuals in the face of low to moderate genotyping error. Furthermore, matching performance remained stable or increased as the number of genetic markers increased, genotyping error notwithstanding.
format Online
Article
Text
id pubmed-5210676
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher The Royal Society Publishing
record_format MEDLINE/PubMed
spelling pubmed-52106762017-01-12 Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering Sethi, Suresh A. Linden, Daniel Wenburg, John Lewis, Cara Lemons, Patrick Fuller, Angela Hare, Matthew P. R Soc Open Sci Genetics Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark–recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering algorithms to group samples into sets of recaptures based upon pairwise match calls, these tools can be used to reconstruct accurate capture histories for mark–recapture modelling. Here, we assess the performance of a recently introduced error-tolerant likelihood-based match-calling model and sample clustering algorithm for genetic mark–recapture studies. We assessed both biallelic (i.e. single nucleotide polymorphisms; SNP) and multiallelic (i.e. microsatellite; MSAT) markers using a combination of simulation analyses and case study data on Pacific walrus (Odobenus rosmarus divergens) and fishers (Pekania pennanti). A novel two-stage clustering approach is demonstrated for genetic mark–recapture applications. First, repeat captures within a sampling occasion are identified. Subsequently, recaptures across sampling occasions are identified. The likelihood-based matching protocol performed well in simulation trials, demonstrating utility for use in a wide range of genetic mark–recapture studies. Moderately sized SNP (64+) and MSAT (10–15) panels produced accurate match calls for recaptures and accurate non-match calls for samples from closely related individuals in the face of low to moderate genotyping error. Furthermore, matching performance remained stable or increased as the number of genetic markers increased, genotyping error notwithstanding. The Royal Society Publishing 2016-12-21 /pmc/articles/PMC5210676/ /pubmed/28083094 http://dx.doi.org/10.1098/rsos.160457 Text en © 2016 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Genetics
Sethi, Suresh A.
Linden, Daniel
Wenburg, John
Lewis, Cara
Lemons, Patrick
Fuller, Angela
Hare, Matthew P.
Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title_full Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title_fullStr Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title_full_unstemmed Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title_short Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
title_sort accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210676/
https://www.ncbi.nlm.nih.gov/pubmed/28083094
http://dx.doi.org/10.1098/rsos.160457
work_keys_str_mv AT sethisuresha accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT lindendaniel accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT wenburgjohn accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT lewiscara accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT lemonspatrick accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT fullerangela accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering
AT harematthewp accuraterecaptureidentificationforgeneticmarkrecapturestudieswitherrortolerantlikelihoodbasedmatchcallingandsampleclustering