Cargando…

MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data

Next‐generation sequencing has greatly promoted the investigation of single nucleotide polymorphisms, while studies of simple sequence repeats are sharply decreasing. However, simple sequence repeats still present some advantages in conservation genetics. In this study, an end‐to‐end pipeline referr...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Liang, Yang, Quan, Yang, Jing‐Wen, Zhang, Nan, Liu, Bao‐Suo, Zhu, Ke‐Cheng, Guo, Hua‐Yang, Jiang, Shi‐Gui, Zhang, Dian‐Chang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083706/
https://www.ncbi.nlm.nih.gov/pubmed/32211176
http://dx.doi.org/10.1002/ece3.6121
_version_ 1783508575271780352
author Guo, Liang
Yang, Quan
Yang, Jing‐Wen
Zhang, Nan
Liu, Bao‐Suo
Zhu, Ke‐Cheng
Guo, Hua‐Yang
Jiang, Shi‐Gui
Zhang, Dian‐Chang
author_facet Guo, Liang
Yang, Quan
Yang, Jing‐Wen
Zhang, Nan
Liu, Bao‐Suo
Zhu, Ke‐Cheng
Guo, Hua‐Yang
Jiang, Shi‐Gui
Zhang, Dian‐Chang
author_sort Guo, Liang
collection PubMed
description Next‐generation sequencing has greatly promoted the investigation of single nucleotide polymorphisms, while studies of simple sequence repeats are sharply decreasing. However, simple sequence repeats still present some advantages in conservation genetics. In this study, an end‐to‐end pipeline referred to as MultiplexSSR was established to develop multiplex PCR assays in batches with highly polymorphic simple sequence repeats for capillary platforms from resequencing data. The distribution of single sequence repeats in the genome, the error profiles of genotypes and allelotypes, and the increase in the allele length range depending on the number of individuals were investigated. A total of 98% of single sequence repeats presented lengths of less than 100 bp. The error rate of the genotyping and allelotyping of dimeric patterns was ten times higher than those for other patterns. The error rate of allelotyping was less than that of genotyping. The allele length range reached approximate saturation with 10 individuals. This pipeline uses allele numbers to select highly polymorphic loci, masks loci with variation, and applies in silico PCR to improve primer specificity. The application of the developed multiplex SSR‐PCR assays validated the pipeline's robustness, showing higher polymorphism and stability for the developed simple sequence repeats and a lower cost for genotyping and providing low‐depth resequencing data from less than a dozen individuals for the development of markers. This pipeline fills the gap between next‐generation sequencing and multiplex SSR‐PCR.
format Online
Article
Text
id pubmed-7083706
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-70837062020-03-24 MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data Guo, Liang Yang, Quan Yang, Jing‐Wen Zhang, Nan Liu, Bao‐Suo Zhu, Ke‐Cheng Guo, Hua‐Yang Jiang, Shi‐Gui Zhang, Dian‐Chang Ecol Evol Original Research Next‐generation sequencing has greatly promoted the investigation of single nucleotide polymorphisms, while studies of simple sequence repeats are sharply decreasing. However, simple sequence repeats still present some advantages in conservation genetics. In this study, an end‐to‐end pipeline referred to as MultiplexSSR was established to develop multiplex PCR assays in batches with highly polymorphic simple sequence repeats for capillary platforms from resequencing data. The distribution of single sequence repeats in the genome, the error profiles of genotypes and allelotypes, and the increase in the allele length range depending on the number of individuals were investigated. A total of 98% of single sequence repeats presented lengths of less than 100 bp. The error rate of the genotyping and allelotyping of dimeric patterns was ten times higher than those for other patterns. The error rate of allelotyping was less than that of genotyping. The allele length range reached approximate saturation with 10 individuals. This pipeline uses allele numbers to select highly polymorphic loci, masks loci with variation, and applies in silico PCR to improve primer specificity. The application of the developed multiplex SSR‐PCR assays validated the pipeline's robustness, showing higher polymorphism and stability for the developed simple sequence repeats and a lower cost for genotyping and providing low‐depth resequencing data from less than a dozen individuals for the development of markers. This pipeline fills the gap between next‐generation sequencing and multiplex SSR‐PCR. John Wiley and Sons Inc. 2020-03-04 /pmc/articles/PMC7083706/ /pubmed/32211176 http://dx.doi.org/10.1002/ece3.6121 Text en © 2020 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Guo, Liang
Yang, Quan
Yang, Jing‐Wen
Zhang, Nan
Liu, Bao‐Suo
Zhu, Ke‐Cheng
Guo, Hua‐Yang
Jiang, Shi‐Gui
Zhang, Dian‐Chang
MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title_full MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title_fullStr MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title_full_unstemmed MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title_short MultiplexSSR: A pipeline for developing multiplex SSR‐PCR assays from resequencing data
title_sort multiplexssr: a pipeline for developing multiplex ssr‐pcr assays from resequencing data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083706/
https://www.ncbi.nlm.nih.gov/pubmed/32211176
http://dx.doi.org/10.1002/ece3.6121
work_keys_str_mv AT guoliang multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT yangquan multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT yangjingwen multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT zhangnan multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT liubaosuo multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT zhukecheng multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT guohuayang multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT jiangshigui multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata
AT zhangdianchang multiplexssrapipelinefordevelopingmultiplexssrpcrassaysfromresequencingdata