Cargando…

Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)

PREMISE: Most phylogenomic library preparation methods and bioinformatic analysis tools in restriction site–associated DNA sequencing (RADseq)/genotyping‐by‐sequencing (GBS) studies are designed for use with Illumina data. The lack of alternative bioinformatic pipelines hinders the exploration of lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Dorfner, Marco, Ott, Tankred, Ott, Philipp, Oberprieler, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9215276/
https://www.ncbi.nlm.nih.gov/pubmed/35774992
http://dx.doi.org/10.1002/aps3.11484
_version_ 1784731174885654528
author Dorfner, Marco
Ott, Tankred
Ott, Philipp
Oberprieler, Christoph
author_facet Dorfner, Marco
Ott, Tankred
Ott, Philipp
Oberprieler, Christoph
author_sort Dorfner, Marco
collection PubMed
description PREMISE: Most phylogenomic library preparation methods and bioinformatic analysis tools in restriction site–associated DNA sequencing (RADseq)/genotyping‐by‐sequencing (GBS) studies are designed for use with Illumina data. The lack of alternative bioinformatic pipelines hinders the exploration of long‐read multi‐locus data from other sequencing platforms. The Simple Long‐read loci Assembly of Nanopore data for Genotyping (SLANG) pipeline enables locus assembly, orthology estimation, and single‐nucleotide polymorphism (SNP) calling using Nanopore‐sequenced multi‐locus data. METHODS AND RESULTS: Two test libraries (Leucanthemum spp., Senecio spp.; Compositae) were prepared using an amplified fragment length polymorphism (AFLP)‐based method to reduce genome complexity, then Nanopore‐sequenced, and analyzed with SLANG. We identified 704 and 448 orthologous loci with 12,368 and 10,048 SNPs, respectively. The constructed phylogenetic networks were identical to a GBS network produced using Leucanthemum Illumina data and were consistent with Senecio species circumscriptions based on morphology. CONCLUSIONS: SLANG identifies orthologous loci and extracts SNPs from long‐read multi‐locus Nanopore data for phylogenetic inference, population genetics, or phylogeographical studies. Combined with an AFLP‐based library preparation, SLANG provides an easily scalable, cost‐effective, and affordable alternative to Illumina‐based RADseq/GBS procedures.
format Online
Article
Text
id pubmed-9215276
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92152762022-06-29 Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping) Dorfner, Marco Ott, Tankred Ott, Philipp Oberprieler, Christoph Appl Plant Sci Software Note PREMISE: Most phylogenomic library preparation methods and bioinformatic analysis tools in restriction site–associated DNA sequencing (RADseq)/genotyping‐by‐sequencing (GBS) studies are designed for use with Illumina data. The lack of alternative bioinformatic pipelines hinders the exploration of long‐read multi‐locus data from other sequencing platforms. The Simple Long‐read loci Assembly of Nanopore data for Genotyping (SLANG) pipeline enables locus assembly, orthology estimation, and single‐nucleotide polymorphism (SNP) calling using Nanopore‐sequenced multi‐locus data. METHODS AND RESULTS: Two test libraries (Leucanthemum spp., Senecio spp.; Compositae) were prepared using an amplified fragment length polymorphism (AFLP)‐based method to reduce genome complexity, then Nanopore‐sequenced, and analyzed with SLANG. We identified 704 and 448 orthologous loci with 12,368 and 10,048 SNPs, respectively. The constructed phylogenetic networks were identical to a GBS network produced using Leucanthemum Illumina data and were consistent with Senecio species circumscriptions based on morphology. CONCLUSIONS: SLANG identifies orthologous loci and extracts SNPs from long‐read multi‐locus Nanopore data for phylogenetic inference, population genetics, or phylogeographical studies. Combined with an AFLP‐based library preparation, SLANG provides an easily scalable, cost‐effective, and affordable alternative to Illumina‐based RADseq/GBS procedures. John Wiley and Sons Inc. 2022-06-14 /pmc/articles/PMC9215276/ /pubmed/35774992 http://dx.doi.org/10.1002/aps3.11484 Text en © 2022 The Authors. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Software Note
Dorfner, Marco
Ott, Tankred
Ott, Philipp
Oberprieler, Christoph
Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title_full Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title_fullStr Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title_full_unstemmed Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title_short Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping)
title_sort long‐read genotyping with slang (simple long‐read loci assembly of nanopore data for genotyping)
topic Software Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9215276/
https://www.ncbi.nlm.nih.gov/pubmed/35774992
http://dx.doi.org/10.1002/aps3.11484
work_keys_str_mv AT dorfnermarco longreadgenotypingwithslangsimplelongreadlociassemblyofnanoporedataforgenotyping
AT otttankred longreadgenotypingwithslangsimplelongreadlociassemblyofnanoporedataforgenotyping
AT ottphilipp longreadgenotypingwithslangsimplelongreadlociassemblyofnanoporedataforgenotyping
AT oberprielerchristoph longreadgenotypingwithslangsimplelongreadlociassemblyofnanoporedataforgenotyping