Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data

Advancements in massively parallel short-read sequencing technologies and the associated decreasing costs have led to large and diverse variant discovery efforts across species. However, processing high-throughput short-read sequencing data can be challenging with potential pitfalls and bioinformati...

Bibliographic Details
Main Authors:	Cullen, Jonah N; Friedenberg, Steven G
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2023
Subjects:	Software and Data Resources
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411559/ https://www.ncbi.nlm.nih.gov/pubmed/37243692 http://dx.doi.org/10.1093/g3journal/jkad117

author	Cullen, Jonah N Friedenberg, Steven G
author_sort	Cullen, Jonah N
collection	PubMed
description	Advancements in massively parallel short-read sequencing technologies and the associated decreasing costs have led to large and diverse variant discovery efforts across species. However, processing high-throughput short-read sequencing data can be challenging with potential pitfalls and bioinformatics bottlenecks in generating reproducible results. Although a number of pipelines exist that address these challenges, these are often geared toward human or traditional model organism species and can be difficult to configure across institutions. Whole Animal Genome Sequencing (WAGS) is an open-source set of user-friendly, containerized pipelines designed to simplify the process of identifying germline short (SNP and indel) and structural variants (SVs) geared toward the veterinary community but adaptable to any species with a suitable reference genome. We present a description of the pipelines [adapted from the best practices of the Genome Analysis Toolkit (GATK)], along with benchmarking data from both the preprocessing and joint genotyping steps, consistent with a typical user workflow.
format	Online Article Text
id	pubmed-10411559
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-10411559 2023-08-10 Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data Cullen, Jonah N Friedenberg, Steven G G3 (Bethesda) Software and Data Resources Advancements in massively parallel short-read sequencing technologies and the associated decreasing costs have led to large and diverse variant discovery efforts across species. However, processing high-throughput short-read sequencing data can be challenging with potential pitfalls and bioinformatics bottlenecks in generating reproducible results. Although a number of pipelines exist that address these challenges, these are often geared toward human or traditional model organism species and can be difficult to configure across institutions. Whole Animal Genome Sequencing (WAGS) is an open-source set of user-friendly, containerized pipelines designed to simplify the process of identifying germline short (SNP and indel) and structural variants (SVs) geared toward the veterinary community but adaptable to any species with a suitable reference genome. We present a description of the pipelines [adapted from the best practices of the Genome Analysis Toolkit (GATK)], along with benchmarking data from both the preprocessing and joint genotyping steps, consistent with a typical user workflow. Oxford University Press 2023-05-27 /pmc/articles/PMC10411559/ /pubmed/37243692 http://dx.doi.org/10.1093/g3journal/jkad117 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of The Genetics Society of America. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title	Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data
topic	Software and Data Resources
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411559/ https://www.ncbi.nlm.nih.gov/pubmed/37243692 http://dx.doi.org/10.1093/g3journal/jkad117

Similar Items