Cargando…

SAMStat 2: quality control for next generation sequencing data

MOTIVATION: SAMStat is an efficient program to extract quality control metrics from fastq and SAM/BAM files. A distinguishing feature is that it displays sequence composition, base quality composition and mapping error profiles split by mapping quality. This allows users to rapidly identify reasons...

Descripción completa

Detalles Bibliográficos
Autor principal: Lassmann, Timo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850270/
https://www.ncbi.nlm.nih.gov/pubmed/36637208
http://dx.doi.org/10.1093/bioinformatics/btad019
_version_ 1784872147039027200
author Lassmann, Timo
author_facet Lassmann, Timo
author_sort Lassmann, Timo
collection PubMed
description MOTIVATION: SAMStat is an efficient program to extract quality control metrics from fastq and SAM/BAM files. A distinguishing feature is that it displays sequence composition, base quality composition and mapping error profiles split by mapping quality. This allows users to rapidly identify reasons for poor mapping including the presence of untrimmed adapters or poor sequencing quality at individual read positions. RESULTS: Here, we present a major update to SAMStat. The new version now supports paired-end and long-read data. Quality control plots are drawn using the ploty javascript library. AVAILABILITY AND IMPLEMENTATION: The source code of SAMStat and code to reproduce the results are found here: https://github.com/timolassmann/samstat.
format Online
Article
Text
id pubmed-9850270
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98502702023-01-20 SAMStat 2: quality control for next generation sequencing data Lassmann, Timo Bioinformatics Applications Note MOTIVATION: SAMStat is an efficient program to extract quality control metrics from fastq and SAM/BAM files. A distinguishing feature is that it displays sequence composition, base quality composition and mapping error profiles split by mapping quality. This allows users to rapidly identify reasons for poor mapping including the presence of untrimmed adapters or poor sequencing quality at individual read positions. RESULTS: Here, we present a major update to SAMStat. The new version now supports paired-end and long-read data. Quality control plots are drawn using the ploty javascript library. AVAILABILITY AND IMPLEMENTATION: The source code of SAMStat and code to reproduce the results are found here: https://github.com/timolassmann/samstat. Oxford University Press 2023-01-13 /pmc/articles/PMC9850270/ /pubmed/36637208 http://dx.doi.org/10.1093/bioinformatics/btad019 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Lassmann, Timo
SAMStat 2: quality control for next generation sequencing data
title SAMStat 2: quality control for next generation sequencing data
title_full SAMStat 2: quality control for next generation sequencing data
title_fullStr SAMStat 2: quality control for next generation sequencing data
title_full_unstemmed SAMStat 2: quality control for next generation sequencing data
title_short SAMStat 2: quality control for next generation sequencing data
title_sort samstat 2: quality control for next generation sequencing data
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850270/
https://www.ncbi.nlm.nih.gov/pubmed/36637208
http://dx.doi.org/10.1093/bioinformatics/btad019
work_keys_str_mv AT lassmanntimo samstat2qualitycontrolfornextgenerationsequencingdata