Cargando…

PEPR: pipelines for evaluating prokaryotic references

The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes,...

Descripción completa

Detalles Bibliográficos
Autores principales: Olson, Nathan D., Zook, Justin M., Samarov, Daniel V., Jackson, Scott A., Salit, Marc L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4819933/
https://www.ncbi.nlm.nih.gov/pubmed/26935931
http://dx.doi.org/10.1007/s00216-015-9299-5
_version_ 1782425315741532160
author Olson, Nathan D.
Zook, Justin M.
Samarov, Daniel V.
Jackson, Scott A.
Salit, Marc L.
author_facet Olson, Nathan D.
Zook, Justin M.
Samarov, Daniel V.
Jackson, Scott A.
Salit, Marc L.
author_sort Olson, Nathan D.
collection PubMed
description The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes, improving confidence in microbial whole genome sequencing results. We have developed a reproducible and transparent bioinformatics tool, PEPR, Pipelines for Evaluating Prokaryotic References, for characterizing the reference genome of prokaryotic genomic materials. PEPR evaluates the quality, purity, and homogeneity of the reference material genome, and purity of the genomic material. The quality of the genome is evaluated using high coverage paired-end sequence data; coverage, paired-end read size and direction, as well as soft-clipping rates, are used to identify mis-assemblies. The homogeneity and purity of the material relative to the reference genome are characterized by comparing base calls from replicate datasets generated using multiple sequencing technologies. Genomic purity of the material is assessed by checking for DNA contaminants. We demonstrate the tool and its output using sequencing data while developing a Staphylococcus aureus candidate genomic reference material. PEPR is open source and available at https://github.com/usnistgov/pepr. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00216-015-9299-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4819933
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-48199332016-04-11 PEPR: pipelines for evaluating prokaryotic references Olson, Nathan D. Zook, Justin M. Samarov, Daniel V. Jackson, Scott A. Salit, Marc L. Anal Bioanal Chem Research Paper The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes, improving confidence in microbial whole genome sequencing results. We have developed a reproducible and transparent bioinformatics tool, PEPR, Pipelines for Evaluating Prokaryotic References, for characterizing the reference genome of prokaryotic genomic materials. PEPR evaluates the quality, purity, and homogeneity of the reference material genome, and purity of the genomic material. The quality of the genome is evaluated using high coverage paired-end sequence data; coverage, paired-end read size and direction, as well as soft-clipping rates, are used to identify mis-assemblies. The homogeneity and purity of the material relative to the reference genome are characterized by comparing base calls from replicate datasets generated using multiple sequencing technologies. Genomic purity of the material is assessed by checking for DNA contaminants. We demonstrate the tool and its output using sequencing data while developing a Staphylococcus aureus candidate genomic reference material. PEPR is open source and available at https://github.com/usnistgov/pepr. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00216-015-9299-5) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2016-03-02 2016 /pmc/articles/PMC4819933/ /pubmed/26935931 http://dx.doi.org/10.1007/s00216-015-9299-5 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research Paper
Olson, Nathan D.
Zook, Justin M.
Samarov, Daniel V.
Jackson, Scott A.
Salit, Marc L.
PEPR: pipelines for evaluating prokaryotic references
title PEPR: pipelines for evaluating prokaryotic references
title_full PEPR: pipelines for evaluating prokaryotic references
title_fullStr PEPR: pipelines for evaluating prokaryotic references
title_full_unstemmed PEPR: pipelines for evaluating prokaryotic references
title_short PEPR: pipelines for evaluating prokaryotic references
title_sort pepr: pipelines for evaluating prokaryotic references
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4819933/
https://www.ncbi.nlm.nih.gov/pubmed/26935931
http://dx.doi.org/10.1007/s00216-015-9299-5
work_keys_str_mv AT olsonnathand peprpipelinesforevaluatingprokaryoticreferences
AT zookjustinm peprpipelinesforevaluatingprokaryoticreferences
AT samarovdanielv peprpipelinesforevaluatingprokaryoticreferences
AT jacksonscotta peprpipelinesforevaluatingprokaryoticreferences
AT salitmarcl peprpipelinesforevaluatingprokaryoticreferences