Cargando…

PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples

MOTIVATION: Next generation sequencing (NGS) has provided researchers with a powerful tool to characterize metagenomic and clinical samples in research and diagnostic settings. NGS allows an open view into samples useful for pathogen detection in an unbiased fashion and without prior hypothesis abou...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrusch, Andreas, Dabrowski, Piotr W, Klenner, Jeanette, Tausch, Simon H, Kohl, Claudia, Osman, Abdalla A, Renard, Bernhard Y, Nitsche, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129269/
https://www.ncbi.nlm.nih.gov/pubmed/30423069
http://dx.doi.org/10.1093/bioinformatics/bty595
_version_ 1783353770961272832
author Andrusch, Andreas
Dabrowski, Piotr W
Klenner, Jeanette
Tausch, Simon H
Kohl, Claudia
Osman, Abdalla A
Renard, Bernhard Y
Nitsche, Andreas
author_facet Andrusch, Andreas
Dabrowski, Piotr W
Klenner, Jeanette
Tausch, Simon H
Kohl, Claudia
Osman, Abdalla A
Renard, Bernhard Y
Nitsche, Andreas
author_sort Andrusch, Andreas
collection PubMed
description MOTIVATION: Next generation sequencing (NGS) has provided researchers with a powerful tool to characterize metagenomic and clinical samples in research and diagnostic settings. NGS allows an open view into samples useful for pathogen detection in an unbiased fashion and without prior hypothesis about possible causative agents. However, NGS datasets for pathogen detection come with different obstacles, such as a very unfavorable ratio of pathogen to host reads. Alongside often appearing false positives and irrelevant organisms, such as contaminants, tools are often challenged by samples with low pathogen loads and might not report organisms present below a certain threshold. Furthermore, some metagenomic profiling tools are only focused on one particular set of pathogens, for example bacteria. RESULTS: We present PAIPline, a bioinformatics pipeline specifically designed to address problems associated with detecting pathogens in diagnostic samples. PAIPline particularly focuses on userfriendliness and encapsulates all necessary steps from preprocessing to resolution of ambiguous reads and filtering up to visualization in a single tool. In contrast to existing tools, PAIPline is more specific while maintaining sensitivity. This is shown in a comparative evaluation where PAIPline was benchmarked along other well-known metagenomic profiling tools on previously published well-characterized datasets. Additionally, as part of an international cooperation project, PAIPline was applied to an outbreak sample of hemorrhagic fevers of then unknown etiology. The presented results show that PAIPline can serve as a robust, reliable, user-friendly, adaptable and generalizable stand-alone software for diagnostics from NGS samples and as a stepping stone for further downstream analyses. AVAILABILITY AND IMPLEMENTATION: PAIPline is freely available under https://gitlab.com/rki_bioinformatics/paipline.
format Online
Article
Text
id pubmed-6129269
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61292692018-09-12 PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples Andrusch, Andreas Dabrowski, Piotr W Klenner, Jeanette Tausch, Simon H Kohl, Claudia Osman, Abdalla A Renard, Bernhard Y Nitsche, Andreas Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings MOTIVATION: Next generation sequencing (NGS) has provided researchers with a powerful tool to characterize metagenomic and clinical samples in research and diagnostic settings. NGS allows an open view into samples useful for pathogen detection in an unbiased fashion and without prior hypothesis about possible causative agents. However, NGS datasets for pathogen detection come with different obstacles, such as a very unfavorable ratio of pathogen to host reads. Alongside often appearing false positives and irrelevant organisms, such as contaminants, tools are often challenged by samples with low pathogen loads and might not report organisms present below a certain threshold. Furthermore, some metagenomic profiling tools are only focused on one particular set of pathogens, for example bacteria. RESULTS: We present PAIPline, a bioinformatics pipeline specifically designed to address problems associated with detecting pathogens in diagnostic samples. PAIPline particularly focuses on userfriendliness and encapsulates all necessary steps from preprocessing to resolution of ambiguous reads and filtering up to visualization in a single tool. In contrast to existing tools, PAIPline is more specific while maintaining sensitivity. This is shown in a comparative evaluation where PAIPline was benchmarked along other well-known metagenomic profiling tools on previously published well-characterized datasets. Additionally, as part of an international cooperation project, PAIPline was applied to an outbreak sample of hemorrhagic fevers of then unknown etiology. The presented results show that PAIPline can serve as a robust, reliable, user-friendly, adaptable and generalizable stand-alone software for diagnostics from NGS samples and as a stepping stone for further downstream analyses. AVAILABILITY AND IMPLEMENTATION: PAIPline is freely available under https://gitlab.com/rki_bioinformatics/paipline. Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129269/ /pubmed/30423069 http://dx.doi.org/10.1093/bioinformatics/bty595 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2018: European Conference on Computational Biology Proceedings
Andrusch, Andreas
Dabrowski, Piotr W
Klenner, Jeanette
Tausch, Simon H
Kohl, Claudia
Osman, Abdalla A
Renard, Bernhard Y
Nitsche, Andreas
PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title_full PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title_fullStr PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title_full_unstemmed PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title_short PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
title_sort paipline: pathogen identification in metagenomic and clinical next generation sequencing samples
topic Eccb 2018: European Conference on Computational Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129269/
https://www.ncbi.nlm.nih.gov/pubmed/30423069
http://dx.doi.org/10.1093/bioinformatics/bty595
work_keys_str_mv AT andruschandreas paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT dabrowskipiotrw paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT klennerjeanette paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT tauschsimonh paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT kohlclaudia paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT osmanabdallaa paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT renardbernhardy paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples
AT nitscheandreas paiplinepathogenidentificationinmetagenomicandclinicalnextgenerationsequencingsamples