Cargando…

DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data

SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Marriott, Heather, Kabiljo, Renata, Al Khleifat, Ahmad, Dobson, Richard J, Al-Chalabi, Ammar, Iacoangeli, Alfredo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112953/
https://www.ncbi.nlm.nih.gov/pubmed/37010501
http://dx.doi.org/10.1093/bioinformatics/btad152
_version_ 1785027724022120448
author Marriott, Heather
Kabiljo, Renata
Al Khleifat, Ahmad
Dobson, Richard J
Al-Chalabi, Ammar
Iacoangeli, Alfredo
author_facet Marriott, Heather
Kabiljo, Renata
Al Khleifat, Ahmad
Dobson, Richard J
Al-Chalabi, Ammar
Iacoangeli, Alfredo
author_sort Marriott, Heather
collection PubMed
description SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2.
format Online
Article
Text
id pubmed-10112953
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101129532023-04-19 DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data Marriott, Heather Kabiljo, Renata Al Khleifat, Ahmad Dobson, Richard J Al-Chalabi, Ammar Iacoangeli, Alfredo Bioinformatics Applications Note SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2. Oxford University Press 2023-04-03 /pmc/articles/PMC10112953/ /pubmed/37010501 http://dx.doi.org/10.1093/bioinformatics/btad152 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Marriott, Heather
Kabiljo, Renata
Al Khleifat, Ahmad
Dobson, Richard J
Al-Chalabi, Ammar
Iacoangeli, Alfredo
DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title_full DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title_fullStr DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title_full_unstemmed DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title_short DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
title_sort dnascan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112953/
https://www.ncbi.nlm.nih.gov/pubmed/37010501
http://dx.doi.org/10.1093/bioinformatics/btad152
work_keys_str_mv AT marriottheather dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata
AT kabiljorenata dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata
AT alkhleifatahmad dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata
AT dobsonrichardj dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata
AT alchalabiammar dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata
AT iacoangelialfredo dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata