Cargando…
DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In thi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112953/ https://www.ncbi.nlm.nih.gov/pubmed/37010501 http://dx.doi.org/10.1093/bioinformatics/btad152 |
_version_ | 1785027724022120448 |
---|---|
author | Marriott, Heather Kabiljo, Renata Al Khleifat, Ahmad Dobson, Richard J Al-Chalabi, Ammar Iacoangeli, Alfredo |
author_facet | Marriott, Heather Kabiljo, Renata Al Khleifat, Ahmad Dobson, Richard J Al-Chalabi, Ammar Iacoangeli, Alfredo |
author_sort | Marriott, Heather |
collection | PubMed |
description | SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2. |
format | Online Article Text |
id | pubmed-10112953 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101129532023-04-19 DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data Marriott, Heather Kabiljo, Renata Al Khleifat, Ahmad Dobson, Richard J Al-Chalabi, Ammar Iacoangeli, Alfredo Bioinformatics Applications Note SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2. Oxford University Press 2023-04-03 /pmc/articles/PMC10112953/ /pubmed/37010501 http://dx.doi.org/10.1093/bioinformatics/btad152 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Marriott, Heather Kabiljo, Renata Al Khleifat, Ahmad Dobson, Richard J Al-Chalabi, Ammar Iacoangeli, Alfredo DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title_full | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title_fullStr | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title_full_unstemmed | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title_short | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
title_sort | dnascan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112953/ https://www.ncbi.nlm.nih.gov/pubmed/37010501 http://dx.doi.org/10.1093/bioinformatics/btad152 |
work_keys_str_mv | AT marriottheather dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata AT kabiljorenata dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata AT alkhleifatahmad dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata AT dobsonrichardj dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata AT alchalabiammar dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata AT iacoangelialfredo dnascan2aversatilescalableanduserfriendlyanalysispipelineforhumannextgenerationsequencingdata |