Cargando…

Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses

BACKGROUND: Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research. Yet, the field of next-generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hin...

Descripción completa

Detalles Bibliográficos
Autores principales: Larivière, Delphine, Wickham, Laura, Keiler, Kenneth, Nekrutenko, Anton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8178898/
https://www.ncbi.nlm.nih.gov/pubmed/34090324
http://dx.doi.org/10.1186/s12866-021-02184-4
_version_ 1783703669345091584
author Larivière, Delphine
Wickham, Laura
Keiler, Kenneth
Nekrutenko, Anton
author_facet Larivière, Delphine
Wickham, Laura
Keiler, Kenneth
Nekrutenko, Anton
author_sort Larivière, Delphine
collection PubMed
description BACKGROUND: Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research. Yet, the field of next-generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hindered by local computational resource limitations, and does not offer widely accepted standards. One such “problem areas” is the analysis of Transposon Insertion Sequencing (TIS) data. TIS allows probing of almost the entire genome of a microorganism by introducing random insertions of transposon-derived constructs. The impact of the insertions on the survival and growth under specific conditions provides precise information about genes affecting specific phenotypic characteristics. A wide array of tools has been developed to analyze TIS data. Among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis. RESULTS: Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments. Using data from two recent TIS studies, we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome. Particular attention was paid to quality control procedures, such as determining the optimal tool parameters for the analysis and removal of contamination. CONCLUSIONS: Our work provides an assessment of the currently available tools for TIS data analysis. It offers ready to use workflows that can be invoked by anyone in the world using our public Galaxy platform (https://usegalaxy.org). To lower the entry barriers, we have also developed interactive tutorials explaining details of TIS data analysis procedures at https://bit.ly/gxy-tis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12866-021-02184-4).
format Online
Article
Text
id pubmed-8178898
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81788982021-06-07 Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses Larivière, Delphine Wickham, Laura Keiler, Kenneth Nekrutenko, Anton BMC Microbiol Methodology Article BACKGROUND: Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research. Yet, the field of next-generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hindered by local computational resource limitations, and does not offer widely accepted standards. One such “problem areas” is the analysis of Transposon Insertion Sequencing (TIS) data. TIS allows probing of almost the entire genome of a microorganism by introducing random insertions of transposon-derived constructs. The impact of the insertions on the survival and growth under specific conditions provides precise information about genes affecting specific phenotypic characteristics. A wide array of tools has been developed to analyze TIS data. Among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis. RESULTS: Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments. Using data from two recent TIS studies, we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome. Particular attention was paid to quality control procedures, such as determining the optimal tool parameters for the analysis and removal of contamination. CONCLUSIONS: Our work provides an assessment of the currently available tools for TIS data analysis. It offers ready to use workflows that can be invoked by anyone in the world using our public Galaxy platform (https://usegalaxy.org). To lower the entry barriers, we have also developed interactive tutorials explaining details of TIS data analysis procedures at https://bit.ly/gxy-tis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12866-021-02184-4). BioMed Central 2021-06-05 /pmc/articles/PMC8178898/ /pubmed/34090324 http://dx.doi.org/10.1186/s12866-021-02184-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Larivière, Delphine
Wickham, Laura
Keiler, Kenneth
Nekrutenko, Anton
Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title_full Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title_fullStr Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title_full_unstemmed Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title_short Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses
title_sort reproducible and accessible analysis of transposon insertion sequencing in galaxy for qualitative essentiality analyses
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8178898/
https://www.ncbi.nlm.nih.gov/pubmed/34090324
http://dx.doi.org/10.1186/s12866-021-02184-4
work_keys_str_mv AT larivieredelphine reproducibleandaccessibleanalysisoftransposoninsertionsequencingingalaxyforqualitativeessentialityanalyses
AT wickhamlaura reproducibleandaccessibleanalysisoftransposoninsertionsequencingingalaxyforqualitativeessentialityanalyses
AT keilerkenneth reproducibleandaccessibleanalysisoftransposoninsertionsequencingingalaxyforqualitativeessentialityanalyses
AT nekrutenkoanton reproducibleandaccessibleanalysisoftransposoninsertionsequencingingalaxyforqualitativeessentialityanalyses
AT reproducibleandaccessibleanalysisoftransposoninsertionsequencingingalaxyforqualitativeessentialityanalyses