Cargando…

FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies

Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Miravet-Verde, Samuel, Burgos, Raul, Delgado, Javier, Lluch-Senar, Maria, Serrano, Luis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515713/
https://www.ncbi.nlm.nih.gov/pubmed/32813015
http://dx.doi.org/10.1093/nar/gkaa679
_version_ 1783586858930798592
author Miravet-Verde, Samuel
Burgos, Raul
Delgado, Javier
Lluch-Senar, Maria
Serrano, Luis
author_facet Miravet-Verde, Samuel
Burgos, Raul
Delgado, Javier
Lluch-Senar, Maria
Serrano, Luis
author_sort Miravet-Verde, Samuel
collection PubMed
description Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of this kind data. In addition, we lack knowledge about artifactual signals and the requirements a dataset has to satisfy to allow accurate prediction. Here, we have developed FASTQINS, a pipeline for the detection of transposon insertions, and ANUBIS, a library of functions to evaluate and correct deviating factors known and uncharacterized until now. ANUBIS implements previously defined essentiality estimate models in addition to new approaches with advantages like not requiring a training set of genes to predict general essentiality. To highlight the applicability of these tools, and provide a set of recommendations on how to analyze transposon sequencing data, we performed a comprehensive study on artifacts corrections and essentiality estimation at a 1.5-bp resolution, in the genome-reduced bacterium Mycoplasma pneumoniae. We envision FASTQINS and ANUBIS to aid in the analysis of Tn-seq procedures and lead to the development of accurate genome essentiality estimates to guide applications such as designing live vaccines or growth optimization.
format Online
Article
Text
id pubmed-7515713
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-75157132020-09-30 FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies Miravet-Verde, Samuel Burgos, Raul Delgado, Javier Lluch-Senar, Maria Serrano, Luis Nucleic Acids Res Methods Online Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of this kind data. In addition, we lack knowledge about artifactual signals and the requirements a dataset has to satisfy to allow accurate prediction. Here, we have developed FASTQINS, a pipeline for the detection of transposon insertions, and ANUBIS, a library of functions to evaluate and correct deviating factors known and uncharacterized until now. ANUBIS implements previously defined essentiality estimate models in addition to new approaches with advantages like not requiring a training set of genes to predict general essentiality. To highlight the applicability of these tools, and provide a set of recommendations on how to analyze transposon sequencing data, we performed a comprehensive study on artifacts corrections and essentiality estimation at a 1.5-bp resolution, in the genome-reduced bacterium Mycoplasma pneumoniae. We envision FASTQINS and ANUBIS to aid in the analysis of Tn-seq procedures and lead to the development of accurate genome essentiality estimates to guide applications such as designing live vaccines or growth optimization. Oxford University Press 2020-08-19 /pmc/articles/PMC7515713/ /pubmed/32813015 http://dx.doi.org/10.1093/nar/gkaa679 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Miravet-Verde, Samuel
Burgos, Raul
Delgado, Javier
Lluch-Senar, Maria
Serrano, Luis
FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title_full FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title_fullStr FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title_full_unstemmed FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title_short FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
title_sort fastqins and anubis: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515713/
https://www.ncbi.nlm.nih.gov/pubmed/32813015
http://dx.doi.org/10.1093/nar/gkaa679
work_keys_str_mv AT miravetverdesamuel fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies
AT burgosraul fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies
AT delgadojavier fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies
AT lluchsenarmaria fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies
AT serranoluis fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies