Cargando…
FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies
Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of thi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515713/ https://www.ncbi.nlm.nih.gov/pubmed/32813015 http://dx.doi.org/10.1093/nar/gkaa679 |
_version_ | 1783586858930798592 |
---|---|
author | Miravet-Verde, Samuel Burgos, Raul Delgado, Javier Lluch-Senar, Maria Serrano, Luis |
author_facet | Miravet-Verde, Samuel Burgos, Raul Delgado, Javier Lluch-Senar, Maria Serrano, Luis |
author_sort | Miravet-Verde, Samuel |
collection | PubMed |
description | Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of this kind data. In addition, we lack knowledge about artifactual signals and the requirements a dataset has to satisfy to allow accurate prediction. Here, we have developed FASTQINS, a pipeline for the detection of transposon insertions, and ANUBIS, a library of functions to evaluate and correct deviating factors known and uncharacterized until now. ANUBIS implements previously defined essentiality estimate models in addition to new approaches with advantages like not requiring a training set of genes to predict general essentiality. To highlight the applicability of these tools, and provide a set of recommendations on how to analyze transposon sequencing data, we performed a comprehensive study on artifacts corrections and essentiality estimation at a 1.5-bp resolution, in the genome-reduced bacterium Mycoplasma pneumoniae. We envision FASTQINS and ANUBIS to aid in the analysis of Tn-seq procedures and lead to the development of accurate genome essentiality estimates to guide applications such as designing live vaccines or growth optimization. |
format | Online Article Text |
id | pubmed-7515713 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-75157132020-09-30 FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies Miravet-Verde, Samuel Burgos, Raul Delgado, Javier Lluch-Senar, Maria Serrano, Luis Nucleic Acids Res Methods Online Transposon sequencing is commonly applied for identifying the minimal set of genes required for cellular life; a major challenge in fields such as evolutionary or synthetic biology. However, the scientific community has no standards at the level of processing, treatment, curation and analysis of this kind data. In addition, we lack knowledge about artifactual signals and the requirements a dataset has to satisfy to allow accurate prediction. Here, we have developed FASTQINS, a pipeline for the detection of transposon insertions, and ANUBIS, a library of functions to evaluate and correct deviating factors known and uncharacterized until now. ANUBIS implements previously defined essentiality estimate models in addition to new approaches with advantages like not requiring a training set of genes to predict general essentiality. To highlight the applicability of these tools, and provide a set of recommendations on how to analyze transposon sequencing data, we performed a comprehensive study on artifacts corrections and essentiality estimation at a 1.5-bp resolution, in the genome-reduced bacterium Mycoplasma pneumoniae. We envision FASTQINS and ANUBIS to aid in the analysis of Tn-seq procedures and lead to the development of accurate genome essentiality estimates to guide applications such as designing live vaccines or growth optimization. Oxford University Press 2020-08-19 /pmc/articles/PMC7515713/ /pubmed/32813015 http://dx.doi.org/10.1093/nar/gkaa679 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Miravet-Verde, Samuel Burgos, Raul Delgado, Javier Lluch-Senar, Maria Serrano, Luis FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title | FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title_full | FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title_fullStr | FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title_full_unstemmed | FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title_short | FASTQINS and ANUBIS: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
title_sort | fastqins and anubis: two bioinformatic tools to explore facts and artifacts in transposon sequencing and essentiality studies |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515713/ https://www.ncbi.nlm.nih.gov/pubmed/32813015 http://dx.doi.org/10.1093/nar/gkaa679 |
work_keys_str_mv | AT miravetverdesamuel fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies AT burgosraul fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies AT delgadojavier fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies AT lluchsenarmaria fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies AT serranoluis fastqinsandanubistwobioinformatictoolstoexplorefactsandartifactsintransposonsequencingandessentialitystudies |