Cargando…

DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data

MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integrati...

Descripción completa

Detalles Bibliográficos
Autores principales: Grassi, Luigi, Harris, Claire, Zhu, Jie, Hardman, Colin, Hatton, Diane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502153/
https://www.ncbi.nlm.nih.gov/pubmed/33978747
http://dx.doi.org/10.1093/bioinformatics/btab366
_version_ 1784795636394098688
author Grassi, Luigi
Harris, Claire
Zhu, Jie
Hardman, Colin
Hatton, Diane
author_facet Grassi, Luigi
Harris, Claire
Zhu, Jie
Hardman, Colin
Hatton, Diane
author_sort Grassi, Luigi
collection PubMed
description MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integration of the plasmid into the host genome. Rapid identification of the integration sites is highly desirable in order to characterize these engineered cell lines. RESULTS: We developed detectIS: a pipeline specifically designed to identify genomic integration sites of exogenous DNA, either a plasmid containing one or more transgenes or a virus. The pipeline is based on a Nextflow workflow combined with a Singularity image containing all the necessary software, ensuring high reproducibility and scalability of the analysis. We tested it on simulated datasets and RNA-seq data from a human sample infected with Hepatitis B virus. Comparisons with other state of the art tools show that our method can identify the integration site in different recombinant cell lines, with accurate results, lower computational demand and shorter execution times. AVAILABILITY AND IMPLEMENTATION: The Nextflow workflow, the Singularity image and a test dataset are available at https://github.com/AstraZeneca/detectIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9502153
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95021532022-09-26 DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data Grassi, Luigi Harris, Claire Zhu, Jie Hardman, Colin Hatton, Diane Bioinformatics Applications Notes MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integration of the plasmid into the host genome. Rapid identification of the integration sites is highly desirable in order to characterize these engineered cell lines. RESULTS: We developed detectIS: a pipeline specifically designed to identify genomic integration sites of exogenous DNA, either a plasmid containing one or more transgenes or a virus. The pipeline is based on a Nextflow workflow combined with a Singularity image containing all the necessary software, ensuring high reproducibility and scalability of the analysis. We tested it on simulated datasets and RNA-seq data from a human sample infected with Hepatitis B virus. Comparisons with other state of the art tools show that our method can identify the integration site in different recombinant cell lines, with accurate results, lower computational demand and shorter execution times. AVAILABILITY AND IMPLEMENTATION: The Nextflow workflow, the Singularity image and a test dataset are available at https://github.com/AstraZeneca/detectIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-12 /pmc/articles/PMC9502153/ /pubmed/33978747 http://dx.doi.org/10.1093/bioinformatics/btab366 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Grassi, Luigi
Harris, Claire
Zhu, Jie
Hardman, Colin
Hatton, Diane
DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title_full DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title_fullStr DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title_full_unstemmed DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title_short DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
title_sort detectis: a pipeline to rapidly detect exogenous dna integration sites using dna or rna paired-end sequencing data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502153/
https://www.ncbi.nlm.nih.gov/pubmed/33978747
http://dx.doi.org/10.1093/bioinformatics/btab366
work_keys_str_mv AT grassiluigi detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata
AT harrisclaire detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata
AT zhujie detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata
AT hardmancolin detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata
AT hattondiane detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata