Cargando…
DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data
MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integrati...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502153/ https://www.ncbi.nlm.nih.gov/pubmed/33978747 http://dx.doi.org/10.1093/bioinformatics/btab366 |
_version_ | 1784795636394098688 |
---|---|
author | Grassi, Luigi Harris, Claire Zhu, Jie Hardman, Colin Hatton, Diane |
author_facet | Grassi, Luigi Harris, Claire Zhu, Jie Hardman, Colin Hatton, Diane |
author_sort | Grassi, Luigi |
collection | PubMed |
description | MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integration of the plasmid into the host genome. Rapid identification of the integration sites is highly desirable in order to characterize these engineered cell lines. RESULTS: We developed detectIS: a pipeline specifically designed to identify genomic integration sites of exogenous DNA, either a plasmid containing one or more transgenes or a virus. The pipeline is based on a Nextflow workflow combined with a Singularity image containing all the necessary software, ensuring high reproducibility and scalability of the analysis. We tested it on simulated datasets and RNA-seq data from a human sample infected with Hepatitis B virus. Comparisons with other state of the art tools show that our method can identify the integration site in different recombinant cell lines, with accurate results, lower computational demand and shorter execution times. AVAILABILITY AND IMPLEMENTATION: The Nextflow workflow, the Singularity image and a test dataset are available at https://github.com/AstraZeneca/detectIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9502153 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-95021532022-09-26 DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data Grassi, Luigi Harris, Claire Zhu, Jie Hardman, Colin Hatton, Diane Bioinformatics Applications Notes MOTIVATION: Recombinant DNA technology is widely used for different applications in biology, medicine and bio-technology. Viral transduction and plasmid transfection are among the most frequently used techniques to generate recombinant cell lines. Many of these methods result in the random integration of the plasmid into the host genome. Rapid identification of the integration sites is highly desirable in order to characterize these engineered cell lines. RESULTS: We developed detectIS: a pipeline specifically designed to identify genomic integration sites of exogenous DNA, either a plasmid containing one or more transgenes or a virus. The pipeline is based on a Nextflow workflow combined with a Singularity image containing all the necessary software, ensuring high reproducibility and scalability of the analysis. We tested it on simulated datasets and RNA-seq data from a human sample infected with Hepatitis B virus. Comparisons with other state of the art tools show that our method can identify the integration site in different recombinant cell lines, with accurate results, lower computational demand and shorter execution times. AVAILABILITY AND IMPLEMENTATION: The Nextflow workflow, the Singularity image and a test dataset are available at https://github.com/AstraZeneca/detectIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-12 /pmc/articles/PMC9502153/ /pubmed/33978747 http://dx.doi.org/10.1093/bioinformatics/btab366 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Grassi, Luigi Harris, Claire Zhu, Jie Hardman, Colin Hatton, Diane DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title | DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title_full | DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title_fullStr | DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title_full_unstemmed | DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title_short | DetectIS: a pipeline to rapidly detect exogenous DNA integration sites using DNA or RNA paired-end sequencing data |
title_sort | detectis: a pipeline to rapidly detect exogenous dna integration sites using dna or rna paired-end sequencing data |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502153/ https://www.ncbi.nlm.nih.gov/pubmed/33978747 http://dx.doi.org/10.1093/bioinformatics/btab366 |
work_keys_str_mv | AT grassiluigi detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata AT harrisclaire detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata AT zhujie detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata AT hardmancolin detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata AT hattondiane detectisapipelinetorapidlydetectexogenousdnaintegrationsitesusingdnaorrnapairedendsequencingdata |