Cargando…

LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences

BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformati...

Descripción completa

Detalles Bibliográficos
Autores principales: Disdero, Eric, Filée, Jonathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5385071/
https://www.ncbi.nlm.nih.gov/pubmed/28405230
http://dx.doi.org/10.1186/s13100-017-0088-x
_version_ 1782520538778828800
author Disdero, Eric
Filée, Jonathan
author_facet Disdero, Eric
Filée, Jonathan
author_sort Disdero, Eric
collection PubMed
description BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. RESULTS: LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. CONCLUSIONS: LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422
format Online
Article
Text
id pubmed-5385071
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53850712017-04-12 LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences Disdero, Eric Filée, Jonathan Mob DNA Software BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. RESULTS: LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. CONCLUSIONS: LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422 BioMed Central 2017-04-08 /pmc/articles/PMC5385071/ /pubmed/28405230 http://dx.doi.org/10.1186/s13100-017-0088-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Disdero, Eric
Filée, Jonathan
LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title_full LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title_fullStr LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title_full_unstemmed LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title_short LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
title_sort lorte: detecting transposon-induced genomic variants using low coverage pacbio long read sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5385071/
https://www.ncbi.nlm.nih.gov/pubmed/28405230
http://dx.doi.org/10.1186/s13100-017-0088-x
work_keys_str_mv AT disderoeric lortedetectingtransposoninducedgenomicvariantsusinglowcoveragepacbiolongreadsequences
AT fileejonathan lortedetectingtransposoninducedgenomicvariantsusinglowcoveragepacbiolongreadsequences