Cargando…
LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences
BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformati...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5385071/ https://www.ncbi.nlm.nih.gov/pubmed/28405230 http://dx.doi.org/10.1186/s13100-017-0088-x |
_version_ | 1782520538778828800 |
---|---|
author | Disdero, Eric Filée, Jonathan |
author_facet | Disdero, Eric Filée, Jonathan |
author_sort | Disdero, Eric |
collection | PubMed |
description | BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. RESULTS: LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. CONCLUSIONS: LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422 |
format | Online Article Text |
id | pubmed-5385071 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53850712017-04-12 LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences Disdero, Eric Filée, Jonathan Mob DNA Software BACKGROUND: Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. RESULTS: LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. CONCLUSIONS: LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422 BioMed Central 2017-04-08 /pmc/articles/PMC5385071/ /pubmed/28405230 http://dx.doi.org/10.1186/s13100-017-0088-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Disdero, Eric Filée, Jonathan LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title | LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title_full | LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title_fullStr | LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title_full_unstemmed | LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title_short | LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences |
title_sort | lorte: detecting transposon-induced genomic variants using low coverage pacbio long read sequences |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5385071/ https://www.ncbi.nlm.nih.gov/pubmed/28405230 http://dx.doi.org/10.1186/s13100-017-0088-x |
work_keys_str_mv | AT disderoeric lortedetectingtransposoninducedgenomicvariantsusinglowcoveragepacbiolongreadsequences AT fileejonathan lortedetectingtransposoninducedgenomicvariantsusinglowcoveragepacbiolongreadsequences |