Cargando…

STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data

The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of...

Descripción completa

Detalles Bibliográficos
Autores principales: Santander, Cindy G., Gambron, Philippe, Marchi, Emanuele, Karamitros, Timokratis, Katzourakis, Aris, Magiorkinis, Gkikas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5597868/
https://www.ncbi.nlm.nih.gov/pubmed/28948042
http://dx.doi.org/10.1093/ve/vex023
_version_ 1783263786275176448
author Santander, Cindy G.
Gambron, Philippe
Marchi, Emanuele
Karamitros, Timokratis
Katzourakis, Aris
Magiorkinis, Gkikas
author_facet Santander, Cindy G.
Gambron, Philippe
Marchi, Emanuele
Karamitros, Timokratis
Katzourakis, Aris
Magiorkinis, Gkikas
author_sort Santander, Cindy G.
collection PubMed
description The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of these has been the inability to cope with repetitive regions, specifically to validate structural variants and accordingly assess their role in disease. Here we describe our program STEAK, a massively parallel software designed to detect chimeric reads in high-throughput sequencing data for a broad number of applications such as identifying presence/absence, as well as discovery of transposable elements (TEs), and retroviral integrations. We highlight the capabilities of STEAK by comparing its efficacy in locating HERV-K HML-2 in clinical whole genome projects, target enrichment sequences, and in the 1000 Genomes CEU Trio to the performance of other TE and virus detecting tools. We show that STEAK outperforms other software in terms of computational efficiency, sensitivity, and specificity. We demonstrate that STEAK is a robust tool, which allows analysts to flexibly detect and evaluate TE and retroviral integrations in a diverse range of sequencing projects for both research and clinical purposes.
format Online
Article
Text
id pubmed-5597868
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55978682017-09-25 STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data Santander, Cindy G. Gambron, Philippe Marchi, Emanuele Karamitros, Timokratis Katzourakis, Aris Magiorkinis, Gkikas Virus Evol Resources The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of these has been the inability to cope with repetitive regions, specifically to validate structural variants and accordingly assess their role in disease. Here we describe our program STEAK, a massively parallel software designed to detect chimeric reads in high-throughput sequencing data for a broad number of applications such as identifying presence/absence, as well as discovery of transposable elements (TEs), and retroviral integrations. We highlight the capabilities of STEAK by comparing its efficacy in locating HERV-K HML-2 in clinical whole genome projects, target enrichment sequences, and in the 1000 Genomes CEU Trio to the performance of other TE and virus detecting tools. We show that STEAK outperforms other software in terms of computational efficiency, sensitivity, and specificity. We demonstrate that STEAK is a robust tool, which allows analysts to flexibly detect and evaluate TE and retroviral integrations in a diverse range of sequencing projects for both research and clinical purposes. Oxford University Press 2017-08-21 /pmc/articles/PMC5597868/ /pubmed/28948042 http://dx.doi.org/10.1093/ve/vex023 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Santander, Cindy G.
Gambron, Philippe
Marchi, Emanuele
Karamitros, Timokratis
Katzourakis, Aris
Magiorkinis, Gkikas
STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title_full STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title_fullStr STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title_full_unstemmed STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title_short STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
title_sort steak: a specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5597868/
https://www.ncbi.nlm.nih.gov/pubmed/28948042
http://dx.doi.org/10.1093/ve/vex023
work_keys_str_mv AT santandercindyg steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata
AT gambronphilippe steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata
AT marchiemanuele steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata
AT karamitrostimokratis steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata
AT katzourakisaris steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata
AT magiorkinisgkikas steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata