Cargando…
STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data
The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5597868/ https://www.ncbi.nlm.nih.gov/pubmed/28948042 http://dx.doi.org/10.1093/ve/vex023 |
_version_ | 1783263786275176448 |
---|---|
author | Santander, Cindy G. Gambron, Philippe Marchi, Emanuele Karamitros, Timokratis Katzourakis, Aris Magiorkinis, Gkikas |
author_facet | Santander, Cindy G. Gambron, Philippe Marchi, Emanuele Karamitros, Timokratis Katzourakis, Aris Magiorkinis, Gkikas |
author_sort | Santander, Cindy G. |
collection | PubMed |
description | The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of these has been the inability to cope with repetitive regions, specifically to validate structural variants and accordingly assess their role in disease. Here we describe our program STEAK, a massively parallel software designed to detect chimeric reads in high-throughput sequencing data for a broad number of applications such as identifying presence/absence, as well as discovery of transposable elements (TEs), and retroviral integrations. We highlight the capabilities of STEAK by comparing its efficacy in locating HERV-K HML-2 in clinical whole genome projects, target enrichment sequences, and in the 1000 Genomes CEU Trio to the performance of other TE and virus detecting tools. We show that STEAK outperforms other software in terms of computational efficiency, sensitivity, and specificity. We demonstrate that STEAK is a robust tool, which allows analysts to flexibly detect and evaluate TE and retroviral integrations in a diverse range of sequencing projects for both research and clinical purposes. |
format | Online Article Text |
id | pubmed-5597868 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-55978682017-09-25 STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data Santander, Cindy G. Gambron, Philippe Marchi, Emanuele Karamitros, Timokratis Katzourakis, Aris Magiorkinis, Gkikas Virus Evol Resources The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of these has been the inability to cope with repetitive regions, specifically to validate structural variants and accordingly assess their role in disease. Here we describe our program STEAK, a massively parallel software designed to detect chimeric reads in high-throughput sequencing data for a broad number of applications such as identifying presence/absence, as well as discovery of transposable elements (TEs), and retroviral integrations. We highlight the capabilities of STEAK by comparing its efficacy in locating HERV-K HML-2 in clinical whole genome projects, target enrichment sequences, and in the 1000 Genomes CEU Trio to the performance of other TE and virus detecting tools. We show that STEAK outperforms other software in terms of computational efficiency, sensitivity, and specificity. We demonstrate that STEAK is a robust tool, which allows analysts to flexibly detect and evaluate TE and retroviral integrations in a diverse range of sequencing projects for both research and clinical purposes. Oxford University Press 2017-08-21 /pmc/articles/PMC5597868/ /pubmed/28948042 http://dx.doi.org/10.1093/ve/vex023 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Resources Santander, Cindy G. Gambron, Philippe Marchi, Emanuele Karamitros, Timokratis Katzourakis, Aris Magiorkinis, Gkikas STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title | STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title_full | STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title_fullStr | STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title_full_unstemmed | STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title_short | STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
title_sort | steak: a specific tool for transposable elements and retrovirus detection in high-throughput sequencing data |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5597868/ https://www.ncbi.nlm.nih.gov/pubmed/28948042 http://dx.doi.org/10.1093/ve/vex023 |
work_keys_str_mv | AT santandercindyg steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata AT gambronphilippe steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata AT marchiemanuele steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata AT karamitrostimokratis steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata AT katzourakisaris steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata AT magiorkinisgkikas steakaspecifictoolfortransposableelementsandretrovirusdetectioninhighthroughputsequencingdata |