Cargando…

TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data

Much of the worldwide dissemination of antibiotic resistance has been driven by resistance gene associations with mobile genetic elements (MGEs), such as plasmids and transposons. Although increasing, our understanding of resistance spread remains relatively limited, as methods for tracking mobile r...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheppard, Anna E., Stoesser, Nicole, German-Mesner, Ian, Vegesana, Kasi, Sarah Walker, A., Crook, Derrick W., Mathers, Amy J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412039/
https://www.ncbi.nlm.nih.gov/pubmed/30465646
http://dx.doi.org/10.1099/mgen.0.000232
_version_ 1783402512389242880
author Sheppard, Anna E.
Stoesser, Nicole
German-Mesner, Ian
Vegesana, Kasi
Sarah Walker, A.
Crook, Derrick W.
Mathers, Amy J.
author_facet Sheppard, Anna E.
Stoesser, Nicole
German-Mesner, Ian
Vegesana, Kasi
Sarah Walker, A.
Crook, Derrick W.
Mathers, Amy J.
author_sort Sheppard, Anna E.
collection PubMed
description Much of the worldwide dissemination of antibiotic resistance has been driven by resistance gene associations with mobile genetic elements (MGEs), such as plasmids and transposons. Although increasing, our understanding of resistance spread remains relatively limited, as methods for tracking mobile resistance genes through multiple species, strains and plasmids are lacking. We have developed a bioinformatic pipeline for tracking variation within, and mobility of, specific transposable elements (TEs), such as transposons carrying antibiotic-resistance genes. TETyper takes short-read whole-genome sequencing data as input and identifies single-nucleotide mutations and deletions within the TE of interest, to enable tracking of specific sequence variants, as well as the surrounding genetic context(s), to enable identification of transposition events. A major advantage of TETyper over previous methods is that it does not require a genome reference. To investigate global dissemination of Klebsiella pneumoniae carbapenemase (KPC) and its associated transposon Tn4401, we applied TETyper to a collection of over 3000 publicly available Illumina datasets containing bla(KPC). This revealed surprising diversity, with over 200 distinct flanking genetic contexts for Tn4401, indicating high levels of transposition. Integration of sample metadata revealed insights into associations between geographic locations, host species, Tn4401 sequence variants and flanking genetic contexts. To demonstrate the ability of TETyper to cope with high-copy-number TEs and to track specific short-term evolutionary changes, we also applied it to the insertion sequence IS26 within a defined K. pneumoniae outbreak. TETyper is implemented in python and is freely available at https://github.com/aesheppard/TETyper.
format Online
Article
Text
id pubmed-6412039
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-64120392019-03-12 TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data Sheppard, Anna E. Stoesser, Nicole German-Mesner, Ian Vegesana, Kasi Sarah Walker, A. Crook, Derrick W. Mathers, Amy J. Microb Genom Methods Paper Much of the worldwide dissemination of antibiotic resistance has been driven by resistance gene associations with mobile genetic elements (MGEs), such as plasmids and transposons. Although increasing, our understanding of resistance spread remains relatively limited, as methods for tracking mobile resistance genes through multiple species, strains and plasmids are lacking. We have developed a bioinformatic pipeline for tracking variation within, and mobility of, specific transposable elements (TEs), such as transposons carrying antibiotic-resistance genes. TETyper takes short-read whole-genome sequencing data as input and identifies single-nucleotide mutations and deletions within the TE of interest, to enable tracking of specific sequence variants, as well as the surrounding genetic context(s), to enable identification of transposition events. A major advantage of TETyper over previous methods is that it does not require a genome reference. To investigate global dissemination of Klebsiella pneumoniae carbapenemase (KPC) and its associated transposon Tn4401, we applied TETyper to a collection of over 3000 publicly available Illumina datasets containing bla(KPC). This revealed surprising diversity, with over 200 distinct flanking genetic contexts for Tn4401, indicating high levels of transposition. Integration of sample metadata revealed insights into associations between geographic locations, host species, Tn4401 sequence variants and flanking genetic contexts. To demonstrate the ability of TETyper to cope with high-copy-number TEs and to track specific short-term evolutionary changes, we also applied it to the insertion sequence IS26 within a defined K. pneumoniae outbreak. TETyper is implemented in python and is freely available at https://github.com/aesheppard/TETyper. Microbiology Society 2018-11-22 /pmc/articles/PMC6412039/ /pubmed/30465646 http://dx.doi.org/10.1099/mgen.0.000232 Text en © 2018 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Paper
Sheppard, Anna E.
Stoesser, Nicole
German-Mesner, Ian
Vegesana, Kasi
Sarah Walker, A.
Crook, Derrick W.
Mathers, Amy J.
TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title_full TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title_fullStr TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title_full_unstemmed TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title_short TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
title_sort tetyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data
topic Methods Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412039/
https://www.ncbi.nlm.nih.gov/pubmed/30465646
http://dx.doi.org/10.1099/mgen.0.000232
work_keys_str_mv AT sheppardannae tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT stoessernicole tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT germanmesnerian tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT vegesanakasi tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT sarahwalkera tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT crookderrickw tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata
AT mathersamyj tetyperabioinformaticpipelineforclassifyingvariationandgeneticcontextsoftransposableelementsfromshortreadwholegenomesequencingdata