Cargando…

Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures

Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conju...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Zhencheng, Zhou, Hongwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725325/
https://www.ncbi.nlm.nih.gov/pubmed/33074084
http://dx.doi.org/10.1099/mgen.0.000459
_version_ 1783620682542743552
author Fang, Zhencheng
Zhou, Hongwei
author_facet Fang, Zhencheng
Zhou, Hongwei
author_sort Fang, Zhencheng
collection PubMed
description Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conjugative or at least mobilizable plasmids, from non-transmissible plasmid sequences in the plasmidome is essential for understanding the diversity of plasmids and how they regulate the microbial community. Unfortunately, due to the highly fragmented characteristics of DNA sequences in the plasmidome, effective identification methods are lacking. In this work, we used information entropy from information theory to assess the randomness of synonymous codon usage over 4424 plasmid genomes. The results showed that for all amino acids, the choice of a synonymous codon in conjugative and mobilizable plasmids is more random than that in non-transmissible plasmids, indicating that transmissible plasmids have different sequence signatures from non-transmissible plasmids. Inspired by this phenomenon, we further developed a novel algorithm named PlasTrans. PlasTrans takes the triplet code sequences and base sequences of plasmid DNA fragments as input and uses the convolutional neural network of the deep learning technique to further extract the more complex signatures of the plasmid sequences and identify the conjugative and mobilizable DNA fragments. Tests showed that PlasTrans could achieve an AUC of as high as 84–91%, even though the fragments only contained hundreds of base pairs. To the best of our knowledge, this is the first quantitative analysis of the difference in sequence signatures between transmissible and non-transmissible plasmids, and we developed the first tool to perform transferability annotation for DNA fragments in the plasmidome. We expect that PlasTrans will be a useful tool for researchers who analyse the properties of novel plasmids in the microbial community and horizontal gene transfer, especially the spread of resistance genes and virulence factors associated with plasmids. PlasTrans is freely available via https://github.com/zhenchengfang/PlasTrans
format Online
Article
Text
id pubmed-7725325
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-77253252020-12-14 Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures Fang, Zhencheng Zhou, Hongwei Microb Genom Research Article Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conjugative or at least mobilizable plasmids, from non-transmissible plasmid sequences in the plasmidome is essential for understanding the diversity of plasmids and how they regulate the microbial community. Unfortunately, due to the highly fragmented characteristics of DNA sequences in the plasmidome, effective identification methods are lacking. In this work, we used information entropy from information theory to assess the randomness of synonymous codon usage over 4424 plasmid genomes. The results showed that for all amino acids, the choice of a synonymous codon in conjugative and mobilizable plasmids is more random than that in non-transmissible plasmids, indicating that transmissible plasmids have different sequence signatures from non-transmissible plasmids. Inspired by this phenomenon, we further developed a novel algorithm named PlasTrans. PlasTrans takes the triplet code sequences and base sequences of plasmid DNA fragments as input and uses the convolutional neural network of the deep learning technique to further extract the more complex signatures of the plasmid sequences and identify the conjugative and mobilizable DNA fragments. Tests showed that PlasTrans could achieve an AUC of as high as 84–91%, even though the fragments only contained hundreds of base pairs. To the best of our knowledge, this is the first quantitative analysis of the difference in sequence signatures between transmissible and non-transmissible plasmids, and we developed the first tool to perform transferability annotation for DNA fragments in the plasmidome. We expect that PlasTrans will be a useful tool for researchers who analyse the properties of novel plasmids in the microbial community and horizontal gene transfer, especially the spread of resistance genes and virulence factors associated with plasmids. PlasTrans is freely available via https://github.com/zhenchengfang/PlasTrans Microbiology Society 2020-10-19 /pmc/articles/PMC7725325/ /pubmed/33074084 http://dx.doi.org/10.1099/mgen.0.000459 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License.
spellingShingle Research Article
Fang, Zhencheng
Zhou, Hongwei
Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title_full Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title_fullStr Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title_full_unstemmed Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title_short Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
title_sort identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725325/
https://www.ncbi.nlm.nih.gov/pubmed/33074084
http://dx.doi.org/10.1099/mgen.0.000459
work_keys_str_mv AT fangzhencheng identificationoftheconjugativeandmobilizableplasmidfragmentsintheplasmidomeusingsequencesignatures
AT zhouhongwei identificationoftheconjugativeandmobilizableplasmidfragmentsintheplasmidomeusingsequencesignatures