Cargando…
Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures
Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conju...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Microbiology Society
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725325/ https://www.ncbi.nlm.nih.gov/pubmed/33074084 http://dx.doi.org/10.1099/mgen.0.000459 |
_version_ | 1783620682542743552 |
---|---|
author | Fang, Zhencheng Zhou, Hongwei |
author_facet | Fang, Zhencheng Zhou, Hongwei |
author_sort | Fang, Zhencheng |
collection | PubMed |
description | Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conjugative or at least mobilizable plasmids, from non-transmissible plasmid sequences in the plasmidome is essential for understanding the diversity of plasmids and how they regulate the microbial community. Unfortunately, due to the highly fragmented characteristics of DNA sequences in the plasmidome, effective identification methods are lacking. In this work, we used information entropy from information theory to assess the randomness of synonymous codon usage over 4424 plasmid genomes. The results showed that for all amino acids, the choice of a synonymous codon in conjugative and mobilizable plasmids is more random than that in non-transmissible plasmids, indicating that transmissible plasmids have different sequence signatures from non-transmissible plasmids. Inspired by this phenomenon, we further developed a novel algorithm named PlasTrans. PlasTrans takes the triplet code sequences and base sequences of plasmid DNA fragments as input and uses the convolutional neural network of the deep learning technique to further extract the more complex signatures of the plasmid sequences and identify the conjugative and mobilizable DNA fragments. Tests showed that PlasTrans could achieve an AUC of as high as 84–91%, even though the fragments only contained hundreds of base pairs. To the best of our knowledge, this is the first quantitative analysis of the difference in sequence signatures between transmissible and non-transmissible plasmids, and we developed the first tool to perform transferability annotation for DNA fragments in the plasmidome. We expect that PlasTrans will be a useful tool for researchers who analyse the properties of novel plasmids in the microbial community and horizontal gene transfer, especially the spread of resistance genes and virulence factors associated with plasmids. PlasTrans is freely available via https://github.com/zhenchengfang/PlasTrans |
format | Online Article Text |
id | pubmed-7725325 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Microbiology Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-77253252020-12-14 Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures Fang, Zhencheng Zhou, Hongwei Microb Genom Research Article Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conjugative or at least mobilizable plasmids, from non-transmissible plasmid sequences in the plasmidome is essential for understanding the diversity of plasmids and how they regulate the microbial community. Unfortunately, due to the highly fragmented characteristics of DNA sequences in the plasmidome, effective identification methods are lacking. In this work, we used information entropy from information theory to assess the randomness of synonymous codon usage over 4424 plasmid genomes. The results showed that for all amino acids, the choice of a synonymous codon in conjugative and mobilizable plasmids is more random than that in non-transmissible plasmids, indicating that transmissible plasmids have different sequence signatures from non-transmissible plasmids. Inspired by this phenomenon, we further developed a novel algorithm named PlasTrans. PlasTrans takes the triplet code sequences and base sequences of plasmid DNA fragments as input and uses the convolutional neural network of the deep learning technique to further extract the more complex signatures of the plasmid sequences and identify the conjugative and mobilizable DNA fragments. Tests showed that PlasTrans could achieve an AUC of as high as 84–91%, even though the fragments only contained hundreds of base pairs. To the best of our knowledge, this is the first quantitative analysis of the difference in sequence signatures between transmissible and non-transmissible plasmids, and we developed the first tool to perform transferability annotation for DNA fragments in the plasmidome. We expect that PlasTrans will be a useful tool for researchers who analyse the properties of novel plasmids in the microbial community and horizontal gene transfer, especially the spread of resistance genes and virulence factors associated with plasmids. PlasTrans is freely available via https://github.com/zhenchengfang/PlasTrans Microbiology Society 2020-10-19 /pmc/articles/PMC7725325/ /pubmed/33074084 http://dx.doi.org/10.1099/mgen.0.000459 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License. |
spellingShingle | Research Article Fang, Zhencheng Zhou, Hongwei Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title | Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title_full | Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title_fullStr | Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title_full_unstemmed | Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title_short | Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
title_sort | identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725325/ https://www.ncbi.nlm.nih.gov/pubmed/33074084 http://dx.doi.org/10.1099/mgen.0.000459 |
work_keys_str_mv | AT fangzhencheng identificationoftheconjugativeandmobilizableplasmidfragmentsintheplasmidomeusingsequencesignatures AT zhouhongwei identificationoftheconjugativeandmobilizableplasmidfragmentsintheplasmidomeusingsequencesignatures |