Cargando…

Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer

Horizontal gene transfer (HGT) plays an important role in the evolution of microbial organisms including bacteria. Alignment-free methods based on single genome compositional information have been used to detect HGT. Currently, Manhattan and Euclidean distances based on tetranucleotide frequencies a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Kujin, Lu, Yang Young, Sun, Fengzhu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5911508/
https://www.ncbi.nlm.nih.gov/pubmed/29713314
http://dx.doi.org/10.3389/fmicb.2018.00711
_version_ 1783316223092588544
author Tang, Kujin
Lu, Yang Young
Sun, Fengzhu
author_facet Tang, Kujin
Lu, Yang Young
Sun, Fengzhu
author_sort Tang, Kujin
collection PubMed
description Horizontal gene transfer (HGT) plays an important role in the evolution of microbial organisms including bacteria. Alignment-free methods based on single genome compositional information have been used to detect HGT. Currently, Manhattan and Euclidean distances based on tetranucleotide frequencies are the most commonly used alignment-free dissimilarity measures to detect HGT. By testing on simulated bacterial sequences and real data sets with known horizontal transferred genomic regions, we found that more advanced alignment-free dissimilarity measures such as CVTree and [Formula: see text] that take into account the background Markov sequences can solve HGT detection problems with significantly improved performance. We also studied the influence of different factors such as evolutionary distance between host and donor sequences, size of sliding window, and host genome composition on the performances of alignment-free methods to detect HGT. Our study showed that alignment-free methods can predict HGT accurately when host and donor genomes are in different order levels. Among all methods, CVTree with word length of 3, [Formula: see text] with word length 3, Markov order 1 and [Formula: see text] with word length 4, Markov order 1 outperform others in terms of their highest F(1)-score and their robustness under the influence of different factors.
format Online
Article
Text
id pubmed-5911508
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59115082018-04-30 Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer Tang, Kujin Lu, Yang Young Sun, Fengzhu Front Microbiol Microbiology Horizontal gene transfer (HGT) plays an important role in the evolution of microbial organisms including bacteria. Alignment-free methods based on single genome compositional information have been used to detect HGT. Currently, Manhattan and Euclidean distances based on tetranucleotide frequencies are the most commonly used alignment-free dissimilarity measures to detect HGT. By testing on simulated bacterial sequences and real data sets with known horizontal transferred genomic regions, we found that more advanced alignment-free dissimilarity measures such as CVTree and [Formula: see text] that take into account the background Markov sequences can solve HGT detection problems with significantly improved performance. We also studied the influence of different factors such as evolutionary distance between host and donor sequences, size of sliding window, and host genome composition on the performances of alignment-free methods to detect HGT. Our study showed that alignment-free methods can predict HGT accurately when host and donor genomes are in different order levels. Among all methods, CVTree with word length of 3, [Formula: see text] with word length 3, Markov order 1 and [Formula: see text] with word length 4, Markov order 1 outperform others in terms of their highest F(1)-score and their robustness under the influence of different factors. Frontiers Media S.A. 2018-04-16 /pmc/articles/PMC5911508/ /pubmed/29713314 http://dx.doi.org/10.3389/fmicb.2018.00711 Text en Copyright © 2018 Tang, Lu and Sun. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Tang, Kujin
Lu, Yang Young
Sun, Fengzhu
Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title_full Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title_fullStr Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title_full_unstemmed Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title_short Background Adjusted Alignment-Free Dissimilarity Measures Improve the Detection of Horizontal Gene Transfer
title_sort background adjusted alignment-free dissimilarity measures improve the detection of horizontal gene transfer
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5911508/
https://www.ncbi.nlm.nih.gov/pubmed/29713314
http://dx.doi.org/10.3389/fmicb.2018.00711
work_keys_str_mv AT tangkujin backgroundadjustedalignmentfreedissimilaritymeasuresimprovethedetectionofhorizontalgenetransfer
AT luyangyoung backgroundadjustedalignmentfreedissimilaritymeasuresimprovethedetectionofhorizontalgenetransfer
AT sunfengzhu backgroundadjustedalignmentfreedissimilaritymeasuresimprovethedetectionofhorizontalgenetransfer