Cargando…

Study of LZ-word distribution and its application for sequence comparison

Lempel–Ziv complexity has been widely used for sequence comparison and achieved promising results, but until now components' distribution in exhaustive history has not been studied. This paper investigated the whole distribution of LZ-words and presented a novel statistical method for sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Qi, Yan, Zhaofang, Shi, Zhuoxing, Liu, Xiaoqing, Yao, Yuhua, He, Pingan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094135/
https://www.ncbi.nlm.nih.gov/pubmed/23876763
http://dx.doi.org/10.1016/j.jtbi.2013.07.008
_version_ 1783510407811432448
author Dai, Qi
Yan, Zhaofang
Shi, Zhuoxing
Liu, Xiaoqing
Yao, Yuhua
He, Pingan
author_facet Dai, Qi
Yan, Zhaofang
Shi, Zhuoxing
Liu, Xiaoqing
Yao, Yuhua
He, Pingan
author_sort Dai, Qi
collection PubMed
description Lempel–Ziv complexity has been widely used for sequence comparison and achieved promising results, but until now components' distribution in exhaustive history has not been studied. This paper investigated the whole distribution of LZ-words and presented a novel statistical method for sequence comparison. With the components' length in mind, we revised Lempel–Ziv complexity and obtained various sets of LZ-words. Instead of calculating the LZ-words' contents, we defined a series of set operations on LZ-word set to compare biological sequences. In order to assess the effectiveness of the proposed method, we performed two sets of experiments and compared it with alignment-based methods.
format Online
Article
Text
id pubmed-7094135
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-70941352020-03-25 Study of LZ-word distribution and its application for sequence comparison Dai, Qi Yan, Zhaofang Shi, Zhuoxing Liu, Xiaoqing Yao, Yuhua He, Pingan J Theor Biol Article Lempel–Ziv complexity has been widely used for sequence comparison and achieved promising results, but until now components' distribution in exhaustive history has not been studied. This paper investigated the whole distribution of LZ-words and presented a novel statistical method for sequence comparison. With the components' length in mind, we revised Lempel–Ziv complexity and obtained various sets of LZ-words. Instead of calculating the LZ-words' contents, we defined a series of set operations on LZ-word set to compare biological sequences. In order to assess the effectiveness of the proposed method, we performed two sets of experiments and compared it with alignment-based methods. Elsevier Ltd. 2013-11-07 2013-07-19 /pmc/articles/PMC7094135/ /pubmed/23876763 http://dx.doi.org/10.1016/j.jtbi.2013.07.008 Text en Copyright © 2013 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Dai, Qi
Yan, Zhaofang
Shi, Zhuoxing
Liu, Xiaoqing
Yao, Yuhua
He, Pingan
Study of LZ-word distribution and its application for sequence comparison
title Study of LZ-word distribution and its application for sequence comparison
title_full Study of LZ-word distribution and its application for sequence comparison
title_fullStr Study of LZ-word distribution and its application for sequence comparison
title_full_unstemmed Study of LZ-word distribution and its application for sequence comparison
title_short Study of LZ-word distribution and its application for sequence comparison
title_sort study of lz-word distribution and its application for sequence comparison
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094135/
https://www.ncbi.nlm.nih.gov/pubmed/23876763
http://dx.doi.org/10.1016/j.jtbi.2013.07.008
work_keys_str_mv AT daiqi studyoflzworddistributionanditsapplicationforsequencecomparison
AT yanzhaofang studyoflzworddistributionanditsapplicationforsequencecomparison
AT shizhuoxing studyoflzworddistributionanditsapplicationforsequencecomparison
AT liuxiaoqing studyoflzworddistributionanditsapplicationforsequencecomparison
AT yaoyuhua studyoflzworddistributionanditsapplicationforsequencecomparison
AT hepingan studyoflzworddistributionanditsapplicationforsequencecomparison