Cargando…

A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks

Determination of sequence similarity is one of the major steps in computational phylogenetic studies. One of the major tasks of computational biologists is to develop novel mathematical descriptors for similarity analysis. DNA clustering is an important technology that automatically identifies inher...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Jie, Zhong, Pianyu, Zhang, Tinghui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054945/
https://www.ncbi.nlm.nih.gov/pubmed/27746676
http://dx.doi.org/10.4137/EBO.S40474
_version_ 1782458692970479616
author Zhou, Jie
Zhong, Pianyu
Zhang, Tinghui
author_facet Zhou, Jie
Zhong, Pianyu
Zhang, Tinghui
author_sort Zhou, Jie
collection PubMed
description Determination of sequence similarity is one of the major steps in computational phylogenetic studies. One of the major tasks of computational biologists is to develop novel mathematical descriptors for similarity analysis. DNA clustering is an important technology that automatically identifies inherent relationships among large-scale DNA sequences. The comparison between the DNA sequences of different species helps determine phylogenetic relationships among species. Alignment-free approaches have continuously gained interest in various sequence analysis applications such as phylogenetic inference and metagenomic classification/clustering, particularly for large-scale sequence datasets. Here, we construct a novel and simple mathematical descriptor based on the characterization of cis sequence complex DNA networks. This new approach is based on a code of three cis nucleotides in a gene that could code for an amino acid. In particular, for each DNA sequence, we will set up a cis sequence complex network that will be used to develop a characterization vector for the analysis of mitochondrial DNA sequence phylogenetic relationships among nine species. The resulting phylogenetic relationships among the nine species were determined to be in agreement with the actual situation.
format Online
Article
Text
id pubmed-5054945
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-50549452016-10-14 A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks Zhou, Jie Zhong, Pianyu Zhang, Tinghui Evol Bioinform Online Original Research Determination of sequence similarity is one of the major steps in computational phylogenetic studies. One of the major tasks of computational biologists is to develop novel mathematical descriptors for similarity analysis. DNA clustering is an important technology that automatically identifies inherent relationships among large-scale DNA sequences. The comparison between the DNA sequences of different species helps determine phylogenetic relationships among species. Alignment-free approaches have continuously gained interest in various sequence analysis applications such as phylogenetic inference and metagenomic classification/clustering, particularly for large-scale sequence datasets. Here, we construct a novel and simple mathematical descriptor based on the characterization of cis sequence complex DNA networks. This new approach is based on a code of three cis nucleotides in a gene that could code for an amino acid. In particular, for each DNA sequence, we will set up a cis sequence complex network that will be used to develop a characterization vector for the analysis of mitochondrial DNA sequence phylogenetic relationships among nine species. The resulting phylogenetic relationships among the nine species were determined to be in agreement with the actual situation. Libertas Academica 2016-10-06 /pmc/articles/PMC5054945/ /pubmed/27746676 http://dx.doi.org/10.4137/EBO.S40474 Text en © 2016 the authors, publisher and licensee Libertas Academica Limited This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Original Research
Zhou, Jie
Zhong, Pianyu
Zhang, Tinghui
A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title_full A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title_fullStr A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title_full_unstemmed A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title_short A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks
title_sort novel method for alignment-free dna sequence similarity analysis based on the characterization of complex networks
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054945/
https://www.ncbi.nlm.nih.gov/pubmed/27746676
http://dx.doi.org/10.4137/EBO.S40474
work_keys_str_mv AT zhoujie anovelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks
AT zhongpianyu anovelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks
AT zhangtinghui anovelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks
AT zhoujie novelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks
AT zhongpianyu novelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks
AT zhangtinghui novelmethodforalignmentfreednasequencesimilarityanalysisbasedonthecharacterizationofcomplexnetworks