Graphical and numerical representations of DNA sequences: statistical aspects of similarity

New approaches aiming at a detailed similarity/dissimilarity analysis of DNA sequences are formulated. Several corrections that enrich the information which may be derived from the alignment methods are proposed. The corrections take into account the distributions along the sequences of the aligned...

Bibliographic Details
Main author: Bielińska-Wąż, Dorota
Format: Online Article Text
Language: English
Published: Springer Netherlands 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087963/
https://www.ncbi.nlm.nih.gov/pubmed/32214591
http://dx.doi.org/10.1007/s10910-011-9890-8
_version_ 1783509442165211136
author Bielińska-Wąż, Dorota
author_facet Bielińska-Wąż, Dorota
author_sort Bielińska-Wąż, Dorota
collection PubMed
description New approaches aiming at a detailed similarity/dissimilarity analysis of DNA sequences are formulated. Several corrections that enrich the information which may be derived from the alignment methods are proposed. The corrections take into account the distributions along the sequences of the aligned bases (neglected in the standard alignment methods). As a consequence, different aspects of similarity, as for example asymmetry of the gene structure, may be studied either using new similarity measures associated with four-component spectral representation of the DNA sequences or using alignment methods with corrections introduced in this paper. The corrections to the alignment methods and the statistical distribution moment-based descriptors derived from the four-component spectral representation of the DNA sequences are applied to similarity/dissimilarity studies of β-globin gene across species. The studies are supplemented by detailed similarity studies for histones H1 and H4 coding sequences. The data are described according to the latest version of the EMBL database. The work is supplemented by a concise review of the state-of-art graphical representations of DNA sequences.
format Online
Article
Text
id pubmed-7087963
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-7087963 2020-03-23 Graphical and numerical representations of DNA sequences: statistical aspects of similarity Bielińska-Wąż, Dorota J Math Chem Original Paper New approaches aiming at a detailed similarity/dissimilarity analysis of DNA sequences are formulated. Several corrections that enrich the information which may be derived from the alignment methods are proposed. The corrections take into account the distributions along the sequences of the aligned bases (neglected in the standard alignment methods). As a consequence, different aspects of similarity, as for example asymmetry of the gene structure, may be studied either using new similarity measures associated with four-component spectral representation of the DNA sequences or using alignment methods with corrections introduced in this paper. The corrections to the alignment methods and the statistical distribution moment-based descriptors derived from the four-component spectral representation of the DNA sequences are applied to similarity/dissimilarity studies of β-globin gene across species. The studies are supplemented by detailed similarity studies for histones H1 and H4 coding sequences. The data are described according to the latest version of the EMBL database. The work is supplemented by a concise review of the state-of-art graphical representations of DNA sequences. Springer Netherlands 2011-08-28 2011 /pmc/articles/PMC7087963/ /pubmed/32214591 http://dx.doi.org/10.1007/s10910-011-9890-8 Text en © The Author(s) 2011 Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle Original Paper
Bielińska-Wąż, Dorota
Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title_full Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title_fullStr Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title_full_unstemmed Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title_short Graphical and numerical representations of DNA sequences: statistical aspects of similarity
title_sort graphical and numerical representations of dna sequences: statistical aspects of similarity
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087963/
https://www.ncbi.nlm.nih.gov/pubmed/32214591
http://dx.doi.org/10.1007/s10910-011-9890-8
work_keys_str_mv AT bielinskawazdorota graphicalandnumericalrepresentationsofdnasequencesstatisticalaspectsofsimilarity