Cargando…

Wavelet-domain elastic net for clustering on genomes strains

We propose to evaluate genome similarity by combining discrete non-decimated wavelet transform (NDWT) and elastic net. The wavelets represent a signal with levels of detail, that is, hidden components are detected by means of the decomposition of this signal, where each level provides a different ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferreira, Leila Maria, Sáfadi, Thelma, Ferreira, Juliano Lino
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Sociedade Brasileira de Genética 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6415607/
https://www.ncbi.nlm.nih.gov/pubmed/30508009
http://dx.doi.org/10.1590/1678-4685-GMB-2018-0035
_version_ 1783403219827818496
author Ferreira, Leila Maria
Sáfadi, Thelma
Ferreira, Juliano Lino
author_facet Ferreira, Leila Maria
Sáfadi, Thelma
Ferreira, Juliano Lino
author_sort Ferreira, Leila Maria
collection PubMed
description We propose to evaluate genome similarity by combining discrete non-decimated wavelet transform (NDWT) and elastic net. The wavelets represent a signal with levels of detail, that is, hidden components are detected by means of the decomposition of this signal, where each level provides a different characteristic. The main feature of the elastic net is the grouping of correlated variables where the number of predictors is greater than the number of observations. The combination of these two methodologies applied in the clustering analysis of the Mycobacterium tuberculosis genome strains proved very effective, being able to identify clusters at each level of decomposition.
format Online
Article
Text
id pubmed-6415607
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Sociedade Brasileira de Genética
record_format MEDLINE/PubMed
spelling pubmed-64156072019-03-21 Wavelet-domain elastic net for clustering on genomes strains Ferreira, Leila Maria Sáfadi, Thelma Ferreira, Juliano Lino Genet Mol Biol Genomics and Bioinformatics We propose to evaluate genome similarity by combining discrete non-decimated wavelet transform (NDWT) and elastic net. The wavelets represent a signal with levels of detail, that is, hidden components are detected by means of the decomposition of this signal, where each level provides a different characteristic. The main feature of the elastic net is the grouping of correlated variables where the number of predictors is greater than the number of observations. The combination of these two methodologies applied in the clustering analysis of the Mycobacterium tuberculosis genome strains proved very effective, being able to identify clusters at each level of decomposition. Sociedade Brasileira de Genética 2018-11-29 2018 /pmc/articles/PMC6415607/ /pubmed/30508009 http://dx.doi.org/10.1590/1678-4685-GMB-2018-0035 Text en Copyright © 2018, Sociedade Brasileira de Genética. https://creativecommons.org/licenses/by/4.0/ License information: This is an open-access article distributed under the terms of the Creative Commons Attribution License (type CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original article is properly cited.
spellingShingle Genomics and Bioinformatics
Ferreira, Leila Maria
Sáfadi, Thelma
Ferreira, Juliano Lino
Wavelet-domain elastic net for clustering on genomes strains
title Wavelet-domain elastic net for clustering on genomes strains
title_full Wavelet-domain elastic net for clustering on genomes strains
title_fullStr Wavelet-domain elastic net for clustering on genomes strains
title_full_unstemmed Wavelet-domain elastic net for clustering on genomes strains
title_short Wavelet-domain elastic net for clustering on genomes strains
title_sort wavelet-domain elastic net for clustering on genomes strains
topic Genomics and Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6415607/
https://www.ncbi.nlm.nih.gov/pubmed/30508009
http://dx.doi.org/10.1590/1678-4685-GMB-2018-0035
work_keys_str_mv AT ferreiraleilamaria waveletdomainelasticnetforclusteringongenomesstrains
AT safadithelma waveletdomainelasticnetforclusteringongenomesstrains
AT ferreirajulianolino waveletdomainelasticnetforclusteringongenomesstrains