Cargando…

Human polyomaviruses identification by logic mining techniques

BACKGROUND: Differences in genomic sequences are crucial for the classification of viruses into different species. In this work, viral DNA sequences belonging to the human polyomaviruses BKPyV, JCPyV, KIPyV, WUPyV, and MCPyV are analyzed using a logic data mining method in order to identify the nucl...

Descripción completa

Detalles Bibliográficos
Autores principales: Weitschek, Emanuel, Lo Presti, Alessandra, Drovandi, Guido, Felici, Giovanni, Ciccozzi, Massimo, Ciotti, Marco, Bertolazzi, Paola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3307486/
https://www.ncbi.nlm.nih.gov/pubmed/22385517
http://dx.doi.org/10.1186/1743-422X-9-58
_version_ 1782227333811273728
author Weitschek, Emanuel
Lo Presti, Alessandra
Drovandi, Guido
Felici, Giovanni
Ciccozzi, Massimo
Ciotti, Marco
Bertolazzi, Paola
author_facet Weitschek, Emanuel
Lo Presti, Alessandra
Drovandi, Guido
Felici, Giovanni
Ciccozzi, Massimo
Ciotti, Marco
Bertolazzi, Paola
author_sort Weitschek, Emanuel
collection PubMed
description BACKGROUND: Differences in genomic sequences are crucial for the classification of viruses into different species. In this work, viral DNA sequences belonging to the human polyomaviruses BKPyV, JCPyV, KIPyV, WUPyV, and MCPyV are analyzed using a logic data mining method in order to identify the nucleotides which are able to distinguish the five different human polyomaviruses. RESULTS: The approach presented in this work is successful as it discovers several logic rules that effectively characterize the different five studied polyomaviruses. The individuated logic rules are able to separate precisely one viral type from the other and to assign an unknown DNA sequence to one of the five analyzed polyomaviruses. CONCLUSIONS: The data mining analysis is performed by considering the complete sequences of the viruses and the sequences of the different gene regions separately, obtaining in both cases extremely high correct recognition rates.
format Online
Article
Text
id pubmed-3307486
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33074862012-03-20 Human polyomaviruses identification by logic mining techniques Weitschek, Emanuel Lo Presti, Alessandra Drovandi, Guido Felici, Giovanni Ciccozzi, Massimo Ciotti, Marco Bertolazzi, Paola Virol J Methodology BACKGROUND: Differences in genomic sequences are crucial for the classification of viruses into different species. In this work, viral DNA sequences belonging to the human polyomaviruses BKPyV, JCPyV, KIPyV, WUPyV, and MCPyV are analyzed using a logic data mining method in order to identify the nucleotides which are able to distinguish the five different human polyomaviruses. RESULTS: The approach presented in this work is successful as it discovers several logic rules that effectively characterize the different five studied polyomaviruses. The individuated logic rules are able to separate precisely one viral type from the other and to assign an unknown DNA sequence to one of the five analyzed polyomaviruses. CONCLUSIONS: The data mining analysis is performed by considering the complete sequences of the viruses and the sequences of the different gene regions separately, obtaining in both cases extremely high correct recognition rates. BioMed Central 2012-03-02 /pmc/articles/PMC3307486/ /pubmed/22385517 http://dx.doi.org/10.1186/1743-422X-9-58 Text en Copyright ©2012 Weitschek et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Weitschek, Emanuel
Lo Presti, Alessandra
Drovandi, Guido
Felici, Giovanni
Ciccozzi, Massimo
Ciotti, Marco
Bertolazzi, Paola
Human polyomaviruses identification by logic mining techniques
title Human polyomaviruses identification by logic mining techniques
title_full Human polyomaviruses identification by logic mining techniques
title_fullStr Human polyomaviruses identification by logic mining techniques
title_full_unstemmed Human polyomaviruses identification by logic mining techniques
title_short Human polyomaviruses identification by logic mining techniques
title_sort human polyomaviruses identification by logic mining techniques
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3307486/
https://www.ncbi.nlm.nih.gov/pubmed/22385517
http://dx.doi.org/10.1186/1743-422X-9-58
work_keys_str_mv AT weitschekemanuel humanpolyomavirusesidentificationbylogicminingtechniques
AT loprestialessandra humanpolyomavirusesidentificationbylogicminingtechniques
AT drovandiguido humanpolyomavirusesidentificationbylogicminingtechniques
AT felicigiovanni humanpolyomavirusesidentificationbylogicminingtechniques
AT ciccozzimassimo humanpolyomavirusesidentificationbylogicminingtechniques
AT ciottimarco humanpolyomavirusesidentificationbylogicminingtechniques
AT bertolazzipaola humanpolyomavirusesidentificationbylogicminingtechniques