Cargando…

Utilizing maximal frequent itemsets and social network analysis for HIV data analysis

Acquired immune deficiency syndrome is a deadly disease which is caused by human immunodeficiency virus (HIV). This virus attacks patients immune system and effects its ability to fight against diseases. Developing effective medicine requires understanding the life cycle and replication ability of t...

Descripción completa

Detalles Bibliográficos
Autores principales: Koçak, Yunuscan, Özyer, Tansel, Alhajj, Reda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395515/
http://dx.doi.org/10.1186/s13321-016-0184-9
_version_ 1783229874120425472
author Koçak, Yunuscan
Özyer, Tansel
Alhajj, Reda
author_facet Koçak, Yunuscan
Özyer, Tansel
Alhajj, Reda
author_sort Koçak, Yunuscan
collection PubMed
description Acquired immune deficiency syndrome is a deadly disease which is caused by human immunodeficiency virus (HIV). This virus attacks patients immune system and effects its ability to fight against diseases. Developing effective medicine requires understanding the life cycle and replication ability of the virus. HIV-1 protease enzyme is used to cleave an octamer peptide into peptides which are used to create proteins by the virus. In this paper, a novel feature extraction method is proposed for understanding important patterns in octamer’s cleavability. This feature extraction method is based on data mining techniques which are used to find important relations inside a dataset by comprehensively analyzing the given data. As demonstrated in this paper, using the extracted information in the classification process yields important results which may be taken into consideration when developing a new medicine. We have used 746 and 1625, Impens and schilling data instances from the 746-dataset. Besides, we have performed social network analysis as a complementary alternative method. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0184-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5395515
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-53955152017-05-05 Utilizing maximal frequent itemsets and social network analysis for HIV data analysis Koçak, Yunuscan Özyer, Tansel Alhajj, Reda J Cheminform Research Article Acquired immune deficiency syndrome is a deadly disease which is caused by human immunodeficiency virus (HIV). This virus attacks patients immune system and effects its ability to fight against diseases. Developing effective medicine requires understanding the life cycle and replication ability of the virus. HIV-1 protease enzyme is used to cleave an octamer peptide into peptides which are used to create proteins by the virus. In this paper, a novel feature extraction method is proposed for understanding important patterns in octamer’s cleavability. This feature extraction method is based on data mining techniques which are used to find important relations inside a dataset by comprehensively analyzing the given data. As demonstrated in this paper, using the extracted information in the classification process yields important results which may be taken into consideration when developing a new medicine. We have used 746 and 1625, Impens and schilling data instances from the 746-dataset. Besides, we have performed social network analysis as a complementary alternative method. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0184-9) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-12-09 /pmc/articles/PMC5395515/ http://dx.doi.org/10.1186/s13321-016-0184-9 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Koçak, Yunuscan
Özyer, Tansel
Alhajj, Reda
Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title_full Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title_fullStr Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title_full_unstemmed Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title_short Utilizing maximal frequent itemsets and social network analysis for HIV data analysis
title_sort utilizing maximal frequent itemsets and social network analysis for hiv data analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395515/
http://dx.doi.org/10.1186/s13321-016-0184-9
work_keys_str_mv AT kocakyunuscan utilizingmaximalfrequentitemsetsandsocialnetworkanalysisforhivdataanalysis
AT ozyertansel utilizingmaximalfrequentitemsetsandsocialnetworkanalysisforhivdataanalysis
AT alhajjreda utilizingmaximalfrequentitemsetsandsocialnetworkanalysisforhivdataanalysis