Cargando…

Network topology measures for identifying disease-gene association in breast cancer

BACKGROUND: Massive biological datasets are generated in different locations all over the world. Analysis of these datasets is required in order to extract knowledge that might be helpful for biologists, physicians and pharmacists. Recently, analysis of biological networks has received a lot of atte...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramadan, Emad, Alinsaif, Sadiq, Hassan, Md Rafiul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965731/
https://www.ncbi.nlm.nih.gov/pubmed/27454166
http://dx.doi.org/10.1186/s12859-016-1095-5
_version_ 1782445304311709696
author Ramadan, Emad
Alinsaif, Sadiq
Hassan, Md Rafiul
author_facet Ramadan, Emad
Alinsaif, Sadiq
Hassan, Md Rafiul
author_sort Ramadan, Emad
collection PubMed
description BACKGROUND: Massive biological datasets are generated in different locations all over the world. Analysis of these datasets is required in order to extract knowledge that might be helpful for biologists, physicians and pharmacists. Recently, analysis of biological networks has received a lot of attention, as an understanding of the network can reveal information about life at the cellular level. Biological networks can be generated that examine the interaction between proteins or the relationship amongst different genes at the expression level. Identifying information from biological networks is recognized as a significant challenge, due to the inherent complexity of the structures. Computational techniques are used to analyze such complex networks with varying success. RESULTS: In this paper, we construct a new method for predicting phenotype-gene association in breast cancer using biological network analysis. Several network topological measures have been computed and fed as features into two classification models to investigate phenotype-gene association in breast cancer. More importantly, to overcome the problem of the skewed datasets, a synthetic minority oversampling technique (SMOTE) is adapted in order to transform an imbalanced dataset to a balanced one. We have applied our method on the gene co-expression network (GCN), protein–protein interaction network (PPI), and the integrated functional interaction network (FI), which combined the PPIs and gene co-expression, amongst others. We assess the quality of our proposed method using a slightly modified cross-validation. CONCLUSIONS: Our method can identify phenotype-gene association in breast cancer. Moreover, use of the integrated functional interaction network (FI) has the potential to reveal more information and hidden patterns than the other networks. The software and accompanying examples are freely available at http://faculty.kfupm.edu.sa/ics/eramadan/NetTop.zip.
format Online
Article
Text
id pubmed-4965731
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49657312016-08-02 Network topology measures for identifying disease-gene association in breast cancer Ramadan, Emad Alinsaif, Sadiq Hassan, Md Rafiul BMC Bioinformatics Research BACKGROUND: Massive biological datasets are generated in different locations all over the world. Analysis of these datasets is required in order to extract knowledge that might be helpful for biologists, physicians and pharmacists. Recently, analysis of biological networks has received a lot of attention, as an understanding of the network can reveal information about life at the cellular level. Biological networks can be generated that examine the interaction between proteins or the relationship amongst different genes at the expression level. Identifying information from biological networks is recognized as a significant challenge, due to the inherent complexity of the structures. Computational techniques are used to analyze such complex networks with varying success. RESULTS: In this paper, we construct a new method for predicting phenotype-gene association in breast cancer using biological network analysis. Several network topological measures have been computed and fed as features into two classification models to investigate phenotype-gene association in breast cancer. More importantly, to overcome the problem of the skewed datasets, a synthetic minority oversampling technique (SMOTE) is adapted in order to transform an imbalanced dataset to a balanced one. We have applied our method on the gene co-expression network (GCN), protein–protein interaction network (PPI), and the integrated functional interaction network (FI), which combined the PPIs and gene co-expression, amongst others. We assess the quality of our proposed method using a slightly modified cross-validation. CONCLUSIONS: Our method can identify phenotype-gene association in breast cancer. Moreover, use of the integrated functional interaction network (FI) has the potential to reveal more information and hidden patterns than the other networks. The software and accompanying examples are freely available at http://faculty.kfupm.edu.sa/ics/eramadan/NetTop.zip. BioMed Central 2016-07-25 /pmc/articles/PMC4965731/ /pubmed/27454166 http://dx.doi.org/10.1186/s12859-016-1095-5 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ramadan, Emad
Alinsaif, Sadiq
Hassan, Md Rafiul
Network topology measures for identifying disease-gene association in breast cancer
title Network topology measures for identifying disease-gene association in breast cancer
title_full Network topology measures for identifying disease-gene association in breast cancer
title_fullStr Network topology measures for identifying disease-gene association in breast cancer
title_full_unstemmed Network topology measures for identifying disease-gene association in breast cancer
title_short Network topology measures for identifying disease-gene association in breast cancer
title_sort network topology measures for identifying disease-gene association in breast cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965731/
https://www.ncbi.nlm.nih.gov/pubmed/27454166
http://dx.doi.org/10.1186/s12859-016-1095-5
work_keys_str_mv AT ramadanemad networktopologymeasuresforidentifyingdiseasegeneassociationinbreastcancer
AT alinsaifsadiq networktopologymeasuresforidentifyingdiseasegeneassociationinbreastcancer
AT hassanmdrafiul networktopologymeasuresforidentifyingdiseasegeneassociationinbreastcancer