Cargando…

Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods

BACKGROUND: Discovering new biomarkers has a great role in improving early diagnosis of Hepatocellular carcinoma (HCC). The experimental determination of biomarkers needs a lot of time and money. This motivates this work to use in-silico prediction of biomarkers to reduce the number of experiments r...

Descripción completa

Detalles Bibliográficos
Autores principales: Abdel Samee, Nagwan M, Solouma, Nahed H, Kadah, Yasser M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3570375/
https://www.ncbi.nlm.nih.gov/pubmed/22867264
http://dx.doi.org/10.1186/1742-4682-9-34
_version_ 1782259059811942400
author Abdel Samee, Nagwan M
Solouma, Nahed H
Kadah, Yasser M
author_facet Abdel Samee, Nagwan M
Solouma, Nahed H
Kadah, Yasser M
author_sort Abdel Samee, Nagwan M
collection PubMed
description BACKGROUND: Discovering new biomarkers has a great role in improving early diagnosis of Hepatocellular carcinoma (HCC). The experimental determination of biomarkers needs a lot of time and money. This motivates this work to use in-silico prediction of biomarkers to reduce the number of experiments required for detecting new ones. This is achieved by extracting the most representative genes in microarrays of HCC. RESULTS: In this work, we provide a method for extracting the differential expressed genes, up regulated ones, that can be considered candidate biomarkers in high throughput microarrays of HCC. We examine the power of several gene selection methods (such as Pearson’s correlation coefficient, Cosine coefficient, Euclidean distance, Mutual information and Entropy with different estimators) in selecting informative genes. A biological interpretation of the highly ranked genes is done using KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, ENTREZ and DAVID (Database for Annotation, Visualization, and Integrated Discovery) databases. The top ten genes selected using Pearson’s correlation coefficient and Cosine coefficient contained six genes that have been implicated in cancer (often multiple cancers) genesis in previous studies. A fewer number of genes were obtained by the other methods (4 genes using Mutual information, 3genes using Euclidean distance and only one gene using Entropy). A better result was obtained by the utilization of a hybrid approach based on intersecting the highly ranked genes in the output of all investigated methods. This hybrid combination yielded seven genes (2 genes for HCC and 5 genes in different types of cancer) in the top ten genes of the list of intersected genes. CONCLUSIONS: To strengthen the effectiveness of the univariate selection methods, we propose a hybrid approach by intersecting several of these methods in a cascaded manner. This approach surpasses all of univariate selection methods when used individually according to biological interpretation and the examination of gene expression signal profiles.
format Online
Article
Text
id pubmed-3570375
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35703752013-02-15 Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods Abdel Samee, Nagwan M Solouma, Nahed H Kadah, Yasser M Theor Biol Med Model Research BACKGROUND: Discovering new biomarkers has a great role in improving early diagnosis of Hepatocellular carcinoma (HCC). The experimental determination of biomarkers needs a lot of time and money. This motivates this work to use in-silico prediction of biomarkers to reduce the number of experiments required for detecting new ones. This is achieved by extracting the most representative genes in microarrays of HCC. RESULTS: In this work, we provide a method for extracting the differential expressed genes, up regulated ones, that can be considered candidate biomarkers in high throughput microarrays of HCC. We examine the power of several gene selection methods (such as Pearson’s correlation coefficient, Cosine coefficient, Euclidean distance, Mutual information and Entropy with different estimators) in selecting informative genes. A biological interpretation of the highly ranked genes is done using KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, ENTREZ and DAVID (Database for Annotation, Visualization, and Integrated Discovery) databases. The top ten genes selected using Pearson’s correlation coefficient and Cosine coefficient contained six genes that have been implicated in cancer (often multiple cancers) genesis in previous studies. A fewer number of genes were obtained by the other methods (4 genes using Mutual information, 3genes using Euclidean distance and only one gene using Entropy). A better result was obtained by the utilization of a hybrid approach based on intersecting the highly ranked genes in the output of all investigated methods. This hybrid combination yielded seven genes (2 genes for HCC and 5 genes in different types of cancer) in the top ten genes of the list of intersected genes. CONCLUSIONS: To strengthen the effectiveness of the univariate selection methods, we propose a hybrid approach by intersecting several of these methods in a cascaded manner. This approach surpasses all of univariate selection methods when used individually according to biological interpretation and the examination of gene expression signal profiles. BioMed Central 2012-08-06 /pmc/articles/PMC3570375/ /pubmed/22867264 http://dx.doi.org/10.1186/1742-4682-9-34 Text en Copyright ©2012 Abdel Samee et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Abdel Samee, Nagwan M
Solouma, Nahed H
Kadah, Yasser M
Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title_full Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title_fullStr Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title_full_unstemmed Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title_short Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods
title_sort detection of biomarkers for hepatocellular carcinoma using a hybrid univariate gene selection methods
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3570375/
https://www.ncbi.nlm.nih.gov/pubmed/22867264
http://dx.doi.org/10.1186/1742-4682-9-34
work_keys_str_mv AT abdelsameenagwanm detectionofbiomarkersforhepatocellularcarcinomausingahybridunivariategeneselectionmethods
AT soloumanahedh detectionofbiomarkersforhepatocellularcarcinomausingahybridunivariategeneselectionmethods
AT kadahyasserm detectionofbiomarkersforhepatocellularcarcinomausingahybridunivariategeneselectionmethods