Cargando…

Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data

High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) s...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Zhi, Luke, Brian T, Tsang, Shirley X, Xing, Rui, Pan, Yuanming, Liu, Yixuan, Wang, Jinlian, Geng, Tao, Li, Jiangeng, Lu, Youyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4149392/
https://www.ncbi.nlm.nih.gov/pubmed/25210421
http://dx.doi.org/10.4137/BMI.S13059
_version_ 1782332747065327616
author Yan, Zhi
Luke, Brian T
Tsang, Shirley X
Xing, Rui
Pan, Yuanming
Liu, Yixuan
Wang, Jinlian
Geng, Tao
Li, Jiangeng
Lu, Youyong
author_facet Yan, Zhi
Luke, Brian T
Tsang, Shirley X
Xing, Rui
Pan, Yuanming
Liu, Yixuan
Wang, Jinlian
Geng, Tao
Li, Jiangeng
Lu, Youyong
author_sort Yan, Zhi
collection PubMed
description High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) samples against 20 normal tissue (NT) samples and identified 1,519 differentially expressed genes (DEGs). In this study, Classification Information Index (CII), Information Gain Index (IGI), and RELIEF algorithms are used to mine the previously reported gene expression profiling data. In all, 29 of these genes are identified by all three algorithms and are treated as GC candidate biomarkers. Three biomarkers, COL1A2, ATP4B, and HADHSC, are selected and further examined using quantitative real-time polymerase chain reaction (qRT-PCR) and immunohistochemistry (IHC) staining in two independent sets of GC and normal adjacent tissue (NAT) samples. Our study shows that COL1A2 and HADHSC are the two best biomarkers from the microarray data, distinguishing all GC from the NT, whereas ATP4B is diagnostically significant in lab tests because of its wider range of fold-changes in expression. Herein, a data-mining model applicable for small sample sizes is presented and discussed. Our result suggested that this mining model may be useful in small sample-size studies to identify putative biomarkers and potential biological features of GC.
format Online
Article
Text
id pubmed-4149392
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-41493922014-09-10 Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data Yan, Zhi Luke, Brian T Tsang, Shirley X Xing, Rui Pan, Yuanming Liu, Yixuan Wang, Jinlian Geng, Tao Li, Jiangeng Lu, Youyong Biomark Insights Original Research High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) samples against 20 normal tissue (NT) samples and identified 1,519 differentially expressed genes (DEGs). In this study, Classification Information Index (CII), Information Gain Index (IGI), and RELIEF algorithms are used to mine the previously reported gene expression profiling data. In all, 29 of these genes are identified by all three algorithms and are treated as GC candidate biomarkers. Three biomarkers, COL1A2, ATP4B, and HADHSC, are selected and further examined using quantitative real-time polymerase chain reaction (qRT-PCR) and immunohistochemistry (IHC) staining in two independent sets of GC and normal adjacent tissue (NAT) samples. Our study shows that COL1A2 and HADHSC are the two best biomarkers from the microarray data, distinguishing all GC from the NT, whereas ATP4B is diagnostically significant in lab tests because of its wider range of fold-changes in expression. Herein, a data-mining model applicable for small sample sizes is presented and discussed. Our result suggested that this mining model may be useful in small sample-size studies to identify putative biomarkers and potential biological features of GC. Libertas Academica 2014-08-14 /pmc/articles/PMC4149392/ /pubmed/25210421 http://dx.doi.org/10.4137/BMI.S13059 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article published under the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Original Research
Yan, Zhi
Luke, Brian T
Tsang, Shirley X
Xing, Rui
Pan, Yuanming
Liu, Yixuan
Wang, Jinlian
Geng, Tao
Li, Jiangeng
Lu, Youyong
Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title_full Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title_fullStr Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title_full_unstemmed Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title_short Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
title_sort identification of gene signatures used to recognize biological characteristics of gastric cancer upon gene expression data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4149392/
https://www.ncbi.nlm.nih.gov/pubmed/25210421
http://dx.doi.org/10.4137/BMI.S13059
work_keys_str_mv AT yanzhi identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT lukebriant identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT tsangshirleyx identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT xingrui identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT panyuanming identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT liuyixuan identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT wangjinlian identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT gengtao identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT lijiangeng identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata
AT luyouyong identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata