Cargando…
Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data
High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) s...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4149392/ https://www.ncbi.nlm.nih.gov/pubmed/25210421 http://dx.doi.org/10.4137/BMI.S13059 |
_version_ | 1782332747065327616 |
---|---|
author | Yan, Zhi Luke, Brian T Tsang, Shirley X Xing, Rui Pan, Yuanming Liu, Yixuan Wang, Jinlian Geng, Tao Li, Jiangeng Lu, Youyong |
author_facet | Yan, Zhi Luke, Brian T Tsang, Shirley X Xing, Rui Pan, Yuanming Liu, Yixuan Wang, Jinlian Geng, Tao Li, Jiangeng Lu, Youyong |
author_sort | Yan, Zhi |
collection | PubMed |
description | High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) samples against 20 normal tissue (NT) samples and identified 1,519 differentially expressed genes (DEGs). In this study, Classification Information Index (CII), Information Gain Index (IGI), and RELIEF algorithms are used to mine the previously reported gene expression profiling data. In all, 29 of these genes are identified by all three algorithms and are treated as GC candidate biomarkers. Three biomarkers, COL1A2, ATP4B, and HADHSC, are selected and further examined using quantitative real-time polymerase chain reaction (qRT-PCR) and immunohistochemistry (IHC) staining in two independent sets of GC and normal adjacent tissue (NAT) samples. Our study shows that COL1A2 and HADHSC are the two best biomarkers from the microarray data, distinguishing all GC from the NT, whereas ATP4B is diagnostically significant in lab tests because of its wider range of fold-changes in expression. Herein, a data-mining model applicable for small sample sizes is presented and discussed. Our result suggested that this mining model may be useful in small sample-size studies to identify putative biomarkers and potential biological features of GC. |
format | Online Article Text |
id | pubmed-4149392 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-41493922014-09-10 Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data Yan, Zhi Luke, Brian T Tsang, Shirley X Xing, Rui Pan, Yuanming Liu, Yixuan Wang, Jinlian Geng, Tao Li, Jiangeng Lu, Youyong Biomark Insights Original Research High-throughput gene expression microarrays can be examined by machine-learning algorithms to identify gene signatures that recognize the biological characteristics of specific human diseases, including cancer, with high sensitivity and specificity. A previous study compared 20 gastric cancer (GC) samples against 20 normal tissue (NT) samples and identified 1,519 differentially expressed genes (DEGs). In this study, Classification Information Index (CII), Information Gain Index (IGI), and RELIEF algorithms are used to mine the previously reported gene expression profiling data. In all, 29 of these genes are identified by all three algorithms and are treated as GC candidate biomarkers. Three biomarkers, COL1A2, ATP4B, and HADHSC, are selected and further examined using quantitative real-time polymerase chain reaction (qRT-PCR) and immunohistochemistry (IHC) staining in two independent sets of GC and normal adjacent tissue (NAT) samples. Our study shows that COL1A2 and HADHSC are the two best biomarkers from the microarray data, distinguishing all GC from the NT, whereas ATP4B is diagnostically significant in lab tests because of its wider range of fold-changes in expression. Herein, a data-mining model applicable for small sample sizes is presented and discussed. Our result suggested that this mining model may be useful in small sample-size studies to identify putative biomarkers and potential biological features of GC. Libertas Academica 2014-08-14 /pmc/articles/PMC4149392/ /pubmed/25210421 http://dx.doi.org/10.4137/BMI.S13059 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article published under the Creative Commons CC-BY-NC 3.0 License. |
spellingShingle | Original Research Yan, Zhi Luke, Brian T Tsang, Shirley X Xing, Rui Pan, Yuanming Liu, Yixuan Wang, Jinlian Geng, Tao Li, Jiangeng Lu, Youyong Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title | Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title_full | Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title_fullStr | Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title_full_unstemmed | Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title_short | Identification of Gene Signatures Used to Recognize Biological Characteristics of Gastric Cancer Upon Gene Expression Data |
title_sort | identification of gene signatures used to recognize biological characteristics of gastric cancer upon gene expression data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4149392/ https://www.ncbi.nlm.nih.gov/pubmed/25210421 http://dx.doi.org/10.4137/BMI.S13059 |
work_keys_str_mv | AT yanzhi identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT lukebriant identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT tsangshirleyx identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT xingrui identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT panyuanming identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT liuyixuan identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT wangjinlian identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT gengtao identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT lijiangeng identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata AT luyouyong identificationofgenesignaturesusedtorecognizebiologicalcharacteristicsofgastriccancerupongeneexpressiondata |