Cargando…

GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证

OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in...

Descripción completa

Detalles Bibliográficos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 四川大学学报(医学版)编辑部 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579086/
https://www.ncbi.nlm.nih.gov/pubmed/37866944
http://dx.doi.org/10.12182/20230960101
_version_ 1785121647962882048
collection PubMed
description OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in the study. GEO datasets were screened for differentially expressed lncRNAs. Three algorithms, the least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF), were used to screen for candidate lncRNA models and receiver operating characteristic (ROC) curves were plotted to evaluate the models. We collected the peripheral blood samples of 30 clinical OA patients and 15 health controls and measured the immunoinflammatory indicators. RT-PCR was performed for quantitative analysis of the expression of lncRNA molecular markers in peripheral blood mononuclear cells (PBMC). Pearson analysis was performed to examine the correlation between lncRNA and indicators for inflammation of the immune system. RESULTS: A total of 14 key markers were identified with LASSO, 6 genes were identified with SVM-RFE, and 24 genes were identified with RF. Venn diagram was used to screen for overlapping genes identified with the three algorithms, showing HOTAIR, H19, MIR155HG, and NKILA to be the overlapping genes. The ROC curves showed that these four lncRNAs all had an area under the curve (AUC) greater than 0.7. The RT-PCR findings revealed relatively elevated expression of HOTAIR, H19, and MIR155HG and decreased expression of NKILA in the PBMC of OA patients compared with those of the normal group (P<0.01). The results were consistent with the bioinformatics predictions. Pearson analysis showed that the candidate lncRNAs were correlated with clinical indicators for inflammation. CONCLUSION: HOTAIR, H19, MIR155HG, and NKILA can be used as molecular markers for the clinical diagnosis of OA and are correlate with clinical indicators of inflammation of the immune system.
format Online
Article
Text
id pubmed-10579086
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher 四川大学学报(医学版)编辑部
record_format MEDLINE/PubMed
spelling pubmed-105790862023-10-18 GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 Sichuan Da Xue Xue Bao Yi Xue Ban 大数据与人工智能技术在生物医学多场景的应用 OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in the study. GEO datasets were screened for differentially expressed lncRNAs. Three algorithms, the least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF), were used to screen for candidate lncRNA models and receiver operating characteristic (ROC) curves were plotted to evaluate the models. We collected the peripheral blood samples of 30 clinical OA patients and 15 health controls and measured the immunoinflammatory indicators. RT-PCR was performed for quantitative analysis of the expression of lncRNA molecular markers in peripheral blood mononuclear cells (PBMC). Pearson analysis was performed to examine the correlation between lncRNA and indicators for inflammation of the immune system. RESULTS: A total of 14 key markers were identified with LASSO, 6 genes were identified with SVM-RFE, and 24 genes were identified with RF. Venn diagram was used to screen for overlapping genes identified with the three algorithms, showing HOTAIR, H19, MIR155HG, and NKILA to be the overlapping genes. The ROC curves showed that these four lncRNAs all had an area under the curve (AUC) greater than 0.7. The RT-PCR findings revealed relatively elevated expression of HOTAIR, H19, and MIR155HG and decreased expression of NKILA in the PBMC of OA patients compared with those of the normal group (P<0.01). The results were consistent with the bioinformatics predictions. Pearson analysis showed that the candidate lncRNAs were correlated with clinical indicators for inflammation. CONCLUSION: HOTAIR, H19, MIR155HG, and NKILA can be used as molecular markers for the clinical diagnosis of OA and are correlate with clinical indicators of inflammation of the immune system. 四川大学学报(医学版)编辑部 2023-09-20 /pmc/articles/PMC10579086/ /pubmed/37866944 http://dx.doi.org/10.12182/20230960101 Text en © 2023《四川大学学报(医学版)》编辑部 版权所有 https://creativecommons.org/licenses/by-nc/4.0/开放获取 本文遵循知识共享署名—非商业性使用4.0国际许可协议(CC BY-NC 4.0),允许第三方对本刊发表的论文自由共享(即在任何媒介以任何形式复制、发行原文)、演绎(即修改、转换或以原文为基础进行创作),必须给出适当的署名,提供指向本文许可协议的链接,同时标明是否对原文作了修改;不得将本文用于商业目的。CC BY-NC 4.0许可协议访问 https://creativecommons.org/licenses/by-nc/4.0 (https://creativecommons.org/licenses/by-nc/4.0/) https://creativecommons.org/licenses/by-nc/4.0/Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license (CC BY-NC 4.0). In other words, the full-text content of the journal is made freely available for third-party users to copy and redistribute in any medium or format, and to remix, transform, and build upon the content of the journal. You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may not use the content of the journal for commercial purposes. For more information about the license, visit https://creativecommons.org/licenses/by-nc/4.0 (https://creativecommons.org/licenses/by-nc/4.0/)
spellingShingle 大数据与人工智能技术在生物医学多场景的应用
GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title_full GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title_fullStr GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title_full_unstemmed GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title_short GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
title_sort geo数据库联合机器学习策略识别骨关节炎特征性lncrna分子标志物及实验验证
topic 大数据与人工智能技术在生物医学多场景的应用
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579086/
https://www.ncbi.nlm.nih.gov/pubmed/37866944
http://dx.doi.org/10.12182/20230960101
work_keys_str_mv AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng
AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng
AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng
AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng
AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng
AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng