Cargando…
GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证
OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in...
Formato: | Online Artículo Texto |
---|---|
Lenguaje: | English |
Publicado: |
四川大学学报(医学版)编辑部
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579086/ https://www.ncbi.nlm.nih.gov/pubmed/37866944 http://dx.doi.org/10.12182/20230960101 |
_version_ | 1785121647962882048 |
---|---|
collection | PubMed |
description | OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in the study. GEO datasets were screened for differentially expressed lncRNAs. Three algorithms, the least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF), were used to screen for candidate lncRNA models and receiver operating characteristic (ROC) curves were plotted to evaluate the models. We collected the peripheral blood samples of 30 clinical OA patients and 15 health controls and measured the immunoinflammatory indicators. RT-PCR was performed for quantitative analysis of the expression of lncRNA molecular markers in peripheral blood mononuclear cells (PBMC). Pearson analysis was performed to examine the correlation between lncRNA and indicators for inflammation of the immune system. RESULTS: A total of 14 key markers were identified with LASSO, 6 genes were identified with SVM-RFE, and 24 genes were identified with RF. Venn diagram was used to screen for overlapping genes identified with the three algorithms, showing HOTAIR, H19, MIR155HG, and NKILA to be the overlapping genes. The ROC curves showed that these four lncRNAs all had an area under the curve (AUC) greater than 0.7. The RT-PCR findings revealed relatively elevated expression of HOTAIR, H19, and MIR155HG and decreased expression of NKILA in the PBMC of OA patients compared with those of the normal group (P<0.01). The results were consistent with the bioinformatics predictions. Pearson analysis showed that the candidate lncRNAs were correlated with clinical indicators for inflammation. CONCLUSION: HOTAIR, H19, MIR155HG, and NKILA can be used as molecular markers for the clinical diagnosis of OA and are correlate with clinical indicators of inflammation of the immune system. |
format | Online Article Text |
id | pubmed-10579086 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | 四川大学学报(医学版)编辑部 |
record_format | MEDLINE/PubMed |
spelling | pubmed-105790862023-10-18 GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 Sichuan Da Xue Xue Bao Yi Xue Ban 大数据与人工智能技术在生物医学多场景的应用 OBJECTIVE: To screen for long non-coding RNA (lncRNA) molecular markers characteristic of osteoarthritis (OA) by utilizing the Gene Expression Omnibus (GEO) database combined with machine learning. METHODS: The samples of 185 OA patients and 76 healthy individuals as normal controls were included in the study. GEO datasets were screened for differentially expressed lncRNAs. Three algorithms, the least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF), were used to screen for candidate lncRNA models and receiver operating characteristic (ROC) curves were plotted to evaluate the models. We collected the peripheral blood samples of 30 clinical OA patients and 15 health controls and measured the immunoinflammatory indicators. RT-PCR was performed for quantitative analysis of the expression of lncRNA molecular markers in peripheral blood mononuclear cells (PBMC). Pearson analysis was performed to examine the correlation between lncRNA and indicators for inflammation of the immune system. RESULTS: A total of 14 key markers were identified with LASSO, 6 genes were identified with SVM-RFE, and 24 genes were identified with RF. Venn diagram was used to screen for overlapping genes identified with the three algorithms, showing HOTAIR, H19, MIR155HG, and NKILA to be the overlapping genes. The ROC curves showed that these four lncRNAs all had an area under the curve (AUC) greater than 0.7. The RT-PCR findings revealed relatively elevated expression of HOTAIR, H19, and MIR155HG and decreased expression of NKILA in the PBMC of OA patients compared with those of the normal group (P<0.01). The results were consistent with the bioinformatics predictions. Pearson analysis showed that the candidate lncRNAs were correlated with clinical indicators for inflammation. CONCLUSION: HOTAIR, H19, MIR155HG, and NKILA can be used as molecular markers for the clinical diagnosis of OA and are correlate with clinical indicators of inflammation of the immune system. 四川大学学报(医学版)编辑部 2023-09-20 /pmc/articles/PMC10579086/ /pubmed/37866944 http://dx.doi.org/10.12182/20230960101 Text en © 2023《四川大学学报(医学版)》编辑部 版权所有 https://creativecommons.org/licenses/by-nc/4.0/开放获取 本文遵循知识共享署名—非商业性使用4.0国际许可协议(CC BY-NC 4.0),允许第三方对本刊发表的论文自由共享(即在任何媒介以任何形式复制、发行原文)、演绎(即修改、转换或以原文为基础进行创作),必须给出适当的署名,提供指向本文许可协议的链接,同时标明是否对原文作了修改;不得将本文用于商业目的。CC BY-NC 4.0许可协议访问 https://creativecommons.org/licenses/by-nc/4.0 (https://creativecommons.org/licenses/by-nc/4.0/) https://creativecommons.org/licenses/by-nc/4.0/Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license (CC BY-NC 4.0). In other words, the full-text content of the journal is made freely available for third-party users to copy and redistribute in any medium or format, and to remix, transform, and build upon the content of the journal. You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may not use the content of the journal for commercial purposes. For more information about the license, visit https://creativecommons.org/licenses/by-nc/4.0 (https://creativecommons.org/licenses/by-nc/4.0/) |
spellingShingle | 大数据与人工智能技术在生物医学多场景的应用 GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title | GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title_full | GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title_fullStr | GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title_full_unstemmed | GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title_short | GEO数据库联合机器学习策略识别骨关节炎特征性lncRNA分子标志物及实验验证 |
title_sort | geo数据库联合机器学习策略识别骨关节炎特征性lncrna分子标志物及实验验证 |
topic | 大数据与人工智能技术在生物医学多场景的应用 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579086/ https://www.ncbi.nlm.nih.gov/pubmed/37866944 http://dx.doi.org/10.12182/20230960101 |
work_keys_str_mv | AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng AT geoshùjùkùliánhéjīqìxuéxícèlüèshíbiégǔguānjiéyántèzhēngxìnglncrnafēnzibiāozhìwùjíshíyànyànzhèng |