Cargando…

Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches

The D-amino acid oxidase activator (DAOA, also known as G72) gene is a strong schizophrenia susceptibility gene. Higher G72 protein levels have been implicated in patients with schizophrenia. The current study aimed to differentiate patients with schizophrenia from healthy individuals using G72 sing...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Eugene, Lin, Chieh-Hsin, Lai, Yi-Lun, Huang, Chiung-Hsien, Huang, Yu-Jhen, Lane, Hsien-Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6232512/
https://www.ncbi.nlm.nih.gov/pubmed/30459659
http://dx.doi.org/10.3389/fpsyt.2018.00566
_version_ 1783370404314742784
author Lin, Eugene
Lin, Chieh-Hsin
Lai, Yi-Lun
Huang, Chiung-Hsien
Huang, Yu-Jhen
Lane, Hsien-Yuan
author_facet Lin, Eugene
Lin, Chieh-Hsin
Lai, Yi-Lun
Huang, Chiung-Hsien
Huang, Yu-Jhen
Lane, Hsien-Yuan
author_sort Lin, Eugene
collection PubMed
description The D-amino acid oxidase activator (DAOA, also known as G72) gene is a strong schizophrenia susceptibility gene. Higher G72 protein levels have been implicated in patients with schizophrenia. The current study aimed to differentiate patients with schizophrenia from healthy individuals using G72 single nucleotide polymorphisms (SNPs) and G72 protein levels by leveraging computational artificial intelligence and machine learning tools. A total of 149 subjects with 89 patients with schizophrenia and 60 healthy controls were recruited. Two G72 genotypes (including rs1421292 and rs2391191) and G72 protein levels were measured with the peripheral blood. We utilized three machine learning algorithms (including logistic regression, naive Bayes, and C4.5 decision tree) to build the optimal predictive model for distinguishing schizophrenia patients from healthy controls. The naive Bayes model using two factors, including G72 rs1421292 and G72 protein, appeared to be the best model for disease susceptibility (sensitivity = 0.7969, specificity = 0.9372, area under the receiver operating characteristic curve (AUC) = 0.9356). However, a model integrating G72 rs1421292 only slightly increased the discriminative power than a model with G72 protein alone (sensitivity = 0.7941, specificity = 0.9503, AUC = 0.9324). Among the three models with G72 protein alone, the naive Bayes with G72 protein alone had the best specificity (0.9503), while logistic regression with G72 protein alone was the most sensitive (0.8765). The findings remained similar after adjusting for age and gender. This study suggests that G72 protein alone, without incorporating the two G72 SNPs, may have been suitable enough to identify schizophrenia patients. We also recommend applying both naive Bayes and logistic regression models for the best specificity and sensitivity, respectively. Larger-scale studies are warranted to confirm the findings.
format Online
Article
Text
id pubmed-6232512
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-62325122018-11-20 Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches Lin, Eugene Lin, Chieh-Hsin Lai, Yi-Lun Huang, Chiung-Hsien Huang, Yu-Jhen Lane, Hsien-Yuan Front Psychiatry Psychiatry The D-amino acid oxidase activator (DAOA, also known as G72) gene is a strong schizophrenia susceptibility gene. Higher G72 protein levels have been implicated in patients with schizophrenia. The current study aimed to differentiate patients with schizophrenia from healthy individuals using G72 single nucleotide polymorphisms (SNPs) and G72 protein levels by leveraging computational artificial intelligence and machine learning tools. A total of 149 subjects with 89 patients with schizophrenia and 60 healthy controls were recruited. Two G72 genotypes (including rs1421292 and rs2391191) and G72 protein levels were measured with the peripheral blood. We utilized three machine learning algorithms (including logistic regression, naive Bayes, and C4.5 decision tree) to build the optimal predictive model for distinguishing schizophrenia patients from healthy controls. The naive Bayes model using two factors, including G72 rs1421292 and G72 protein, appeared to be the best model for disease susceptibility (sensitivity = 0.7969, specificity = 0.9372, area under the receiver operating characteristic curve (AUC) = 0.9356). However, a model integrating G72 rs1421292 only slightly increased the discriminative power than a model with G72 protein alone (sensitivity = 0.7941, specificity = 0.9503, AUC = 0.9324). Among the three models with G72 protein alone, the naive Bayes with G72 protein alone had the best specificity (0.9503), while logistic regression with G72 protein alone was the most sensitive (0.8765). The findings remained similar after adjusting for age and gender. This study suggests that G72 protein alone, without incorporating the two G72 SNPs, may have been suitable enough to identify schizophrenia patients. We also recommend applying both naive Bayes and logistic regression models for the best specificity and sensitivity, respectively. Larger-scale studies are warranted to confirm the findings. Frontiers Media S.A. 2018-11-06 /pmc/articles/PMC6232512/ /pubmed/30459659 http://dx.doi.org/10.3389/fpsyt.2018.00566 Text en Copyright © 2018 Lin, Lin, Lai, Huang, Huang and Lane. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Lin, Eugene
Lin, Chieh-Hsin
Lai, Yi-Lun
Huang, Chiung-Hsien
Huang, Yu-Jhen
Lane, Hsien-Yuan
Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title_full Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title_fullStr Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title_full_unstemmed Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title_short Combination of G72 Genetic Variation and G72 Protein Level to Detect Schizophrenia: Machine Learning Approaches
title_sort combination of g72 genetic variation and g72 protein level to detect schizophrenia: machine learning approaches
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6232512/
https://www.ncbi.nlm.nih.gov/pubmed/30459659
http://dx.doi.org/10.3389/fpsyt.2018.00566
work_keys_str_mv AT lineugene combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches
AT linchiehhsin combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches
AT laiyilun combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches
AT huangchiunghsien combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches
AT huangyujhen combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches
AT lanehsienyuan combinationofg72geneticvariationandg72proteinleveltodetectschizophreniamachinelearningapproaches