Cargando…

Inferring phenotypes from substance use via collaborative matrix completion

BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these su...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Jin, Sun, Jiangwen, Wang, Xinyu, Kranzler, Henry, Gelernter, Joel, Bi, Jinbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6249733/
https://www.ncbi.nlm.nih.gov/pubmed/30463556
http://dx.doi.org/10.1186/s12918-018-0623-5
_version_ 1783372804379377664
author Lu, Jin
Sun, Jiangwen
Wang, Xinyu
Kranzler, Henry
Gelernter, Joel
Bi, Jinbo
author_facet Lu, Jin
Sun, Jiangwen
Wang, Xinyu
Kranzler, Henry
Gelernter, Joel
Bi, Jinbo
author_sort Lu, Jin
collection PubMed
description BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these subjects can have missing phenotypic information, including diagnostic criteria for certain substances that were not originally a focus of study. Recent advances in addiction neurobiology have shown that comorbid SUDs (e.g., the abuse of multiple substances) have similar genetic determinants, which makes it possible to infer missing SUD diagnostic criteria using criteria from another SUD and patient genotypes through statistical modeling. RESULTS: We propose a new approach based on matrix completion techniques to integrate features of comorbid health conditions and individual’s genotypes to infer unreported diagnostic criteria for a disorder. This approach optimizes a bi-linear model that uses the interactions between known disease correlations and candidate genes to impute missing criteria. An efficient stochastic and parallel algorithm was developed to optimize the model with a speed 20 times greater than the classic sequential algorithm. It was tested on 3441 subjects who had both cocaine and opioid use disorders and successfully inferred missing diagnostic criteria with consistently better accuracy than other recent statistical methods. CONCLUSIONS: The proposed matrix completion imputation method is a promising tool to impute unreported or unobserved symptoms or criteria for disease diagnosis. Integrating data at multiple scales or from heterogeneous sources may help improve the accuracy of phenotype imputation.
format Online
Article
Text
id pubmed-6249733
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62497332018-11-26 Inferring phenotypes from substance use via collaborative matrix completion Lu, Jin Sun, Jiangwen Wang, Xinyu Kranzler, Henry Gelernter, Joel Bi, Jinbo BMC Syst Biol Research BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these subjects can have missing phenotypic information, including diagnostic criteria for certain substances that were not originally a focus of study. Recent advances in addiction neurobiology have shown that comorbid SUDs (e.g., the abuse of multiple substances) have similar genetic determinants, which makes it possible to infer missing SUD diagnostic criteria using criteria from another SUD and patient genotypes through statistical modeling. RESULTS: We propose a new approach based on matrix completion techniques to integrate features of comorbid health conditions and individual’s genotypes to infer unreported diagnostic criteria for a disorder. This approach optimizes a bi-linear model that uses the interactions between known disease correlations and candidate genes to impute missing criteria. An efficient stochastic and parallel algorithm was developed to optimize the model with a speed 20 times greater than the classic sequential algorithm. It was tested on 3441 subjects who had both cocaine and opioid use disorders and successfully inferred missing diagnostic criteria with consistently better accuracy than other recent statistical methods. CONCLUSIONS: The proposed matrix completion imputation method is a promising tool to impute unreported or unobserved symptoms or criteria for disease diagnosis. Integrating data at multiple scales or from heterogeneous sources may help improve the accuracy of phenotype imputation. BioMed Central 2018-11-22 /pmc/articles/PMC6249733/ /pubmed/30463556 http://dx.doi.org/10.1186/s12918-018-0623-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lu, Jin
Sun, Jiangwen
Wang, Xinyu
Kranzler, Henry
Gelernter, Joel
Bi, Jinbo
Inferring phenotypes from substance use via collaborative matrix completion
title Inferring phenotypes from substance use via collaborative matrix completion
title_full Inferring phenotypes from substance use via collaborative matrix completion
title_fullStr Inferring phenotypes from substance use via collaborative matrix completion
title_full_unstemmed Inferring phenotypes from substance use via collaborative matrix completion
title_short Inferring phenotypes from substance use via collaborative matrix completion
title_sort inferring phenotypes from substance use via collaborative matrix completion
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6249733/
https://www.ncbi.nlm.nih.gov/pubmed/30463556
http://dx.doi.org/10.1186/s12918-018-0623-5
work_keys_str_mv AT lujin inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion
AT sunjiangwen inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion
AT wangxinyu inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion
AT kranzlerhenry inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion
AT gelernterjoel inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion
AT bijinbo inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion