Cargando…
Inferring phenotypes from substance use via collaborative matrix completion
BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these su...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6249733/ https://www.ncbi.nlm.nih.gov/pubmed/30463556 http://dx.doi.org/10.1186/s12918-018-0623-5 |
_version_ | 1783372804379377664 |
---|---|
author | Lu, Jin Sun, Jiangwen Wang, Xinyu Kranzler, Henry Gelernter, Joel Bi, Jinbo |
author_facet | Lu, Jin Sun, Jiangwen Wang, Xinyu Kranzler, Henry Gelernter, Joel Bi, Jinbo |
author_sort | Lu, Jin |
collection | PubMed |
description | BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these subjects can have missing phenotypic information, including diagnostic criteria for certain substances that were not originally a focus of study. Recent advances in addiction neurobiology have shown that comorbid SUDs (e.g., the abuse of multiple substances) have similar genetic determinants, which makes it possible to infer missing SUD diagnostic criteria using criteria from another SUD and patient genotypes through statistical modeling. RESULTS: We propose a new approach based on matrix completion techniques to integrate features of comorbid health conditions and individual’s genotypes to infer unreported diagnostic criteria for a disorder. This approach optimizes a bi-linear model that uses the interactions between known disease correlations and candidate genes to impute missing criteria. An efficient stochastic and parallel algorithm was developed to optimize the model with a speed 20 times greater than the classic sequential algorithm. It was tested on 3441 subjects who had both cocaine and opioid use disorders and successfully inferred missing diagnostic criteria with consistently better accuracy than other recent statistical methods. CONCLUSIONS: The proposed matrix completion imputation method is a promising tool to impute unreported or unobserved symptoms or criteria for disease diagnosis. Integrating data at multiple scales or from heterogeneous sources may help improve the accuracy of phenotype imputation. |
format | Online Article Text |
id | pubmed-6249733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62497332018-11-26 Inferring phenotypes from substance use via collaborative matrix completion Lu, Jin Sun, Jiangwen Wang, Xinyu Kranzler, Henry Gelernter, Joel Bi, Jinbo BMC Syst Biol Research BACKGROUND: Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these subjects can have missing phenotypic information, including diagnostic criteria for certain substances that were not originally a focus of study. Recent advances in addiction neurobiology have shown that comorbid SUDs (e.g., the abuse of multiple substances) have similar genetic determinants, which makes it possible to infer missing SUD diagnostic criteria using criteria from another SUD and patient genotypes through statistical modeling. RESULTS: We propose a new approach based on matrix completion techniques to integrate features of comorbid health conditions and individual’s genotypes to infer unreported diagnostic criteria for a disorder. This approach optimizes a bi-linear model that uses the interactions between known disease correlations and candidate genes to impute missing criteria. An efficient stochastic and parallel algorithm was developed to optimize the model with a speed 20 times greater than the classic sequential algorithm. It was tested on 3441 subjects who had both cocaine and opioid use disorders and successfully inferred missing diagnostic criteria with consistently better accuracy than other recent statistical methods. CONCLUSIONS: The proposed matrix completion imputation method is a promising tool to impute unreported or unobserved symptoms or criteria for disease diagnosis. Integrating data at multiple scales or from heterogeneous sources may help improve the accuracy of phenotype imputation. BioMed Central 2018-11-22 /pmc/articles/PMC6249733/ /pubmed/30463556 http://dx.doi.org/10.1186/s12918-018-0623-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lu, Jin Sun, Jiangwen Wang, Xinyu Kranzler, Henry Gelernter, Joel Bi, Jinbo Inferring phenotypes from substance use via collaborative matrix completion |
title | Inferring phenotypes from substance use via collaborative matrix completion |
title_full | Inferring phenotypes from substance use via collaborative matrix completion |
title_fullStr | Inferring phenotypes from substance use via collaborative matrix completion |
title_full_unstemmed | Inferring phenotypes from substance use via collaborative matrix completion |
title_short | Inferring phenotypes from substance use via collaborative matrix completion |
title_sort | inferring phenotypes from substance use via collaborative matrix completion |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6249733/ https://www.ncbi.nlm.nih.gov/pubmed/30463556 http://dx.doi.org/10.1186/s12918-018-0623-5 |
work_keys_str_mv | AT lujin inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion AT sunjiangwen inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion AT wangxinyu inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion AT kranzlerhenry inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion AT gelernterjoel inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion AT bijinbo inferringphenotypesfromsubstanceuseviacollaborativematrixcompletion |