Cargando…

Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease

The genetic component of many common traits is associated with the gene expression and several variants act as expression quantitative loci, regulating the gene expression in a tissue specific manner. In this work, we applied tissue-specific cis-eQTL gene expression prediction models on the genotype...

Descripción completa

Detalles Bibliográficos
Autores principales: Maj, Carlo, Azevedo, Tiago, Giansanti, Valentina, Borisov, Oleg, Dimitri, Giovanna Maria, Spasov, Simeon, Lió, Pietro, Merelli, Ivan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735530/
https://www.ncbi.nlm.nih.gov/pubmed/31552082
http://dx.doi.org/10.3389/fgene.2019.00726
_version_ 1783450368410124288
author Maj, Carlo
Azevedo, Tiago
Giansanti, Valentina
Borisov, Oleg
Dimitri, Giovanna Maria
Spasov, Simeon
Lió, Pietro
Merelli, Ivan
author_facet Maj, Carlo
Azevedo, Tiago
Giansanti, Valentina
Borisov, Oleg
Dimitri, Giovanna Maria
Spasov, Simeon
Lió, Pietro
Merelli, Ivan
author_sort Maj, Carlo
collection PubMed
description The genetic component of many common traits is associated with the gene expression and several variants act as expression quantitative loci, regulating the gene expression in a tissue specific manner. In this work, we applied tissue-specific cis-eQTL gene expression prediction models on the genotype of 808 samples including controls, subjects with mild cognitive impairment, and patients with Alzheimer's Disease. We then dissected the imputed transcriptomic profiles by means of different unsupervised and supervised machine learning approaches to identify potential biological associations. Our analysis suggests that unsupervised and supervised methods can provide complementary information, which can be integrated for a better characterization of the underlying biological system. In particular, a variational autoencoder representation of the transcriptomic profiles, followed by a support vector machine classification, has been used for tissue-specific gene prioritizations. Interestingly, the achieved gene prioritizations can be efficiently integrated as a feature selection step for improving the accuracy of deep learning classifier networks. The identified gene-tissue information suggests a potential role for inflammatory and regulatory processes in gut-brain axis related tissues. In line with the expected low heritability that can be apportioned to eQTL variants, we were able to achieve only relatively low prediction capability with deep learning classification models. However, our analysis revealed that the classification power strongly depends on the network structure, with recurrent neural networks being the best performing network class. Interestingly, cross-tissue analysis suggests a potentially greater role of models trained in brain tissues also by considering dementia-related endophenotypes. Overall, the present analysis suggests that the combination of supervised and unsupervised machine learning techniques can be used for the evaluation of high dimensional omics data.
format Online
Article
Text
id pubmed-6735530
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67355302019-09-24 Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease Maj, Carlo Azevedo, Tiago Giansanti, Valentina Borisov, Oleg Dimitri, Giovanna Maria Spasov, Simeon Lió, Pietro Merelli, Ivan Front Genet Genetics The genetic component of many common traits is associated with the gene expression and several variants act as expression quantitative loci, regulating the gene expression in a tissue specific manner. In this work, we applied tissue-specific cis-eQTL gene expression prediction models on the genotype of 808 samples including controls, subjects with mild cognitive impairment, and patients with Alzheimer's Disease. We then dissected the imputed transcriptomic profiles by means of different unsupervised and supervised machine learning approaches to identify potential biological associations. Our analysis suggests that unsupervised and supervised methods can provide complementary information, which can be integrated for a better characterization of the underlying biological system. In particular, a variational autoencoder representation of the transcriptomic profiles, followed by a support vector machine classification, has been used for tissue-specific gene prioritizations. Interestingly, the achieved gene prioritizations can be efficiently integrated as a feature selection step for improving the accuracy of deep learning classifier networks. The identified gene-tissue information suggests a potential role for inflammatory and regulatory processes in gut-brain axis related tissues. In line with the expected low heritability that can be apportioned to eQTL variants, we were able to achieve only relatively low prediction capability with deep learning classification models. However, our analysis revealed that the classification power strongly depends on the network structure, with recurrent neural networks being the best performing network class. Interestingly, cross-tissue analysis suggests a potentially greater role of models trained in brain tissues also by considering dementia-related endophenotypes. Overall, the present analysis suggests that the combination of supervised and unsupervised machine learning techniques can be used for the evaluation of high dimensional omics data. Frontiers Media S.A. 2019-09-03 /pmc/articles/PMC6735530/ /pubmed/31552082 http://dx.doi.org/10.3389/fgene.2019.00726 Text en Copyright © 2019 Maj, Azevedo, Giansanti, Borisov, Dimitri, Spasov, Alzheimer’s Disease Neuroimaging Initiative, Lió and Merelli http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Maj, Carlo
Azevedo, Tiago
Giansanti, Valentina
Borisov, Oleg
Dimitri, Giovanna Maria
Spasov, Simeon
Lió, Pietro
Merelli, Ivan
Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title_full Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title_fullStr Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title_full_unstemmed Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title_short Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer’s Disease
title_sort integration of machine learning methods to dissect genetically imputed transcriptomic profiles in alzheimer’s disease
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735530/
https://www.ncbi.nlm.nih.gov/pubmed/31552082
http://dx.doi.org/10.3389/fgene.2019.00726
work_keys_str_mv AT majcarlo integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT azevedotiago integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT giansantivalentina integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT borisovoleg integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT dimitrigiovannamaria integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT spasovsimeon integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT liopietro integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease
AT merelliivan integrationofmachinelearningmethodstodissectgeneticallyimputedtranscriptomicprofilesinalzheimersdisease