Cargando…

An Information-Theoretic Machine Learning Approach to Expression QTL Analysis

Expression Quantitative Trait Locus (eQTL) analysis is a powerful tool to study the biological mechanisms linking the genotype with gene expression. Such analyses can identify genomic locations where genotypic variants influence the expression of genes, both in close proximity to the variant (cis-eQ...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692482/
https://www.ncbi.nlm.nih.gov/pubmed/23825689
http://dx.doi.org/10.1371/journal.pone.0067899
_version_ 1782274623175393280
author Huang, Tao
Cai, Yu-Dong
author_facet Huang, Tao
Cai, Yu-Dong
author_sort Huang, Tao
collection PubMed
description Expression Quantitative Trait Locus (eQTL) analysis is a powerful tool to study the biological mechanisms linking the genotype with gene expression. Such analyses can identify genomic locations where genotypic variants influence the expression of genes, both in close proximity to the variant (cis-eQTL), and on other chromosomes (trans-eQTL). Many traditional eQTL methods are based on a linear regression model. In this study, we propose a novel method by which to identify eQTL associations with information theory and machine learning approaches. Mutual Information (MI) is used to describe the association between genetic marker and gene expression. MI can detect both linear and non-linear associations. What’s more, it can capture the heterogeneity of the population. Advanced feature selection methods, Maximum Relevance Minimum Redundancy (mRMR) and Incremental Feature Selection (IFS), were applied to optimize the selection of the affected genes by the genetic marker. When we applied our method to a study of apoE-deficient mice, it was found that the cis-acting eQTLs are stronger than trans-acting eQTLs but there are more trans-acting eQTLs than cis-acting eQTLs. We compared our results (mRMR.eQTL) with R/qtl, and MatrixEQTL (modelLINEAR and modelANOVA). In female mice, 67.9% of mRMR.eQTL results can be confirmed by at least two other methods while only 14.4% of R/qtl result can be confirmed by at least two other methods. In male mice, 74.1% of mRMR.eQTL results can be confirmed by at least two other methods while only 18.2% of R/qtl result can be confirmed by at least two other methods. Our methods provide a new way to identify the association between genetic markers and gene expression. Our software is available from supporting information.
format Online
Article
Text
id pubmed-3692482
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36924822013-07-02 An Information-Theoretic Machine Learning Approach to Expression QTL Analysis Huang, Tao Cai, Yu-Dong PLoS One Research Article Expression Quantitative Trait Locus (eQTL) analysis is a powerful tool to study the biological mechanisms linking the genotype with gene expression. Such analyses can identify genomic locations where genotypic variants influence the expression of genes, both in close proximity to the variant (cis-eQTL), and on other chromosomes (trans-eQTL). Many traditional eQTL methods are based on a linear regression model. In this study, we propose a novel method by which to identify eQTL associations with information theory and machine learning approaches. Mutual Information (MI) is used to describe the association between genetic marker and gene expression. MI can detect both linear and non-linear associations. What’s more, it can capture the heterogeneity of the population. Advanced feature selection methods, Maximum Relevance Minimum Redundancy (mRMR) and Incremental Feature Selection (IFS), were applied to optimize the selection of the affected genes by the genetic marker. When we applied our method to a study of apoE-deficient mice, it was found that the cis-acting eQTLs are stronger than trans-acting eQTLs but there are more trans-acting eQTLs than cis-acting eQTLs. We compared our results (mRMR.eQTL) with R/qtl, and MatrixEQTL (modelLINEAR and modelANOVA). In female mice, 67.9% of mRMR.eQTL results can be confirmed by at least two other methods while only 14.4% of R/qtl result can be confirmed by at least two other methods. In male mice, 74.1% of mRMR.eQTL results can be confirmed by at least two other methods while only 18.2% of R/qtl result can be confirmed by at least two other methods. Our methods provide a new way to identify the association between genetic markers and gene expression. Our software is available from supporting information. Public Library of Science 2013-06-25 /pmc/articles/PMC3692482/ /pubmed/23825689 http://dx.doi.org/10.1371/journal.pone.0067899 Text en © 2013 Huang, Cai http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Huang, Tao
Cai, Yu-Dong
An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title_full An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title_fullStr An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title_full_unstemmed An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title_short An Information-Theoretic Machine Learning Approach to Expression QTL Analysis
title_sort information-theoretic machine learning approach to expression qtl analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692482/
https://www.ncbi.nlm.nih.gov/pubmed/23825689
http://dx.doi.org/10.1371/journal.pone.0067899
work_keys_str_mv AT huangtao aninformationtheoreticmachinelearningapproachtoexpressionqtlanalysis
AT caiyudong aninformationtheoreticmachinelearningapproachtoexpressionqtlanalysis
AT huangtao informationtheoreticmachinelearningapproachtoexpressionqtlanalysis
AT caiyudong informationtheoreticmachinelearningapproachtoexpressionqtlanalysis