Cargando…

A probabilistic method for identifying rare variants underlying complex traits

BACKGROUND: Identifying the genetic variants that contribute to disease susceptibilities is important both for developing methodologies and for studying complex diseases in molecular biology. It has been demonstrated that the spectrum of minor allelic frequencies (MAFs) of risk genetic variants rang...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jiayin, Zhao, Zhongmeng, Cao, Zhi, Yang, Aiyuan, Zhang, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549819/
https://www.ncbi.nlm.nih.gov/pubmed/23369113
http://dx.doi.org/10.1186/1471-2164-14-S1-S11
_version_ 1782256477349609472
author Wang, Jiayin
Zhao, Zhongmeng
Cao, Zhi
Yang, Aiyuan
Zhang, Jin
author_facet Wang, Jiayin
Zhao, Zhongmeng
Cao, Zhi
Yang, Aiyuan
Zhang, Jin
author_sort Wang, Jiayin
collection PubMed
description BACKGROUND: Identifying the genetic variants that contribute to disease susceptibilities is important both for developing methodologies and for studying complex diseases in molecular biology. It has been demonstrated that the spectrum of minor allelic frequencies (MAFs) of risk genetic variants ranges from common to rare. Although association studies are shifting to incorporate rare variants (RVs) affecting complex traits, existing approaches do not show a high degree of success, and more efforts should be considered. RESULTS: In this article, we focus on detecting associations between multiple rare variants and traits. Similar to RareCover, a widely used approach, we assume that variants located close to each other tend to have similar impacts on traits. Therefore, we introduce elevated regions and background regions, where the elevated regions are considered to have a higher chance of harboring causal variants. We propose a hidden Markov random field (HMRF) model to select a set of rare variants that potentially underlie the phenotype, and then, a statistical test is applied. Thus, the association analysis can be achieved without pre-selection by experts. In our model, each variant has two hidden states that represent the causal/non-causal status and the region status. In addition, two Bayesian processes are used to compare and estimate the genotype, phenotype and model parameters. We compare our approach to the three current methods using different types of datasets, and though these are simulation experiments, our approach has higher statistical power than the other methods. The software package, RareProb and the simulation datasets are available at: http://www.engr.uconn.edu/~jiw09003.
format Online
Article
Text
id pubmed-3549819
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35498192013-01-23 A probabilistic method for identifying rare variants underlying complex traits Wang, Jiayin Zhao, Zhongmeng Cao, Zhi Yang, Aiyuan Zhang, Jin BMC Genomics Proceedings BACKGROUND: Identifying the genetic variants that contribute to disease susceptibilities is important both for developing methodologies and for studying complex diseases in molecular biology. It has been demonstrated that the spectrum of minor allelic frequencies (MAFs) of risk genetic variants ranges from common to rare. Although association studies are shifting to incorporate rare variants (RVs) affecting complex traits, existing approaches do not show a high degree of success, and more efforts should be considered. RESULTS: In this article, we focus on detecting associations between multiple rare variants and traits. Similar to RareCover, a widely used approach, we assume that variants located close to each other tend to have similar impacts on traits. Therefore, we introduce elevated regions and background regions, where the elevated regions are considered to have a higher chance of harboring causal variants. We propose a hidden Markov random field (HMRF) model to select a set of rare variants that potentially underlie the phenotype, and then, a statistical test is applied. Thus, the association analysis can be achieved without pre-selection by experts. In our model, each variant has two hidden states that represent the causal/non-causal status and the region status. In addition, two Bayesian processes are used to compare and estimate the genotype, phenotype and model parameters. We compare our approach to the three current methods using different types of datasets, and though these are simulation experiments, our approach has higher statistical power than the other methods. The software package, RareProb and the simulation datasets are available at: http://www.engr.uconn.edu/~jiw09003. BioMed Central 2013-01-21 /pmc/articles/PMC3549819/ /pubmed/23369113 http://dx.doi.org/10.1186/1471-2164-14-S1-S11 Text en Copyright ©2013 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Jiayin
Zhao, Zhongmeng
Cao, Zhi
Yang, Aiyuan
Zhang, Jin
A probabilistic method for identifying rare variants underlying complex traits
title A probabilistic method for identifying rare variants underlying complex traits
title_full A probabilistic method for identifying rare variants underlying complex traits
title_fullStr A probabilistic method for identifying rare variants underlying complex traits
title_full_unstemmed A probabilistic method for identifying rare variants underlying complex traits
title_short A probabilistic method for identifying rare variants underlying complex traits
title_sort probabilistic method for identifying rare variants underlying complex traits
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549819/
https://www.ncbi.nlm.nih.gov/pubmed/23369113
http://dx.doi.org/10.1186/1471-2164-14-S1-S11
work_keys_str_mv AT wangjiayin aprobabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT zhaozhongmeng aprobabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT caozhi aprobabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT yangaiyuan aprobabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT zhangjin aprobabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT wangjiayin probabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT zhaozhongmeng probabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT caozhi probabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT yangaiyuan probabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits
AT zhangjin probabilisticmethodforidentifyingrarevariantsunderlyingcomplextraits