Cargando…

M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits

Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Yuhan, Li, Mo, Dong, Weilai, Jiang, Wei, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568192/
https://www.ncbi.nlm.nih.gov/pubmed/34735430
http://dx.doi.org/10.1371/journal.pgen.1009849
_version_ 1784594384214294528
author Xie, Yuhan
Li, Mo
Dong, Weilai
Jiang, Wei
Zhao, Hongyu
author_facet Xie, Yuhan
Li, Mo
Dong, Weilai
Jiang, Wei
Zhao, Hongyu
author_sort Xie, Yuhan
collection PubMed
description Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can jointly analyze DNMs from multiple traits. In this study, we develop a framework called M-DATA (Multi-trait framework for De novo mutation Association Test with Annotations) to increase the statistical power of association analysis by integrating data from multiple correlated traits and their functional annotations. Using the number of DNMs from multiple diseases, we develop a method based on an Expectation-Maximization algorithm to both infer the degree of association between two diseases as well as to estimate the gene association probability for each disease. We apply our method to a case study of jointly analyzing data from congenital heart disease (CHD) and autism. Our method was able to identify 23 genes for CHD from joint analysis, including 12 novel genes, which is substantially more than single-trait analysis, leading to novel insights into CHD disease etiology.
format Online
Article
Text
id pubmed-8568192
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-85681922021-11-05 M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits Xie, Yuhan Li, Mo Dong, Weilai Jiang, Wei Zhao, Hongyu PLoS Genet Research Article Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can jointly analyze DNMs from multiple traits. In this study, we develop a framework called M-DATA (Multi-trait framework for De novo mutation Association Test with Annotations) to increase the statistical power of association analysis by integrating data from multiple correlated traits and their functional annotations. Using the number of DNMs from multiple diseases, we develop a method based on an Expectation-Maximization algorithm to both infer the degree of association between two diseases as well as to estimate the gene association probability for each disease. We apply our method to a case study of jointly analyzing data from congenital heart disease (CHD) and autism. Our method was able to identify 23 genes for CHD from joint analysis, including 12 novel genes, which is substantially more than single-trait analysis, leading to novel insights into CHD disease etiology. Public Library of Science 2021-11-04 /pmc/articles/PMC8568192/ /pubmed/34735430 http://dx.doi.org/10.1371/journal.pgen.1009849 Text en © 2021 Xie et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Xie, Yuhan
Li, Mo
Dong, Weilai
Jiang, Wei
Zhao, Hongyu
M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title_full M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title_fullStr M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title_full_unstemmed M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title_short M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
title_sort m-data: a statistical approach to jointly analyzing de novo mutations for multiple traits
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568192/
https://www.ncbi.nlm.nih.gov/pubmed/34735430
http://dx.doi.org/10.1371/journal.pgen.1009849
work_keys_str_mv AT xieyuhan mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits
AT limo mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits
AT dongweilai mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits
AT jiangwei mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits
AT zhaohongyu mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits