Cargando…
M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits
Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568192/ https://www.ncbi.nlm.nih.gov/pubmed/34735430 http://dx.doi.org/10.1371/journal.pgen.1009849 |
_version_ | 1784594384214294528 |
---|---|
author | Xie, Yuhan Li, Mo Dong, Weilai Jiang, Wei Zhao, Hongyu |
author_facet | Xie, Yuhan Li, Mo Dong, Weilai Jiang, Wei Zhao, Hongyu |
author_sort | Xie, Yuhan |
collection | PubMed |
description | Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can jointly analyze DNMs from multiple traits. In this study, we develop a framework called M-DATA (Multi-trait framework for De novo mutation Association Test with Annotations) to increase the statistical power of association analysis by integrating data from multiple correlated traits and their functional annotations. Using the number of DNMs from multiple diseases, we develop a method based on an Expectation-Maximization algorithm to both infer the degree of association between two diseases as well as to estimate the gene association probability for each disease. We apply our method to a case study of jointly analyzing data from congenital heart disease (CHD) and autism. Our method was able to identify 23 genes for CHD from joint analysis, including 12 novel genes, which is substantially more than single-trait analysis, leading to novel insights into CHD disease etiology. |
format | Online Article Text |
id | pubmed-8568192 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-85681922021-11-05 M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits Xie, Yuhan Li, Mo Dong, Weilai Jiang, Wei Zhao, Hongyu PLoS Genet Research Article Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can jointly analyze DNMs from multiple traits. In this study, we develop a framework called M-DATA (Multi-trait framework for De novo mutation Association Test with Annotations) to increase the statistical power of association analysis by integrating data from multiple correlated traits and their functional annotations. Using the number of DNMs from multiple diseases, we develop a method based on an Expectation-Maximization algorithm to both infer the degree of association between two diseases as well as to estimate the gene association probability for each disease. We apply our method to a case study of jointly analyzing data from congenital heart disease (CHD) and autism. Our method was able to identify 23 genes for CHD from joint analysis, including 12 novel genes, which is substantially more than single-trait analysis, leading to novel insights into CHD disease etiology. Public Library of Science 2021-11-04 /pmc/articles/PMC8568192/ /pubmed/34735430 http://dx.doi.org/10.1371/journal.pgen.1009849 Text en © 2021 Xie et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Xie, Yuhan Li, Mo Dong, Weilai Jiang, Wei Zhao, Hongyu M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title | M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title_full | M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title_fullStr | M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title_full_unstemmed | M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title_short | M-DATA: A statistical approach to jointly analyzing de novo mutations for multiple traits |
title_sort | m-data: a statistical approach to jointly analyzing de novo mutations for multiple traits |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568192/ https://www.ncbi.nlm.nih.gov/pubmed/34735430 http://dx.doi.org/10.1371/journal.pgen.1009849 |
work_keys_str_mv | AT xieyuhan mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits AT limo mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits AT dongweilai mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits AT jiangwei mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits AT zhaohongyu mdataastatisticalapproachtojointlyanalyzingdenovomutationsformultipletraits |