Cargando…

MARV: a tool for genome-wide multi-phenotype analysis of rare variants

BACKGROUND: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaakinen, Marika, Mägi, Reedik, Fischer, Krista, Heikkinen, Jani, Järvelin, Marjo-Riitta, Morris, Andrew P., Prokopenko, Inga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5311849/
https://www.ncbi.nlm.nih.gov/pubmed/28209135
http://dx.doi.org/10.1186/s12859-017-1530-2
_version_ 1782508103711850496
author Kaakinen, Marika
Mägi, Reedik
Fischer, Krista
Heikkinen, Jani
Järvelin, Marjo-Riitta
Morris, Andrew P.
Prokopenko, Inga
author_facet Kaakinen, Marika
Mägi, Reedik
Fischer, Krista
Heikkinen, Jani
Järvelin, Marjo-Riitta
Morris, Andrew P.
Prokopenko, Inga
author_sort Kaakinen, Marika
collection PubMed
description BACKGROUND: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively have led to method development beyond standard common variant analysis, including single-phenotype rare variant and multi-phenotype common variant analysis, with the latter increasing power for locus discovery and providing suggestions of pleiotropic effects. However, there are currently no optimal methods and tools for the combined analysis of rare variants and multiple phenotypes. RESULTS: We propose a user-friendly software tool MARV for Multi-phenotype Analysis of Rare Variants. The tool is based on a method that collapses rare variants within a genomic region and models the proportion of minor alleles in the rare variants on a linear combination of multiple phenotypes. MARV provides analyses of all phenotype combinations within one run and calculates the Bayesian Information Criterion to facilitate model selection. The running time increases with the size of the genetic data while the number of phenotypes to analyse has little effect both on running time and required memory. We illustrate the use of MARV with analysis of triglycerides (TG), fasting insulin (FI) and waist-to-hip ratio (WHR) in 4,721 individuals from the Northern Finland Birth Cohort 1966. The analysis suggests novel multi-phenotype effects for these metabolic traits at APOA5 and ZNF259, and at ZNF259 provides stronger support for association (P (TG+FI) = 1.8 × 10(−9)) than observed in single phenotype rare variant analyses (P (TG) = 6.5 × 10(−8) and P (FI) = 0.27). CONCLUSIONS: MARV is a computationally efficient, flexible and user-friendly software tool allowing rapid identification of rare variant effects on multiple phenotypes, thus paving the way for novel discoveries and insights into biology of complex traits. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1530-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5311849
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53118492017-02-22 MARV: a tool for genome-wide multi-phenotype analysis of rare variants Kaakinen, Marika Mägi, Reedik Fischer, Krista Heikkinen, Jani Järvelin, Marjo-Riitta Morris, Andrew P. Prokopenko, Inga BMC Bioinformatics Software BACKGROUND: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively have led to method development beyond standard common variant analysis, including single-phenotype rare variant and multi-phenotype common variant analysis, with the latter increasing power for locus discovery and providing suggestions of pleiotropic effects. However, there are currently no optimal methods and tools for the combined analysis of rare variants and multiple phenotypes. RESULTS: We propose a user-friendly software tool MARV for Multi-phenotype Analysis of Rare Variants. The tool is based on a method that collapses rare variants within a genomic region and models the proportion of minor alleles in the rare variants on a linear combination of multiple phenotypes. MARV provides analyses of all phenotype combinations within one run and calculates the Bayesian Information Criterion to facilitate model selection. The running time increases with the size of the genetic data while the number of phenotypes to analyse has little effect both on running time and required memory. We illustrate the use of MARV with analysis of triglycerides (TG), fasting insulin (FI) and waist-to-hip ratio (WHR) in 4,721 individuals from the Northern Finland Birth Cohort 1966. The analysis suggests novel multi-phenotype effects for these metabolic traits at APOA5 and ZNF259, and at ZNF259 provides stronger support for association (P (TG+FI) = 1.8 × 10(−9)) than observed in single phenotype rare variant analyses (P (TG) = 6.5 × 10(−8) and P (FI) = 0.27). CONCLUSIONS: MARV is a computationally efficient, flexible and user-friendly software tool allowing rapid identification of rare variant effects on multiple phenotypes, thus paving the way for novel discoveries and insights into biology of complex traits. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1530-2) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-16 /pmc/articles/PMC5311849/ /pubmed/28209135 http://dx.doi.org/10.1186/s12859-017-1530-2 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Kaakinen, Marika
Mägi, Reedik
Fischer, Krista
Heikkinen, Jani
Järvelin, Marjo-Riitta
Morris, Andrew P.
Prokopenko, Inga
MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title_full MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title_fullStr MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title_full_unstemmed MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title_short MARV: a tool for genome-wide multi-phenotype analysis of rare variants
title_sort marv: a tool for genome-wide multi-phenotype analysis of rare variants
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5311849/
https://www.ncbi.nlm.nih.gov/pubmed/28209135
http://dx.doi.org/10.1186/s12859-017-1530-2
work_keys_str_mv AT kaakinenmarika marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT magireedik marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT fischerkrista marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT heikkinenjani marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT jarvelinmarjoriitta marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT morrisandrewp marvatoolforgenomewidemultiphenotypeanalysisofrarevariants
AT prokopenkoinga marvatoolforgenomewidemultiphenotypeanalysisofrarevariants