Cargando…

A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize

BACKGROUND: Haplotypes combine the effects of several single nucleotide polymorphisms (SNPs) with high linkage disequilibrium, which benefit the genome-wide association analysis (GWAS). In the haplotype association analysis, both haplotype alleles and blocks are tested. Haplotype alleles can be infe...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Heli, Hao, Zhiyu, Zhao, Yunfeng, Yang, Runqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014697/
https://www.ncbi.nlm.nih.gov/pubmed/32046650
http://dx.doi.org/10.1186/s12864-020-6552-x
_version_ 1783496688817668096
author Chen, Heli
Hao, Zhiyu
Zhao, Yunfeng
Yang, Runqing
author_facet Chen, Heli
Hao, Zhiyu
Zhao, Yunfeng
Yang, Runqing
author_sort Chen, Heli
collection PubMed
description BACKGROUND: Haplotypes combine the effects of several single nucleotide polymorphisms (SNPs) with high linkage disequilibrium, which benefit the genome-wide association analysis (GWAS). In the haplotype association analysis, both haplotype alleles and blocks are tested. Haplotype alleles can be inferred with the same statistics as SNPs in the linear mixed model, while blocks require the formulation of unified statistics to fit different genetic units, such as SNPs, haplotypes, and copy number variations. RESULTS: Based on the FaST-LMM, the fastLmPure function in the R/RcppArmadillo package has been introduced to speed up genome-wide regression scans by a re-weighted least square estimation. When large or highly significant blocks are tested based on EMMAX, the genome-wide haplotype association analysis takes only one to two rounds of genome-wide regression scans. With a genomic dataset of 541,595 SNPs from 513 maize inbred lines, 90,770 haplotype blocks were constructed across the whole genome, and three types of markers (SNPs, haplotype alleles, and haplotype blocks) were genome-widely associated with 17 agronomic traits in maize using the software developed here. CONCLUSIONS: Two SNPs were identified for LNAE, four haplotype alleles for TMAL, LNAE, CD, and DTH, and only three blocks reached the significant level for TMAL, CD, and KNPR. Compared to the R/lm function, the computational time was reduced by ~ 10–15 times.
format Online
Article
Text
id pubmed-7014697
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70146972020-02-18 A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize Chen, Heli Hao, Zhiyu Zhao, Yunfeng Yang, Runqing BMC Genomics Research Article BACKGROUND: Haplotypes combine the effects of several single nucleotide polymorphisms (SNPs) with high linkage disequilibrium, which benefit the genome-wide association analysis (GWAS). In the haplotype association analysis, both haplotype alleles and blocks are tested. Haplotype alleles can be inferred with the same statistics as SNPs in the linear mixed model, while blocks require the formulation of unified statistics to fit different genetic units, such as SNPs, haplotypes, and copy number variations. RESULTS: Based on the FaST-LMM, the fastLmPure function in the R/RcppArmadillo package has been introduced to speed up genome-wide regression scans by a re-weighted least square estimation. When large or highly significant blocks are tested based on EMMAX, the genome-wide haplotype association analysis takes only one to two rounds of genome-wide regression scans. With a genomic dataset of 541,595 SNPs from 513 maize inbred lines, 90,770 haplotype blocks were constructed across the whole genome, and three types of markers (SNPs, haplotype alleles, and haplotype blocks) were genome-widely associated with 17 agronomic traits in maize using the software developed here. CONCLUSIONS: Two SNPs were identified for LNAE, four haplotype alleles for TMAL, LNAE, CD, and DTH, and only three blocks reached the significant level for TMAL, CD, and KNPR. Compared to the R/lm function, the computational time was reduced by ~ 10–15 times. BioMed Central 2020-02-11 /pmc/articles/PMC7014697/ /pubmed/32046650 http://dx.doi.org/10.1186/s12864-020-6552-x Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Chen, Heli
Hao, Zhiyu
Zhao, Yunfeng
Yang, Runqing
A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title_full A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title_fullStr A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title_full_unstemmed A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title_short A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
title_sort fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014697/
https://www.ncbi.nlm.nih.gov/pubmed/32046650
http://dx.doi.org/10.1186/s12864-020-6552-x
work_keys_str_mv AT chenheli afastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT haozhiyu afastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT zhaoyunfeng afastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT yangrunqing afastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT chenheli fastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT haozhiyu fastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT zhaoyunfeng fastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize
AT yangrunqing fastlinearmixedmodelforgenomewidehaplotypeassociationanalysisapplicationtoagronomictraitsinmaize