Cargando…

Data modeling as a main source of discrepancies in single and multiple marker association methods

BACKGROUND: Genome-wide association studies have successfully identified several loci underlying complex diseases in humans. The development of high density SNP maps in domestic animal species should allow the detection of QTLs for economically important traits through association studies with much...

Descripción completa

Detalles Bibliográficos
Autores principales: Ledur, Mônica Corrêa, Navarro, Nicolas, Pérez-Enciso, Miguel
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654503/
https://www.ncbi.nlm.nih.gov/pubmed/19278548
_version_ 1782165376942997504
author Ledur, Mônica Corrêa
Navarro, Nicolas
Pérez-Enciso, Miguel
author_facet Ledur, Mônica Corrêa
Navarro, Nicolas
Pérez-Enciso, Miguel
author_sort Ledur, Mônica Corrêa
collection PubMed
description BACKGROUND: Genome-wide association studies have successfully identified several loci underlying complex diseases in humans. The development of high density SNP maps in domestic animal species should allow the detection of QTLs for economically important traits through association studies with much higher accuracy than traditional linkage analysis. Here we report the association analysis of the dataset simulated for the XII QTL-MAS meeting (Uppsala). We used two strategies, single marker association and haplotype-based association (Blossoc) that were applied to i) the raw data, and ii) the data corrected for infinitesimal, sex and generation effects. RESULTS: Both methods performed similarly in detecting the most strongly associated SNPs, about ten loci in total. The most significant ones were located in chromosomes 1, 4 and 5. Overall, the largest differences were found between corrected and raw data, rather than between single and multiple marker analysis. The use of raw data increased greatly the number of significant loci, but possibly also the rate of false positives. Bootstrap model aggregation removed most of discrepancies between adjusted and raw data when SMA was employed. CONCLUSION: Model choice should be carefully considered in genome-wide association studies.
format Text
id pubmed-2654503
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26545032009-03-13 Data modeling as a main source of discrepancies in single and multiple marker association methods Ledur, Mônica Corrêa Navarro, Nicolas Pérez-Enciso, Miguel BMC Proc Proceedings BACKGROUND: Genome-wide association studies have successfully identified several loci underlying complex diseases in humans. The development of high density SNP maps in domestic animal species should allow the detection of QTLs for economically important traits through association studies with much higher accuracy than traditional linkage analysis. Here we report the association analysis of the dataset simulated for the XII QTL-MAS meeting (Uppsala). We used two strategies, single marker association and haplotype-based association (Blossoc) that were applied to i) the raw data, and ii) the data corrected for infinitesimal, sex and generation effects. RESULTS: Both methods performed similarly in detecting the most strongly associated SNPs, about ten loci in total. The most significant ones were located in chromosomes 1, 4 and 5. Overall, the largest differences were found between corrected and raw data, rather than between single and multiple marker analysis. The use of raw data increased greatly the number of significant loci, but possibly also the rate of false positives. Bootstrap model aggregation removed most of discrepancies between adjusted and raw data when SMA was employed. CONCLUSION: Model choice should be carefully considered in genome-wide association studies. BioMed Central 2009-02-23 /pmc/articles/PMC2654503/ /pubmed/19278548 Text en Copyright © 2009 Ledur et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Ledur, Mônica Corrêa
Navarro, Nicolas
Pérez-Enciso, Miguel
Data modeling as a main source of discrepancies in single and multiple marker association methods
title Data modeling as a main source of discrepancies in single and multiple marker association methods
title_full Data modeling as a main source of discrepancies in single and multiple marker association methods
title_fullStr Data modeling as a main source of discrepancies in single and multiple marker association methods
title_full_unstemmed Data modeling as a main source of discrepancies in single and multiple marker association methods
title_short Data modeling as a main source of discrepancies in single and multiple marker association methods
title_sort data modeling as a main source of discrepancies in single and multiple marker association methods
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654503/
https://www.ncbi.nlm.nih.gov/pubmed/19278548
work_keys_str_mv AT ledurmonicacorrea datamodelingasamainsourceofdiscrepanciesinsingleandmultiplemarkerassociationmethods
AT navarronicolas datamodelingasamainsourceofdiscrepanciesinsingleandmultiplemarkerassociationmethods
AT perezencisomiguel datamodelingasamainsourceofdiscrepanciesinsingleandmultiplemarkerassociationmethods