Cargando…

Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset

The human genome, which includes thousands of genes, represents a big data challenge. Rheumatoid arthritis (RA) is a complex autoimmune disease with a genetic basis. Many single-nucleotide polymorphism (SNP) association methods partition a genome into haplotype blocks. The aim of this genome wide as...

Descripción completa

Detalles Bibliográficos
Autores principales: Saad, Mohamed N., Mabrouk, Mai S., Eldeib, Ayman M., Shaker, Olfat G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6403413/
https://www.ncbi.nlm.nih.gov/pubmed/30891314
http://dx.doi.org/10.1016/j.jare.2019.01.006
_version_ 1783400598566076416
author Saad, Mohamed N.
Mabrouk, Mai S.
Eldeib, Ayman M.
Shaker, Olfat G.
author_facet Saad, Mohamed N.
Mabrouk, Mai S.
Eldeib, Ayman M.
Shaker, Olfat G.
author_sort Saad, Mohamed N.
collection PubMed
description The human genome, which includes thousands of genes, represents a big data challenge. Rheumatoid arthritis (RA) is a complex autoimmune disease with a genetic basis. Many single-nucleotide polymorphism (SNP) association methods partition a genome into haplotype blocks. The aim of this genome wide association study (GWAS) was to select the most appropriate haplotype block partitioning method for the North American Rheumatoid Arthritis Consortium (NARAC) dataset. The methods used for the NARAC dataset were the individual SNP approach and the following haplotype block methods: the four-gamete test (FGT), confidence interval test (CIT), and solid spine of linkage disequilibrium (SSLD). The measured parameters that reflect the strength of the association between the biomarker and RA were the P-value after Bonferroni correction and other parameters used to compare the output of each haplotype block method. This work presents a comparison among the individual SNP approach and the three haplotype block methods to select the method that can detect all the significant SNPs when applied alone. The GWAS results from the NARAC dataset obtained with the different methods are presented. The individual SNP, CIT, FGT, and SSLD methods detected 541, 1516, 1551, and 1831 RA-associated SNPs respectively, and the individual SNP, FGT, CIT, and SSLD methods detected 65, 156, 159, and 450 significant SNPs respectively, that were not detected by the other methods. Three hundred eighty-three SNPs were discovered by the haplotype block methods and the individual SNP approach, while 1021 SNPs were discovered by all three haplotype block methods. The 383 SNPs detected by all the methods are promising candidates for studying RA susceptibility. A hybrid technique involving all four methods should be applied to detect the significant SNPs associated with RA in the NARAC dataset, but the SSLD method may be preferred because of its advantages when only one method was used.
format Online
Article
Text
id pubmed-6403413
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-64034132019-03-19 Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset Saad, Mohamed N. Mabrouk, Mai S. Eldeib, Ayman M. Shaker, Olfat G. J Adv Res Original Article The human genome, which includes thousands of genes, represents a big data challenge. Rheumatoid arthritis (RA) is a complex autoimmune disease with a genetic basis. Many single-nucleotide polymorphism (SNP) association methods partition a genome into haplotype blocks. The aim of this genome wide association study (GWAS) was to select the most appropriate haplotype block partitioning method for the North American Rheumatoid Arthritis Consortium (NARAC) dataset. The methods used for the NARAC dataset were the individual SNP approach and the following haplotype block methods: the four-gamete test (FGT), confidence interval test (CIT), and solid spine of linkage disequilibrium (SSLD). The measured parameters that reflect the strength of the association between the biomarker and RA were the P-value after Bonferroni correction and other parameters used to compare the output of each haplotype block method. This work presents a comparison among the individual SNP approach and the three haplotype block methods to select the method that can detect all the significant SNPs when applied alone. The GWAS results from the NARAC dataset obtained with the different methods are presented. The individual SNP, CIT, FGT, and SSLD methods detected 541, 1516, 1551, and 1831 RA-associated SNPs respectively, and the individual SNP, FGT, CIT, and SSLD methods detected 65, 156, 159, and 450 significant SNPs respectively, that were not detected by the other methods. Three hundred eighty-three SNPs were discovered by the haplotype block methods and the individual SNP approach, while 1021 SNPs were discovered by all three haplotype block methods. The 383 SNPs detected by all the methods are promising candidates for studying RA susceptibility. A hybrid technique involving all four methods should be applied to detect the significant SNPs associated with RA in the NARAC dataset, but the SSLD method may be preferred because of its advantages when only one method was used. Elsevier 2019-01-18 /pmc/articles/PMC6403413/ /pubmed/30891314 http://dx.doi.org/10.1016/j.jare.2019.01.006 Text en © 2019 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Saad, Mohamed N.
Mabrouk, Mai S.
Eldeib, Ayman M.
Shaker, Olfat G.
Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title_full Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title_fullStr Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title_full_unstemmed Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title_short Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset
title_sort studying the effects of haplotype partitioning methods on the ra-associated genomic results from the north american rheumatoid arthritis consortium (narac) dataset
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6403413/
https://www.ncbi.nlm.nih.gov/pubmed/30891314
http://dx.doi.org/10.1016/j.jare.2019.01.006
work_keys_str_mv AT saadmohamedn studyingtheeffectsofhaplotypepartitioningmethodsontheraassociatedgenomicresultsfromthenorthamericanrheumatoidarthritisconsortiumnaracdataset
AT mabroukmais studyingtheeffectsofhaplotypepartitioningmethodsontheraassociatedgenomicresultsfromthenorthamericanrheumatoidarthritisconsortiumnaracdataset
AT eldeibaymanm studyingtheeffectsofhaplotypepartitioningmethodsontheraassociatedgenomicresultsfromthenorthamericanrheumatoidarthritisconsortiumnaracdataset
AT shakerolfatg studyingtheeffectsofhaplotypepartitioningmethodsontheraassociatedgenomicresultsfromthenorthamericanrheumatoidarthritisconsortiumnaracdataset