Cargando…

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping

Motivation: Association analysis is the method of choice for studying complex multifactorial diseases. The premise of this method is that affected persons contain some common genomic regions with similar SNP alleles and such areas will be found in this analysis. An important disadvantage of GWA stud...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bercovici, Sivan, Meek, Christopher, Wexler, Ydo, Geiger, Dan
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2010
Materias:	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881389/ https://www.ncbi.nlm.nih.gov/pubmed/20529903 http://dx.doi.org/10.1093/bioinformatics/btq204

_version_	1782182113016020992
author	Bercovici, Sivan Meek, Christopher Wexler, Ydo Geiger, Dan
author_facet	Bercovici, Sivan Meek, Christopher Wexler, Ydo Geiger, Dan
author_sort	Bercovici, Sivan
collection	PubMed
description	Motivation: Association analysis is the method of choice for studying complex multifactorial diseases. The premise of this method is that affected persons contain some common genomic regions with similar SNP alleles and such areas will be found in this analysis. An important disadvantage of GWA studies is that it does not distinguish between genomic areas that are inherited from a common ancestor [identical by descent (IBD)] and areas that are identical merely by state [identical by state (IBS)]. Clearly, areas that can be marked with higher probability as IBD and have the same correlation with the disease status of identical areas that are more probably only IBS, are better candidates to be causative, and yet this distinction is not encoded in standard association analysis. Results: We develop a factorial hidden Markov model-based algorithm for computing genome-wide IBD sharing. The algorithm accepts as input SNP data of measured individuals and estimates the probability of IBD at each locus for every pair of individuals. For two g-degree relatives, when g≥8, the computation yields a precision of IBD tagging of over 50% higher than previous methods for 95% recall. Our algorithm uses a first-order Markovian model for the linkage disequilibrium process and employs a reduction of the state space of the inheritance vector from being exponential in g to quadratic. The higher accuracy along with the reduced time complexity marks our method as a feasible means for IBD mapping in practical scenarios. Availability: A software implementation, called IBDMAP, is freely available at http://bioinfo.cs.technion.ac.il/IBDmap. Contact: sberco@gmail.com
format	Text
id	pubmed-2881389
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-28813892010-06-08 Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping Bercovici, Sivan Meek, Christopher Wexler, Ydo Geiger, Dan Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Motivation: Association analysis is the method of choice for studying complex multifactorial diseases. The premise of this method is that affected persons contain some common genomic regions with similar SNP alleles and such areas will be found in this analysis. An important disadvantage of GWA studies is that it does not distinguish between genomic areas that are inherited from a common ancestor [identical by descent (IBD)] and areas that are identical merely by state [identical by state (IBS)]. Clearly, areas that can be marked with higher probability as IBD and have the same correlation with the disease status of identical areas that are more probably only IBS, are better candidates to be causative, and yet this distinction is not encoded in standard association analysis. Results: We develop a factorial hidden Markov model-based algorithm for computing genome-wide IBD sharing. The algorithm accepts as input SNP data of measured individuals and estimates the probability of IBD at each locus for every pair of individuals. For two g-degree relatives, when g≥8, the computation yields a precision of IBD tagging of over 50% higher than previous methods for 95% recall. Our algorithm uses a first-order Markovian model for the linkage disequilibrium process and employs a reduction of the state space of the inheritance vector from being exponential in g to quadratic. The higher accuracy along with the reduced time complexity marks our method as a feasible means for IBD mapping in practical scenarios. Availability: A software implementation, called IBDMAP, is freely available at http://bioinfo.cs.technion.ac.il/IBDmap. Contact: sberco@gmail.com Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881389/ /pubmed/20529903 http://dx.doi.org/10.1093/bioinformatics/btq204 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Bercovici, Sivan Meek, Christopher Wexler, Ydo Geiger, Dan Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title	Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title_full	Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title_fullStr	Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title_full_unstemmed	Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title_short	Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping
title_sort	estimating genome-wide ibd sharing from snp data via an efficient hidden markov model of ld with application to gene mapping
topic	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881389/ https://www.ncbi.nlm.nih.gov/pubmed/20529903 http://dx.doi.org/10.1093/bioinformatics/btq204
work_keys_str_mv	AT bercovicisivan estimatinggenomewideibdsharingfromsnpdataviaanefficienthiddenmarkovmodelofldwithapplicationtogenemapping AT meekchristopher estimatinggenomewideibdsharingfromsnpdataviaanefficienthiddenmarkovmodelofldwithapplicationtogenemapping AT wexlerydo estimatinggenomewideibdsharingfromsnpdataviaanefficienthiddenmarkovmodelofldwithapplicationtogenemapping AT geigerdan estimatinggenomewideibdsharingfromsnpdataviaanefficienthiddenmarkovmodelofldwithapplicationtogenemapping

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping

Ejemplares similares