Cargando…

A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data

MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient tech...

Descripción completa

Detalles Bibliográficos
Autores principales: Heinrich, Verena, Kamphans, Tom, Mundlos, Stefan, Robinson, Peter N, Krawitz, Peter M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408770/
https://www.ncbi.nlm.nih.gov/pubmed/27565584
http://dx.doi.org/10.1093/bioinformatics/btw550
_version_ 1783232359968014336
author Heinrich, Verena
Kamphans, Tom
Mundlos, Stefan
Robinson, Peter N
Krawitz, Peter M
author_facet Heinrich, Verena
Kamphans, Tom
Mundlos, Stefan
Robinson, Peter N
Krawitz, Peter M
author_sort Heinrich, Verena
collection PubMed
description MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient techniques that reduce the genomic search space play a decisive role. Often segregation- or linkage analysis are used to prioritize candidates, however, these approaches require correct information about the degree of relationship among the sequenced samples. For quality assurance an automated control of pedigree structures and sample assignment is therefore highly desirable in order to detect label mix-ups that might otherwise corrupt downstream analysis. RESULTS: We developed an algorithm based on likelihood ratios that discriminates between different classes of relationship for an arbitrary number of genotyped samples. By identifying the most likely class we are able to reconstruct entire pedigrees iteratively, even for highly consanguineous families. We tested our approach on exome data of different sequencing studies and achieved high precision for all pedigree predictions. By analyzing the precision for varying degrees of relatedness or inbreeding we could show that a prediction is robust down to magnitudes of a few hundred loci. AVAILABILITY AND IMPLEMENTATION: A java standalone application that computes the relationships between multiple samples as well as a Rscript that visualizes the pedigree information is available for download as well as a web service at www.gene-talk.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5408770
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54087702017-05-03 A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data Heinrich, Verena Kamphans, Tom Mundlos, Stefan Robinson, Peter N Krawitz, Peter M Bioinformatics Original Papers MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient techniques that reduce the genomic search space play a decisive role. Often segregation- or linkage analysis are used to prioritize candidates, however, these approaches require correct information about the degree of relationship among the sequenced samples. For quality assurance an automated control of pedigree structures and sample assignment is therefore highly desirable in order to detect label mix-ups that might otherwise corrupt downstream analysis. RESULTS: We developed an algorithm based on likelihood ratios that discriminates between different classes of relationship for an arbitrary number of genotyped samples. By identifying the most likely class we are able to reconstruct entire pedigrees iteratively, even for highly consanguineous families. We tested our approach on exome data of different sequencing studies and achieved high precision for all pedigree predictions. By analyzing the precision for varying degrees of relatedness or inbreeding we could show that a prediction is robust down to magnitudes of a few hundred loci. AVAILABILITY AND IMPLEMENTATION: A java standalone application that computes the relationships between multiple samples as well as a Rscript that visualizes the pedigree information is available for download as well as a web service at www.gene-talk.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-01-01 2016-08-26 /pmc/articles/PMC5408770/ /pubmed/27565584 http://dx.doi.org/10.1093/bioinformatics/btw550 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Heinrich, Verena
Kamphans, Tom
Mundlos, Stefan
Robinson, Peter N
Krawitz, Peter M
A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title_full A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title_fullStr A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title_full_unstemmed A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title_short A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
title_sort likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408770/
https://www.ncbi.nlm.nih.gov/pubmed/27565584
http://dx.doi.org/10.1093/bioinformatics/btw550
work_keys_str_mv AT heinrichverena alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT kamphanstom alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT mundlosstefan alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT robinsonpetern alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT krawitzpeterm alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT heinrichverena likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT kamphanstom likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT mundlosstefan likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT robinsonpetern likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata
AT krawitzpeterm likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata