Cargando…
A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data
MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient tech...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408770/ https://www.ncbi.nlm.nih.gov/pubmed/27565584 http://dx.doi.org/10.1093/bioinformatics/btw550 |
_version_ | 1783232359968014336 |
---|---|
author | Heinrich, Verena Kamphans, Tom Mundlos, Stefan Robinson, Peter N Krawitz, Peter M |
author_facet | Heinrich, Verena Kamphans, Tom Mundlos, Stefan Robinson, Peter N Krawitz, Peter M |
author_sort | Heinrich, Verena |
collection | PubMed |
description | MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient techniques that reduce the genomic search space play a decisive role. Often segregation- or linkage analysis are used to prioritize candidates, however, these approaches require correct information about the degree of relationship among the sequenced samples. For quality assurance an automated control of pedigree structures and sample assignment is therefore highly desirable in order to detect label mix-ups that might otherwise corrupt downstream analysis. RESULTS: We developed an algorithm based on likelihood ratios that discriminates between different classes of relationship for an arbitrary number of genotyped samples. By identifying the most likely class we are able to reconstruct entire pedigrees iteratively, even for highly consanguineous families. We tested our approach on exome data of different sequencing studies and achieved high precision for all pedigree predictions. By analyzing the precision for varying degrees of relatedness or inbreeding we could show that a prediction is robust down to magnitudes of a few hundred loci. AVAILABILITY AND IMPLEMENTATION: A java standalone application that computes the relationships between multiple samples as well as a Rscript that visualizes the pedigree information is available for download as well as a web service at www.gene-talk.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5408770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54087702017-05-03 A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data Heinrich, Verena Kamphans, Tom Mundlos, Stefan Robinson, Peter N Krawitz, Peter M Bioinformatics Original Papers MOTIVATION: Next generation sequencing technology considerably changed the way we screen for pathogenic mutations in rare Mendelian disorders. However, the identification of the disease-causing mutation amongst thousands of variants of partly unknown relevance is still challenging and efficient techniques that reduce the genomic search space play a decisive role. Often segregation- or linkage analysis are used to prioritize candidates, however, these approaches require correct information about the degree of relationship among the sequenced samples. For quality assurance an automated control of pedigree structures and sample assignment is therefore highly desirable in order to detect label mix-ups that might otherwise corrupt downstream analysis. RESULTS: We developed an algorithm based on likelihood ratios that discriminates between different classes of relationship for an arbitrary number of genotyped samples. By identifying the most likely class we are able to reconstruct entire pedigrees iteratively, even for highly consanguineous families. We tested our approach on exome data of different sequencing studies and achieved high precision for all pedigree predictions. By analyzing the precision for varying degrees of relatedness or inbreeding we could show that a prediction is robust down to magnitudes of a few hundred loci. AVAILABILITY AND IMPLEMENTATION: A java standalone application that computes the relationships between multiple samples as well as a Rscript that visualizes the pedigree information is available for download as well as a web service at www.gene-talk.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-01-01 2016-08-26 /pmc/articles/PMC5408770/ /pubmed/27565584 http://dx.doi.org/10.1093/bioinformatics/btw550 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Heinrich, Verena Kamphans, Tom Mundlos, Stefan Robinson, Peter N Krawitz, Peter M A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title | A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title_full | A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title_fullStr | A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title_full_unstemmed | A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title_short | A likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
title_sort | likelihood ratio-based method to predict exact pedigrees for complex families from next-generation sequencing data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408770/ https://www.ncbi.nlm.nih.gov/pubmed/27565584 http://dx.doi.org/10.1093/bioinformatics/btw550 |
work_keys_str_mv | AT heinrichverena alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT kamphanstom alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT mundlosstefan alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT robinsonpetern alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT krawitzpeterm alikelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT heinrichverena likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT kamphanstom likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT mundlosstefan likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT robinsonpetern likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata AT krawitzpeterm likelihoodratiobasedmethodtopredictexactpedigreesforcomplexfamiliesfromnextgenerationsequencingdata |