Cargando…
A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures
Despite the benefits of quantitative data generated by massively parallel sequencing, resolving mitotypes from mixtures occurring in certain ratios remains challenging. In this study, a bioinformatic mixture deconvolution method centered on population-based phasing was developed and validated. The m...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909279/ https://www.ncbi.nlm.nih.gov/pubmed/33498312 http://dx.doi.org/10.3390/genes12020128 |
_version_ | 1783655898666762240 |
---|---|
author | Smart, Utpal Cihlar, Jennifer Churchill Mandape, Sammed N. Muenzler, Melissa King, Jonathan L. Budowle, Bruce Woerner, August E. |
author_facet | Smart, Utpal Cihlar, Jennifer Churchill Mandape, Sammed N. Muenzler, Melissa King, Jonathan L. Budowle, Bruce Woerner, August E. |
author_sort | Smart, Utpal |
collection | PubMed |
description | Despite the benefits of quantitative data generated by massively parallel sequencing, resolving mitotypes from mixtures occurring in certain ratios remains challenging. In this study, a bioinformatic mixture deconvolution method centered on population-based phasing was developed and validated. The method was first tested on 270 in silico two-person mixtures varying in mixture proportions. An assortment of external reference panels containing information on haplotypic variation (from similar and different haplogroups) was leveraged to assess the effect of panel composition on phasing accuracy. Building on these simulations, mitochondrial genomes from the Human Mitochondrial DataBase were sourced to populate the panels and key parameter values were identified by deconvolving an additional 7290 in silico two-person mixtures. Finally, employing an optimized reference panel and phasing parameters, the approach was validated with in vitro two-person mixtures with differing proportions. Deconvolution was most accurate when the haplotypes in the mixture were similar to haplotypes present in the reference panel and when the mixture ratios were neither highly imbalanced nor subequal (e.g., 4:1). Overall, errors in haplotype estimation were largely bounded by the accuracy of the mixture’s genotype results. The proposed framework is the first available approach that automates the reconstruction of complete individual mitotypes from mixtures, even in ratios that have traditionally been considered problematic. |
format | Online Article Text |
id | pubmed-7909279 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79092792021-02-27 A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures Smart, Utpal Cihlar, Jennifer Churchill Mandape, Sammed N. Muenzler, Melissa King, Jonathan L. Budowle, Bruce Woerner, August E. Genes (Basel) Article Despite the benefits of quantitative data generated by massively parallel sequencing, resolving mitotypes from mixtures occurring in certain ratios remains challenging. In this study, a bioinformatic mixture deconvolution method centered on population-based phasing was developed and validated. The method was first tested on 270 in silico two-person mixtures varying in mixture proportions. An assortment of external reference panels containing information on haplotypic variation (from similar and different haplogroups) was leveraged to assess the effect of panel composition on phasing accuracy. Building on these simulations, mitochondrial genomes from the Human Mitochondrial DataBase were sourced to populate the panels and key parameter values were identified by deconvolving an additional 7290 in silico two-person mixtures. Finally, employing an optimized reference panel and phasing parameters, the approach was validated with in vitro two-person mixtures with differing proportions. Deconvolution was most accurate when the haplotypes in the mixture were similar to haplotypes present in the reference panel and when the mixture ratios were neither highly imbalanced nor subequal (e.g., 4:1). Overall, errors in haplotype estimation were largely bounded by the accuracy of the mixture’s genotype results. The proposed framework is the first available approach that automates the reconstruction of complete individual mitotypes from mixtures, even in ratios that have traditionally been considered problematic. MDPI 2021-01-20 /pmc/articles/PMC7909279/ /pubmed/33498312 http://dx.doi.org/10.3390/genes12020128 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Smart, Utpal Cihlar, Jennifer Churchill Mandape, Sammed N. Muenzler, Melissa King, Jonathan L. Budowle, Bruce Woerner, August E. A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title | A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title_full | A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title_fullStr | A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title_full_unstemmed | A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title_short | A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures |
title_sort | continuous statistical phasing framework for the analysis of forensic mitochondrial dna mixtures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909279/ https://www.ncbi.nlm.nih.gov/pubmed/33498312 http://dx.doi.org/10.3390/genes12020128 |
work_keys_str_mv | AT smartutpal acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT cihlarjenniferchurchill acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT mandapesammedn acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT muenzlermelissa acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT kingjonathanl acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT budowlebruce acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT woernerauguste acontinuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT smartutpal continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT cihlarjenniferchurchill continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT mandapesammedn continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT muenzlermelissa continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT kingjonathanl continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT budowlebruce continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures AT woernerauguste continuousstatisticalphasingframeworkfortheanalysisofforensicmitochondrialdnamixtures |