Cargando…
NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all po...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773831/ https://www.ncbi.nlm.nih.gov/pubmed/31612134 http://dx.doi.org/10.3389/fcell.2019.00201 |
_version_ | 1783455964540698624 |
---|---|
author | Maude, Hannah Davidson, Mira Charitakis, Natalie Diaz, Leo Bowers, William H. T. Gradovich, Eva Andrew, Toby Huntley, Derek |
author_facet | Maude, Hannah Davidson, Mira Charitakis, Natalie Diaz, Leo Bowers, William H. T. Gradovich, Eva Andrew, Toby Huntley, Derek |
author_sort | Maude, Hannah |
collection | PubMed |
description | Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all potential mitochondrial variants on alignment accuracy and variant calling. A total of 49,707 possible mutations were introduced across the 16,569 bp reference mitochondrial genome (16,569 × 3 alternative alleles), one variant at-at-time. The resulting in silico fragmentation and alignment to the entire reference genome (GRCh38) revealed preferential mapping of mutated mitochondrial fragments to nuclear loci, as variants increased loci similarity to nuMTs, for a total of 807, 362, and 41 variants at 333, 144, and 27 positions when using 100, 150, and 300 bp single-end fragments. We subsequently modeled these affected variants at 50% heteroplasmy and carried out variant calling, observing bias in the reported allele frequencies in favor of the reference allele. Four variants (chrM:6023A, chrM:4456T, chrM:5147A, and chrM:7521A) including a possible hypertension factor, chrM:4456T, caused 100% loss of coverage at the mutated position (with all 100 bp single-end fragments aligning to homologous, nuclear positions instead of chrM), rendering these variants undetectable when aligning to the entire reference genome. Furthermore, four mitochondrial variants reported to be pathogenic were found to cause significant loss of coverage and select haplogroup-defining SNPs were shown to exacerbate the loss of coverage caused by surrounding variants. Increased fragment length and use of paired-end reads both improved alignment accuracy. |
format | Online Article Text |
id | pubmed-6773831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-67738312019-10-14 NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele Maude, Hannah Davidson, Mira Charitakis, Natalie Diaz, Leo Bowers, William H. T. Gradovich, Eva Andrew, Toby Huntley, Derek Front Cell Dev Biol Cell and Developmental Biology Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all potential mitochondrial variants on alignment accuracy and variant calling. A total of 49,707 possible mutations were introduced across the 16,569 bp reference mitochondrial genome (16,569 × 3 alternative alleles), one variant at-at-time. The resulting in silico fragmentation and alignment to the entire reference genome (GRCh38) revealed preferential mapping of mutated mitochondrial fragments to nuclear loci, as variants increased loci similarity to nuMTs, for a total of 807, 362, and 41 variants at 333, 144, and 27 positions when using 100, 150, and 300 bp single-end fragments. We subsequently modeled these affected variants at 50% heteroplasmy and carried out variant calling, observing bias in the reported allele frequencies in favor of the reference allele. Four variants (chrM:6023A, chrM:4456T, chrM:5147A, and chrM:7521A) including a possible hypertension factor, chrM:4456T, caused 100% loss of coverage at the mutated position (with all 100 bp single-end fragments aligning to homologous, nuclear positions instead of chrM), rendering these variants undetectable when aligning to the entire reference genome. Furthermore, four mitochondrial variants reported to be pathogenic were found to cause significant loss of coverage and select haplogroup-defining SNPs were shown to exacerbate the loss of coverage caused by surrounding variants. Increased fragment length and use of paired-end reads both improved alignment accuracy. Frontiers Media S.A. 2019-09-25 /pmc/articles/PMC6773831/ /pubmed/31612134 http://dx.doi.org/10.3389/fcell.2019.00201 Text en Copyright © 2019 Maude, Davidson, Charitakis, Diaz, Bowers, Gradovich, Andrew and Huntley. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Maude, Hannah Davidson, Mira Charitakis, Natalie Diaz, Leo Bowers, William H. T. Gradovich, Eva Andrew, Toby Huntley, Derek NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title | NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title_full | NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title_fullStr | NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title_full_unstemmed | NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title_short | NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele |
title_sort | numt confounding biases mitochondrial heteroplasmy calls in favor of the reference allele |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773831/ https://www.ncbi.nlm.nih.gov/pubmed/31612134 http://dx.doi.org/10.3389/fcell.2019.00201 |
work_keys_str_mv | AT maudehannah numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT davidsonmira numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT charitakisnatalie numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT diazleo numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT bowerswilliamht numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT gradovicheva numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT andrewtoby numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele AT huntleyderek numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele |