Cargando…

NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele

Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all po...

Descripción completa

Detalles Bibliográficos
Autores principales: Maude, Hannah, Davidson, Mira, Charitakis, Natalie, Diaz, Leo, Bowers, William H. T., Gradovich, Eva, Andrew, Toby, Huntley, Derek
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773831/
https://www.ncbi.nlm.nih.gov/pubmed/31612134
http://dx.doi.org/10.3389/fcell.2019.00201
_version_ 1783455964540698624
author Maude, Hannah
Davidson, Mira
Charitakis, Natalie
Diaz, Leo
Bowers, William H. T.
Gradovich, Eva
Andrew, Toby
Huntley, Derek
author_facet Maude, Hannah
Davidson, Mira
Charitakis, Natalie
Diaz, Leo
Bowers, William H. T.
Gradovich, Eva
Andrew, Toby
Huntley, Derek
author_sort Maude, Hannah
collection PubMed
description Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all potential mitochondrial variants on alignment accuracy and variant calling. A total of 49,707 possible mutations were introduced across the 16,569 bp reference mitochondrial genome (16,569 × 3 alternative alleles), one variant at-at-time. The resulting in silico fragmentation and alignment to the entire reference genome (GRCh38) revealed preferential mapping of mutated mitochondrial fragments to nuclear loci, as variants increased loci similarity to nuMTs, for a total of 807, 362, and 41 variants at 333, 144, and 27 positions when using 100, 150, and 300 bp single-end fragments. We subsequently modeled these affected variants at 50% heteroplasmy and carried out variant calling, observing bias in the reported allele frequencies in favor of the reference allele. Four variants (chrM:6023A, chrM:4456T, chrM:5147A, and chrM:7521A) including a possible hypertension factor, chrM:4456T, caused 100% loss of coverage at the mutated position (with all 100 bp single-end fragments aligning to homologous, nuclear positions instead of chrM), rendering these variants undetectable when aligning to the entire reference genome. Furthermore, four mitochondrial variants reported to be pathogenic were found to cause significant loss of coverage and select haplogroup-defining SNPs were shown to exacerbate the loss of coverage caused by surrounding variants. Increased fragment length and use of paired-end reads both improved alignment accuracy.
format Online
Article
Text
id pubmed-6773831
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67738312019-10-14 NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele Maude, Hannah Davidson, Mira Charitakis, Natalie Diaz, Leo Bowers, William H. T. Gradovich, Eva Andrew, Toby Huntley, Derek Front Cell Dev Biol Cell and Developmental Biology Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic in silico approach, we here report the impact of all potential mitochondrial variants on alignment accuracy and variant calling. A total of 49,707 possible mutations were introduced across the 16,569 bp reference mitochondrial genome (16,569 × 3 alternative alleles), one variant at-at-time. The resulting in silico fragmentation and alignment to the entire reference genome (GRCh38) revealed preferential mapping of mutated mitochondrial fragments to nuclear loci, as variants increased loci similarity to nuMTs, for a total of 807, 362, and 41 variants at 333, 144, and 27 positions when using 100, 150, and 300 bp single-end fragments. We subsequently modeled these affected variants at 50% heteroplasmy and carried out variant calling, observing bias in the reported allele frequencies in favor of the reference allele. Four variants (chrM:6023A, chrM:4456T, chrM:5147A, and chrM:7521A) including a possible hypertension factor, chrM:4456T, caused 100% loss of coverage at the mutated position (with all 100 bp single-end fragments aligning to homologous, nuclear positions instead of chrM), rendering these variants undetectable when aligning to the entire reference genome. Furthermore, four mitochondrial variants reported to be pathogenic were found to cause significant loss of coverage and select haplogroup-defining SNPs were shown to exacerbate the loss of coverage caused by surrounding variants. Increased fragment length and use of paired-end reads both improved alignment accuracy. Frontiers Media S.A. 2019-09-25 /pmc/articles/PMC6773831/ /pubmed/31612134 http://dx.doi.org/10.3389/fcell.2019.00201 Text en Copyright © 2019 Maude, Davidson, Charitakis, Diaz, Bowers, Gradovich, Andrew and Huntley. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Maude, Hannah
Davidson, Mira
Charitakis, Natalie
Diaz, Leo
Bowers, William H. T.
Gradovich, Eva
Andrew, Toby
Huntley, Derek
NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title_full NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title_fullStr NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title_full_unstemmed NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title_short NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
title_sort numt confounding biases mitochondrial heteroplasmy calls in favor of the reference allele
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773831/
https://www.ncbi.nlm.nih.gov/pubmed/31612134
http://dx.doi.org/10.3389/fcell.2019.00201
work_keys_str_mv AT maudehannah numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT davidsonmira numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT charitakisnatalie numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT diazleo numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT bowerswilliamht numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT gradovicheva numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT andrewtoby numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele
AT huntleyderek numtconfoundingbiasesmitochondrialheteroplasmycallsinfavorofthereferenceallele