Cargando…

The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome

BACKGROUND: Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification...

Descripción completa

Detalles Bibliográficos
Autores principales: Albayrak, Levent, Khanipov, Kamil, Pimenova, Maria, Golovko, George, Rojas, Mark, Pavlidis, Ioannis, Chumakov, Sergei, Aguilar, Gerardo, Chávez, Arturo, Widger, William R., Fofanov, Yuriy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5153897/
https://www.ncbi.nlm.nih.gov/pubmed/27955616
http://dx.doi.org/10.1186/s12864-016-3375-x
_version_ 1782474787030827008
author Albayrak, Levent
Khanipov, Kamil
Pimenova, Maria
Golovko, George
Rojas, Mark
Pavlidis, Ioannis
Chumakov, Sergei
Aguilar, Gerardo
Chávez, Arturo
Widger, William R.
Fofanov, Yuriy
author_facet Albayrak, Levent
Khanipov, Kamil
Pimenova, Maria
Golovko, George
Rojas, Mark
Pavlidis, Ioannis
Chumakov, Sergei
Aguilar, Gerardo
Chávez, Arturo
Widger, William R.
Fofanov, Yuriy
author_sort Albayrak, Levent
collection PubMed
description BACKGROUND: Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA). RESULTS: Performed analysis revealed up to a 25-fold variation in the lengths of longest common and longest similar (one mismatch allowed) subsequences across the mitochondrial genome. The size of the longest subsequences shared between nDNA and mtDNA in several regions of the mitochondrial genome were found to be as low as 11 bases, which not only allows using these regions to design new, very specific PCR primers, but also supports the hypothesis of the non-random introduction of mtDNA into the human nuclear DNA. CONCLUSION: Analysis of the mitochondrial locations of the subsequences shared between nDNA and mtDNA suggested that even very short (36 bases) single-end sequencing reads can be used to identify low-abundance variation in 20.4% of the mitochondrial genome. For longer (76 and 150 bases) reads, the proportion of the mitochondrial genome where nDNA presence will not interfere found to be 44.5 and 67.9%, when low-abundance mutations at 100% of locations can be identified using 417 bases long single reads. This observation suggests that the analysis of low-abundance variations in mitochondria population can be extended to a variety of large data collections such as NCBI Sequence Read Archive, European Nucleotide Archive, The Cancer Genome Atlas, and International Cancer Genome Consortium. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3375-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5153897
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51538972016-12-20 The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome Albayrak, Levent Khanipov, Kamil Pimenova, Maria Golovko, George Rojas, Mark Pavlidis, Ioannis Chumakov, Sergei Aguilar, Gerardo Chávez, Arturo Widger, William R. Fofanov, Yuriy BMC Genomics Research Article BACKGROUND: Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA). RESULTS: Performed analysis revealed up to a 25-fold variation in the lengths of longest common and longest similar (one mismatch allowed) subsequences across the mitochondrial genome. The size of the longest subsequences shared between nDNA and mtDNA in several regions of the mitochondrial genome were found to be as low as 11 bases, which not only allows using these regions to design new, very specific PCR primers, but also supports the hypothesis of the non-random introduction of mtDNA into the human nuclear DNA. CONCLUSION: Analysis of the mitochondrial locations of the subsequences shared between nDNA and mtDNA suggested that even very short (36 bases) single-end sequencing reads can be used to identify low-abundance variation in 20.4% of the mitochondrial genome. For longer (76 and 150 bases) reads, the proportion of the mitochondrial genome where nDNA presence will not interfere found to be 44.5 and 67.9%, when low-abundance mutations at 100% of locations can be identified using 417 bases long single reads. This observation suggests that the analysis of low-abundance variations in mitochondria population can be extended to a variety of large data collections such as NCBI Sequence Read Archive, European Nucleotide Archive, The Cancer Genome Atlas, and International Cancer Genome Consortium. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3375-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-12-12 /pmc/articles/PMC5153897/ /pubmed/27955616 http://dx.doi.org/10.1186/s12864-016-3375-x Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Albayrak, Levent
Khanipov, Kamil
Pimenova, Maria
Golovko, George
Rojas, Mark
Pavlidis, Ioannis
Chumakov, Sergei
Aguilar, Gerardo
Chávez, Arturo
Widger, William R.
Fofanov, Yuriy
The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title_full The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title_fullStr The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title_full_unstemmed The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title_short The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
title_sort ability of human nuclear dna to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5153897/
https://www.ncbi.nlm.nih.gov/pubmed/27955616
http://dx.doi.org/10.1186/s12864-016-3375-x
work_keys_str_mv AT albayraklevent theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT khanipovkamil theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT pimenovamaria theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT golovkogeorge theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT rojasmark theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT pavlidisioannis theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT chumakovsergei theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT aguilargerardo theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT chavezarturo theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT widgerwilliamr theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT fofanovyuriy theabilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT albayraklevent abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT khanipovkamil abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT pimenovamaria abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT golovkogeorge abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT rojasmark abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT pavlidisioannis abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT chumakovsergei abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT aguilargerardo abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT chavezarturo abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT widgerwilliamr abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome
AT fofanovyuriy abilityofhumannucleardnatocausefalsepositivelowabundanceheteroplasmycallsvariesacrossthemitochondrialgenome