Cargando…

Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome

Practices related to mitochondrial research have long been hindered by the presence of mitochondrial pseudogenes within the nuclear genome (NUMTs). Even though partially assembled human reference genomes like hg38 have included NUMTs compilation, the exhaustive NUMTs within the only complete referen...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Yichen, He, Chengpeng, Lin, Deng, Gu, Zhenglong, Pu, Weilin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671835/
https://www.ncbi.nlm.nih.gov/pubmed/38003036
http://dx.doi.org/10.3390/genes14112092
_version_ 1785149464653070336
author Tao, Yichen
He, Chengpeng
Lin, Deng
Gu, Zhenglong
Pu, Weilin
author_facet Tao, Yichen
He, Chengpeng
Lin, Deng
Gu, Zhenglong
Pu, Weilin
author_sort Tao, Yichen
collection PubMed
description Practices related to mitochondrial research have long been hindered by the presence of mitochondrial pseudogenes within the nuclear genome (NUMTs). Even though partially assembled human reference genomes like hg38 have included NUMTs compilation, the exhaustive NUMTs within the only complete reference genome (T2T-CHR13) remain unknown. Here, we comprehensively identified the fixed NUMTs within the reference genome using human pan-mitogenome (HPMT) from GeneBank. The inclusion of HPMT serves the purpose of establishing an authentic mitochondrial DNA (mtDNA) mutational spectrum for the identification of NUMTs, distinguishing it from the polymorphic variations found in NUMTs. Using HPMT, we identified approximately 10% of additional NUMTs in three human reference genomes under stricter thresholds. And we also observed an approximate 6% increase in NUMTs in T2T-CHR13 compared to hg38, including NUMTs on the short arms of chromosomes 13, 14, and 15 that were not assembled previously. Furthermore, alignments based on 20-mer from mtDNA suggested the presence of more mtDNA-like short segments within the nuclear genome, which should be avoided for short amplicon or cell free mtDNA detection. Finally, through the assay of transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) on cell lines before and after mtDNA elimination, we concluded that NUMTs have a minimal impact on bulk ATAC-seq, even though 16% of sequencing data originated from mtDNA
format Online
Article
Text
id pubmed-10671835
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106718352023-11-17 Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome Tao, Yichen He, Chengpeng Lin, Deng Gu, Zhenglong Pu, Weilin Genes (Basel) Article Practices related to mitochondrial research have long been hindered by the presence of mitochondrial pseudogenes within the nuclear genome (NUMTs). Even though partially assembled human reference genomes like hg38 have included NUMTs compilation, the exhaustive NUMTs within the only complete reference genome (T2T-CHR13) remain unknown. Here, we comprehensively identified the fixed NUMTs within the reference genome using human pan-mitogenome (HPMT) from GeneBank. The inclusion of HPMT serves the purpose of establishing an authentic mitochondrial DNA (mtDNA) mutational spectrum for the identification of NUMTs, distinguishing it from the polymorphic variations found in NUMTs. Using HPMT, we identified approximately 10% of additional NUMTs in three human reference genomes under stricter thresholds. And we also observed an approximate 6% increase in NUMTs in T2T-CHR13 compared to hg38, including NUMTs on the short arms of chromosomes 13, 14, and 15 that were not assembled previously. Furthermore, alignments based on 20-mer from mtDNA suggested the presence of more mtDNA-like short segments within the nuclear genome, which should be avoided for short amplicon or cell free mtDNA detection. Finally, through the assay of transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) on cell lines before and after mtDNA elimination, we concluded that NUMTs have a minimal impact on bulk ATAC-seq, even though 16% of sequencing data originated from mtDNA MDPI 2023-11-17 /pmc/articles/PMC10671835/ /pubmed/38003036 http://dx.doi.org/10.3390/genes14112092 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tao, Yichen
He, Chengpeng
Lin, Deng
Gu, Zhenglong
Pu, Weilin
Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title_full Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title_fullStr Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title_full_unstemmed Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title_short Comprehensive Identification of Mitochondrial Pseudogenes (NUMTs) in the Human Telomere-to-Telomere Reference Genome
title_sort comprehensive identification of mitochondrial pseudogenes (numts) in the human telomere-to-telomere reference genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671835/
https://www.ncbi.nlm.nih.gov/pubmed/38003036
http://dx.doi.org/10.3390/genes14112092
work_keys_str_mv AT taoyichen comprehensiveidentificationofmitochondrialpseudogenesnumtsinthehumantelomeretotelomerereferencegenome
AT hechengpeng comprehensiveidentificationofmitochondrialpseudogenesnumtsinthehumantelomeretotelomerereferencegenome
AT lindeng comprehensiveidentificationofmitochondrialpseudogenesnumtsinthehumantelomeretotelomerereferencegenome
AT guzhenglong comprehensiveidentificationofmitochondrialpseudogenesnumtsinthehumantelomeretotelomerereferencegenome
AT puweilin comprehensiveidentificationofmitochondrialpseudogenesnumtsinthehumantelomeretotelomerereferencegenome