Cargando…

Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes

BACKGROUND: Large-scale genome sequencing poses enormous problems to the logistics of laboratory work and data handling. When numerous fragments of different genomes are PCR amplified and sequenced in a laboratory, there is a high immanent risk of sample confusion. For genetic markers, such as mitoc...

Descripción completa

Detalles Bibliográficos
Autores principales: Kong, Qing-Peng, Salas, Antonio, Sun, Chang, Fuku, Noriyuki, Tanaka, Masashi, Zhong, Li, Wang, Cheng-Ye, Yao, Yong-Gang, Bandelt, Hans-Jürgen
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515346/
https://www.ncbi.nlm.nih.gov/pubmed/18714389
http://dx.doi.org/10.1371/journal.pone.0003016
_version_ 1782158420200128512
author Kong, Qing-Peng
Salas, Antonio
Sun, Chang
Fuku, Noriyuki
Tanaka, Masashi
Zhong, Li
Wang, Cheng-Ye
Yao, Yong-Gang
Bandelt, Hans-Jürgen
author_facet Kong, Qing-Peng
Salas, Antonio
Sun, Chang
Fuku, Noriyuki
Tanaka, Masashi
Zhong, Li
Wang, Cheng-Ye
Yao, Yong-Gang
Bandelt, Hans-Jürgen
author_sort Kong, Qing-Peng
collection PubMed
description BACKGROUND: Large-scale genome sequencing poses enormous problems to the logistics of laboratory work and data handling. When numerous fragments of different genomes are PCR amplified and sequenced in a laboratory, there is a high immanent risk of sample confusion. For genetic markers, such as mitochondrial DNA (mtDNA), which are free of natural recombination, single instances of sample mix-up involving different branches of the mtDNA phylogeny would give rise to reticulate patterns and should therefore be detectable. METHODOLOGY/PRINCIPAL FINDINGS: We have developed a strategy for comparing new complete mtDNA genomes, one by one, to a current skeleton of the worldwide mtDNA phylogeny. The mutations distinguishing the reference sequence from a putative recombinant sequence can then be allocated to two or more different branches of this phylogenetic skeleton. Thus, one would search for two (or three) near-matches in the total mtDNA database that together best explain the variation seen in the recombinants. The evolutionary pathway from the mtDNA tree connecting this pair together with the recombinant then generate a grid-like median network, from which one can read off the exchanged segments. CONCLUSIONS: We have applied this procedure to a large collection of complete human mtDNA sequences, where several recombinants could be distilled by our method. All these recombinant sequences were subsequently corrected by de novo experiments – fully concordant with the predictions from our data-analytical approach.
format Text
id pubmed-2515346
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-25153462008-08-20 Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes Kong, Qing-Peng Salas, Antonio Sun, Chang Fuku, Noriyuki Tanaka, Masashi Zhong, Li Wang, Cheng-Ye Yao, Yong-Gang Bandelt, Hans-Jürgen PLoS One Research Article BACKGROUND: Large-scale genome sequencing poses enormous problems to the logistics of laboratory work and data handling. When numerous fragments of different genomes are PCR amplified and sequenced in a laboratory, there is a high immanent risk of sample confusion. For genetic markers, such as mitochondrial DNA (mtDNA), which are free of natural recombination, single instances of sample mix-up involving different branches of the mtDNA phylogeny would give rise to reticulate patterns and should therefore be detectable. METHODOLOGY/PRINCIPAL FINDINGS: We have developed a strategy for comparing new complete mtDNA genomes, one by one, to a current skeleton of the worldwide mtDNA phylogeny. The mutations distinguishing the reference sequence from a putative recombinant sequence can then be allocated to two or more different branches of this phylogenetic skeleton. Thus, one would search for two (or three) near-matches in the total mtDNA database that together best explain the variation seen in the recombinants. The evolutionary pathway from the mtDNA tree connecting this pair together with the recombinant then generate a grid-like median network, from which one can read off the exchanged segments. CONCLUSIONS: We have applied this procedure to a large collection of complete human mtDNA sequences, where several recombinants could be distilled by our method. All these recombinant sequences were subsequently corrected by de novo experiments – fully concordant with the predictions from our data-analytical approach. Public Library of Science 2008-08-20 /pmc/articles/PMC2515346/ /pubmed/18714389 http://dx.doi.org/10.1371/journal.pone.0003016 Text en Kong et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kong, Qing-Peng
Salas, Antonio
Sun, Chang
Fuku, Noriyuki
Tanaka, Masashi
Zhong, Li
Wang, Cheng-Ye
Yao, Yong-Gang
Bandelt, Hans-Jürgen
Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title_full Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title_fullStr Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title_full_unstemmed Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title_short Distilling Artificial Recombinants from Large Sets of Complete mtDNA Genomes
title_sort distilling artificial recombinants from large sets of complete mtdna genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515346/
https://www.ncbi.nlm.nih.gov/pubmed/18714389
http://dx.doi.org/10.1371/journal.pone.0003016
work_keys_str_mv AT kongqingpeng distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT salasantonio distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT sunchang distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT fukunoriyuki distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT tanakamasashi distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT zhongli distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT wangchengye distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT yaoyonggang distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes
AT bandelthansjurgen distillingartificialrecombinantsfromlargesetsofcompletemtdnagenomes