Cargando…

Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment

BACKGROUND: A basic tool for studying the polyploidization history of a genome, especially in plants, is the distribution of duplicate gene similarities in syntenically aligned regions of a genome. This distribution can usually be decomposed into two or more components identifiable by peaks, or loca...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Yue, Zheng, Chunfang, Sankoff, David
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6915858/ https://www.ncbi.nlm.nih.gov/pubmed/31842736 http://dx.doi.org/10.1186/s12859-019-3202-x

_version_	1783480108023021568
author	Zhang, Yue Zheng, Chunfang Sankoff, David
author_facet	Zhang, Yue Zheng, Chunfang Sankoff, David
author_sort	Zhang, Yue
collection	PubMed
description	BACKGROUND: A basic tool for studying the polyploidization history of a genome, especially in plants, is the distribution of duplicate gene similarities in syntenically aligned regions of a genome. This distribution can usually be decomposed into two or more components identifiable by peaks, or local maxima, each representing a different polyploidization event. The distributions may be generated by means of a discrete time branching process, followed by a sequence divergence model. The branching process, as well as the inference of fractionation rates based on it, requires knowledge of the ploidy level of each event, which cannot be directly inferred from the pair similarity distribution. RESULTS: For a sequence of two events of unknown ploidy, either tetraploid, giving rise to whole genome doubling (WGD), or hexaploid, giving rise to whole genome tripling (WGT), we base our analysis on triples of similar genes. We calculate the probability of the four triplet types with origins in one or the other event, or both, and impose a mutational model so that the distribution resembles the original data. Using a ML transition point in the similarities between the two events as a discriminator for the hypothesized origin of each similarity, we calculate the predicted number of triplets of each type for each model combining WGT and/or WGD. This yields a predicted profile of triplet types for each model. We compare the observed and predicted triplet profiles for each model to confirm the polyploidization history of durian, poplar and cabbage. CONCLUSIONS: We have developed a way of inferring the ploidy of up to three successive WGD and/or WGT events by estimating the time of origin of each of the similarities in triples of genes. This may be generalized to a larger number of events and to higher ploidies.
format	Online Article Text
id	pubmed-6915858
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-69158582019-12-30 Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment Zhang, Yue Zheng, Chunfang Sankoff, David BMC Bioinformatics Research BACKGROUND: A basic tool for studying the polyploidization history of a genome, especially in plants, is the distribution of duplicate gene similarities in syntenically aligned regions of a genome. This distribution can usually be decomposed into two or more components identifiable by peaks, or local maxima, each representing a different polyploidization event. The distributions may be generated by means of a discrete time branching process, followed by a sequence divergence model. The branching process, as well as the inference of fractionation rates based on it, requires knowledge of the ploidy level of each event, which cannot be directly inferred from the pair similarity distribution. RESULTS: For a sequence of two events of unknown ploidy, either tetraploid, giving rise to whole genome doubling (WGD), or hexaploid, giving rise to whole genome tripling (WGT), we base our analysis on triples of similar genes. We calculate the probability of the four triplet types with origins in one or the other event, or both, and impose a mutational model so that the distribution resembles the original data. Using a ML transition point in the similarities between the two events as a discriminator for the hypothesized origin of each similarity, we calculate the predicted number of triplets of each type for each model combining WGT and/or WGD. This yields a predicted profile of triplet types for each model. We compare the observed and predicted triplet profiles for each model to confirm the polyploidization history of durian, poplar and cabbage. CONCLUSIONS: We have developed a way of inferring the ploidy of up to three successive WGD and/or WGT events by estimating the time of origin of each of the similarities in triples of genes. This may be generalized to a larger number of events and to higher ploidies. BioMed Central 2019-12-17 /pmc/articles/PMC6915858/ /pubmed/31842736 http://dx.doi.org/10.1186/s12859-019-3202-x Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Zhang, Yue Zheng, Chunfang Sankoff, David Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title	Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title_full	Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title_fullStr	Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title_full_unstemmed	Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title_short	Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
title_sort	distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6915858/ https://www.ncbi.nlm.nih.gov/pubmed/31842736 http://dx.doi.org/10.1186/s12859-019-3202-x
work_keys_str_mv	AT zhangyue distinguishingsuccessiveancientpolyploidylevelsbasedongenomeinternalsyntenicalignment AT zhengchunfang distinguishingsuccessiveancientpolyploidylevelsbasedongenomeinternalsyntenicalignment AT sankoffdavid distinguishingsuccessiveancientpolyploidylevelsbasedongenomeinternalsyntenicalignment

Distinguishing successive ancient polyploidy levels based on genome-internal syntenic alignment

Ejemplares similares