Cargando…

Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes

A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, bu...

Descripción completa

Detalles Bibliográficos
Autores principales: Bagnaresi, Paolo, Cattivelli, Luigi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304597/
https://www.ncbi.nlm.nih.gov/pubmed/32559249
http://dx.doi.org/10.1371/journal.pone.0234782
_version_ 1783548286639013888
author Bagnaresi, Paolo
Cattivelli, Luigi
author_facet Bagnaresi, Paolo
Cattivelli, Luigi
author_sort Bagnaresi, Paolo
collection PubMed
description A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e(-6)), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role.
format Online
Article
Text
id pubmed-7304597
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73045972020-06-19 Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes Bagnaresi, Paolo Cattivelli, Luigi PLoS One Research Article A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e(-6)), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role. Public Library of Science 2020-06-19 /pmc/articles/PMC7304597/ /pubmed/32559249 http://dx.doi.org/10.1371/journal.pone.0234782 Text en © 2020 Bagnaresi, Cattivelli http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bagnaresi, Paolo
Cattivelli, Luigi
Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title_full Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title_fullStr Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title_full_unstemmed Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title_short Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
title_sort ab initio go-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304597/
https://www.ncbi.nlm.nih.gov/pubmed/32559249
http://dx.doi.org/10.1371/journal.pone.0234782
work_keys_str_mv AT bagnaresipaolo abinitiogobasedminingfornontandemduplicatedfunctionalclustersinthreemodelplantdiploidgenomes
AT cattivelliluigi abinitiogobasedminingfornontandemduplicatedfunctionalclustersinthreemodelplantdiploidgenomes