Cargando…
Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes
A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, bu...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304597/ https://www.ncbi.nlm.nih.gov/pubmed/32559249 http://dx.doi.org/10.1371/journal.pone.0234782 |
_version_ | 1783548286639013888 |
---|---|
author | Bagnaresi, Paolo Cattivelli, Luigi |
author_facet | Bagnaresi, Paolo Cattivelli, Luigi |
author_sort | Bagnaresi, Paolo |
collection | PubMed |
description | A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e(-6)), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role. |
format | Online Article Text |
id | pubmed-7304597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73045972020-06-19 Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes Bagnaresi, Paolo Cattivelli, Luigi PLoS One Research Article A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e(-6)), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role. Public Library of Science 2020-06-19 /pmc/articles/PMC7304597/ /pubmed/32559249 http://dx.doi.org/10.1371/journal.pone.0234782 Text en © 2020 Bagnaresi, Cattivelli http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Bagnaresi, Paolo Cattivelli, Luigi Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title | Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title_full | Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title_fullStr | Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title_full_unstemmed | Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title_short | Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
title_sort | ab initio go-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304597/ https://www.ncbi.nlm.nih.gov/pubmed/32559249 http://dx.doi.org/10.1371/journal.pone.0234782 |
work_keys_str_mv | AT bagnaresipaolo abinitiogobasedminingfornontandemduplicatedfunctionalclustersinthreemodelplantdiploidgenomes AT cattivelliluigi abinitiogobasedminingfornontandemduplicatedfunctionalclustersinthreemodelplantdiploidgenomes |