Cargando…

Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions

BACKGROUND: Gene families are sets of structurally and evolutionarily related genes – in one or multiple species – that typically share a conserved biological function. As such, the identification and subsequent analyses of entire gene families are widely employed in the fields of evolutionary and f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bokros, Norbert, Popescu, Sorina C., Popescu, George V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419318/
https://www.ncbi.nlm.nih.gov/pubmed/30871456
http://dx.doi.org/10.1186/s12859-019-2624-9
_version_ 1783403917339525120
author Bokros, Norbert
Popescu, Sorina C.
Popescu, George V.
author_facet Bokros, Norbert
Popescu, Sorina C.
Popescu, George V.
author_sort Bokros, Norbert
collection PubMed
description BACKGROUND: Gene families are sets of structurally and evolutionarily related genes – in one or multiple species – that typically share a conserved biological function. As such, the identification and subsequent analyses of entire gene families are widely employed in the fields of evolutionary and functional genomics of both well established and newly sequenced plant genomes. Currently, plant gene families are typically identified using one of two major ways: 1) HMM-profile based searches using models built on Arabidopsis thaliana genes or 2) coding sequence homology searches using curated databases. Integrated databases containing functionally annotated genes and gene families have been developed for model organisms and several important crops; however, a comprehensive methodology for gene family annotation is currently lacking, preventing automated annotation of newly sequenced genomes. RESULTS: This paper proposes a combined measure of homology identification, motif conservation, phylogenomic and integrated gene expression analyses to define gene family structures in multiple plant species. The MAP3K gene families in seven plant species, including two currently unexamined species Gossypium hirsutum, and Zostera marina, were characterized to reveal new insights into their collective function and evolution and demonstrate the effectiveness of our novel methodology. CONCLUSION: Compared with recent reports, this methodology performs significantly better for the identification and analysis of gene family members in several monocots/dicots, diploid as well as polyploid plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2624-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6419318
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64193182019-03-27 Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions Bokros, Norbert Popescu, Sorina C. Popescu, George V. BMC Bioinformatics Research BACKGROUND: Gene families are sets of structurally and evolutionarily related genes – in one or multiple species – that typically share a conserved biological function. As such, the identification and subsequent analyses of entire gene families are widely employed in the fields of evolutionary and functional genomics of both well established and newly sequenced plant genomes. Currently, plant gene families are typically identified using one of two major ways: 1) HMM-profile based searches using models built on Arabidopsis thaliana genes or 2) coding sequence homology searches using curated databases. Integrated databases containing functionally annotated genes and gene families have been developed for model organisms and several important crops; however, a comprehensive methodology for gene family annotation is currently lacking, preventing automated annotation of newly sequenced genomes. RESULTS: This paper proposes a combined measure of homology identification, motif conservation, phylogenomic and integrated gene expression analyses to define gene family structures in multiple plant species. The MAP3K gene families in seven plant species, including two currently unexamined species Gossypium hirsutum, and Zostera marina, were characterized to reveal new insights into their collective function and evolution and demonstrate the effectiveness of our novel methodology. CONCLUSION: Compared with recent reports, this methodology performs significantly better for the identification and analysis of gene family members in several monocots/dicots, diploid as well as polyploid plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2624-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-03-14 /pmc/articles/PMC6419318/ /pubmed/30871456 http://dx.doi.org/10.1186/s12859-019-2624-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bokros, Norbert
Popescu, Sorina C.
Popescu, George V.
Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title_full Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title_fullStr Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title_full_unstemmed Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title_short Multispecies genome-wide analysis defines the MAP3K gene family in Gossypium hirsutum and reveals conserved family expansions
title_sort multispecies genome-wide analysis defines the map3k gene family in gossypium hirsutum and reveals conserved family expansions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419318/
https://www.ncbi.nlm.nih.gov/pubmed/30871456
http://dx.doi.org/10.1186/s12859-019-2624-9
work_keys_str_mv AT bokrosnorbert multispeciesgenomewideanalysisdefinesthemap3kgenefamilyingossypiumhirsutumandrevealsconservedfamilyexpansions
AT popescusorinac multispeciesgenomewideanalysisdefinesthemap3kgenefamilyingossypiumhirsutumandrevealsconservedfamilyexpansions
AT popescugeorgev multispeciesgenomewideanalysisdefinesthemap3kgenefamilyingossypiumhirsutumandrevealsconservedfamilyexpansions