Cargando…

Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family

BACKGROUND: The SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain is an evolutionarily conserved sequence of approximately 130-150 amino acids, and constitutes the catalytic site of lysine methyltransferases (KMTs). KMTs perform many crucial biological functions via histone methylation of chroma...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yong, Liu, Chunlin, Shen, Wen-Hui, Ruan, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3264562/
https://www.ncbi.nlm.nih.gov/pubmed/22168908
http://dx.doi.org/10.1186/1471-2229-11-175
_version_ 1782221986759442432
author Huang, Yong
Liu, Chunlin
Shen, Wen-Hui
Ruan, Ying
author_facet Huang, Yong
Liu, Chunlin
Shen, Wen-Hui
Ruan, Ying
author_sort Huang, Yong
collection PubMed
description BACKGROUND: The SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain is an evolutionarily conserved sequence of approximately 130-150 amino acids, and constitutes the catalytic site of lysine methyltransferases (KMTs). KMTs perform many crucial biological functions via histone methylation of chromatin. Histone methylation marks are interpreted differently depending on the histone type (i.e. H3 or H4), the lysine position (e.g. H3K4, H3K9, H3K27, H3K36 or H4K20) and the number of added methyl groups (i.e. me1, me2 or me3). For example, H3K4me3 and H3K36me3 are associated with transcriptional activation, but H3K9me2 and H3K27me3 are associated with gene silencing. The substrate specificity and activity of KMTs are determined by sequences within the SET domain and other regions of the protein. RESULTS: Here we identified 49 SET-domain proteins from the recently sequenced Brassica rapa genome. We performed sequence similarity and protein domain organization analysis of these proteins, along with the SET-domain proteins from the dicot Arabidopsis thaliana, the monocots Oryza sativa and Brachypodium distachyon, and the green alga Ostreococcus tauri. We showed that plant SET-domain proteins can be grouped into 6 distinct classes, namely KMT1, KMT2, KMT3, KMT6, KMT7 and S-ET. Apart from the S-ET class, which has an interrupted SET domain and may be involved in methylation of nonhistone proteins, the other classes have characteristics of histone methyltransferases exhibiting different substrate specificities: KMT1 for H3K9, KMT2 for H3K4, KMT3 for H3K36, KMT6 for H3K27 and KMT7 also for H3K4. We also propose a coherent and rational nomenclature for plant SET-domain proteins. Comparisons of sequence similarity and synteny of B. rapa and A. thaliana SET-domain proteins revealed recent gene duplication events for some KMTs. CONCLUSION: This study provides the first characterization of the SET-domain KMT proteins of B. rapa. Phylogenetic analysis data allowed the development of a coherent and rational nomenclature of this important family of proteins in plants, as in animals. The results obtained in this study will provide a base for nomenclature of KMTs in other plant species and facilitate the functional characterization of these important epigenetic regulatory genes in Brassica crops.
format Online
Article
Text
id pubmed-3264562
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32645622012-01-24 Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family Huang, Yong Liu, Chunlin Shen, Wen-Hui Ruan, Ying BMC Plant Biol Research Article BACKGROUND: The SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain is an evolutionarily conserved sequence of approximately 130-150 amino acids, and constitutes the catalytic site of lysine methyltransferases (KMTs). KMTs perform many crucial biological functions via histone methylation of chromatin. Histone methylation marks are interpreted differently depending on the histone type (i.e. H3 or H4), the lysine position (e.g. H3K4, H3K9, H3K27, H3K36 or H4K20) and the number of added methyl groups (i.e. me1, me2 or me3). For example, H3K4me3 and H3K36me3 are associated with transcriptional activation, but H3K9me2 and H3K27me3 are associated with gene silencing. The substrate specificity and activity of KMTs are determined by sequences within the SET domain and other regions of the protein. RESULTS: Here we identified 49 SET-domain proteins from the recently sequenced Brassica rapa genome. We performed sequence similarity and protein domain organization analysis of these proteins, along with the SET-domain proteins from the dicot Arabidopsis thaliana, the monocots Oryza sativa and Brachypodium distachyon, and the green alga Ostreococcus tauri. We showed that plant SET-domain proteins can be grouped into 6 distinct classes, namely KMT1, KMT2, KMT3, KMT6, KMT7 and S-ET. Apart from the S-ET class, which has an interrupted SET domain and may be involved in methylation of nonhistone proteins, the other classes have characteristics of histone methyltransferases exhibiting different substrate specificities: KMT1 for H3K9, KMT2 for H3K4, KMT3 for H3K36, KMT6 for H3K27 and KMT7 also for H3K4. We also propose a coherent and rational nomenclature for plant SET-domain proteins. Comparisons of sequence similarity and synteny of B. rapa and A. thaliana SET-domain proteins revealed recent gene duplication events for some KMTs. CONCLUSION: This study provides the first characterization of the SET-domain KMT proteins of B. rapa. Phylogenetic analysis data allowed the development of a coherent and rational nomenclature of this important family of proteins in plants, as in animals. The results obtained in this study will provide a base for nomenclature of KMTs in other plant species and facilitate the functional characterization of these important epigenetic regulatory genes in Brassica crops. BioMed Central 2011-12-14 /pmc/articles/PMC3264562/ /pubmed/22168908 http://dx.doi.org/10.1186/1471-2229-11-175 Text en Copyright ©2011 Huang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Huang, Yong
Liu, Chunlin
Shen, Wen-Hui
Ruan, Ying
Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title_full Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title_fullStr Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title_full_unstemmed Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title_short Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
title_sort phylogenetic analysis and classification of the brassica rapa set-domain protein family
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3264562/
https://www.ncbi.nlm.nih.gov/pubmed/22168908
http://dx.doi.org/10.1186/1471-2229-11-175
work_keys_str_mv AT huangyong phylogeneticanalysisandclassificationofthebrassicarapasetdomainproteinfamily
AT liuchunlin phylogeneticanalysisandclassificationofthebrassicarapasetdomainproteinfamily
AT shenwenhui phylogeneticanalysisandclassificationofthebrassicarapasetdomainproteinfamily
AT ruanying phylogeneticanalysisandclassificationofthebrassicarapasetdomainproteinfamily