Cargando…

LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes

Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Dapeng, Zhang, Yubin, Fan, Zhonghua, Liu, Guiming, Yu, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3256993/
https://www.ncbi.nlm.nih.gov/pubmed/22267903
http://dx.doi.org/10.4137/EBO.S8540
_version_ 1782221087700942848
author Wang, Dapeng
Zhang, Yubin
Fan, Zhonghua
Liu, Guiming
Yu, Jun
author_facet Wang, Dapeng
Zhang, Yubin
Fan, Zhonghua
Liu, Guiming
Yu, Jun
author_sort Wang, Dapeng
collection PubMed
description Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database—LCGbase (a comprehensive database for lineage-based co-regulated genes)—hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase.
format Online
Article
Text
id pubmed-3256993
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-32569932012-01-20 LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes Wang, Dapeng Zhang, Yubin Fan, Zhonghua Liu, Guiming Yu, Jun Evol Bioinform Online Original Research Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database—LCGbase (a comprehensive database for lineage-based co-regulated genes)—hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase. Libertas Academica 2011-12-13 /pmc/articles/PMC3256993/ /pubmed/22267903 http://dx.doi.org/10.4137/EBO.S8540 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research
Wang, Dapeng
Zhang, Yubin
Fan, Zhonghua
Liu, Guiming
Yu, Jun
LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_full LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_fullStr LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_full_unstemmed LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_short LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_sort lcgbase: a comprehensive database for lineage-based co-regulated genes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3256993/
https://www.ncbi.nlm.nih.gov/pubmed/22267903
http://dx.doi.org/10.4137/EBO.S8540
work_keys_str_mv AT wangdapeng lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT zhangyubin lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT fanzhonghua lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT liuguiming lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT yujun lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes