The Duplicated Genes Database: Identification and Functional Annotation of Co-Localised Duplicated Genes across Genomes

Bibliographic Details
Main authors: Ouedraogo, Marion, Bettembourg, Charles, Bretaudeau, Anthony, Sallou, Olivier, Diot, Christian, Demeure, Olivier, Lecerf, Frédéric
Format: Online Article Text
Language: English
Published: Public Library of Science 2012
Subjects: Research Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3508997/
https://www.ncbi.nlm.nih.gov/pubmed/23209799
http://dx.doi.org/10.1371/journal.pone.0050653
author Ouedraogo, Marion
Bettembourg, Charles
Bretaudeau, Anthony
Sallou, Olivier
Diot, Christian
Demeure, Olivier
Lecerf, Frédéric
author_sort Ouedraogo, Marion
collection PubMed
description BACKGROUND: There has been a surge in studies linking genome structure and gene expression, with a special focus on duplicated genes. Although they originate from the same sequence, duplicated genes can diverge strongly over the course of evolution and acquire different functions or differently regulated expression. However, information on the function and expression of duplicated genes remains sparse. Identifying groups of duplicated genes in different genomes and characterizing their expression and function would therefore be of great interest to the research community. The ‘Duplicated Genes Database’ (DGD) was developed for this purpose. METHODOLOGY: Nine species were included in the DGD. For each species, BLAST analyses were conducted on the peptide sequences of genes mapped to the same chromosome. Groups of duplicated genes were defined based on these pairwise BLAST comparisons and the genomic location of the genes. For each group, Pearson correlations between gene expression data and semantic similarities between functional GO annotations were also computed when the relevant information was available. CONCLUSIONS: The Duplicated Genes Database provides a list of co-localised, duplicated genes for several species, together with the gene co-expression level and the semantic similarity of functional annotations where available. Adding these data to the groups of duplicated genes provides biological information that can prove useful in gene expression analyses. The Duplicated Genes Database can be freely accessed through the DGD website at http://dgd.genouest.org.
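The METHODOLOGY summarised above (pairwise comparisons restricted to genes on the same chromosome, grouping, then per-group Pearson correlation of expression) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the record does not give the BLAST thresholds, the grouping rule, or any data format, so the single-linkage (union-find) grouping, the function names `group_duplicates`, `pearson` and `mean_coexpression`, and the toy gene identifiers are hypothetical and are not the authors' implementation.

```python
import math
from itertools import combinations


def group_duplicates(hits, chromosome_of):
    """Group genes linked by pairwise hits, keeping only same-chromosome pairs.

    Assumption: single-linkage grouping via union-find; the source does not
    state the actual grouping criteria.
    """
    parent = {}

    def find(gene):
        parent.setdefault(gene, gene)
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]  # path halving
            gene = parent[gene]
        return gene

    for a, b in hits:
        if chromosome_of[a] == chromosome_of[b]:
            parent[find(a)] = find(b)

    groups = {}
    for gene in list(parent):
        groups.setdefault(find(gene), set()).add(gene)
    return sorted(sorted(members) for members in groups.values() if len(members) > 1)


def pearson(x, y):
    """Pearson correlation of two equal-length vectors; None if undefined."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return None
    return cov / math.sqrt(var_x * var_y)


def mean_coexpression(group, expression):
    """Mean pairwise Pearson correlation over genes with expression data."""
    values = []
    for a, b in combinations(group, 2):
        if a in expression and b in expression:
            r = pearson(expression[a], expression[b])
            if r is not None:
                values.append(r)
    return sum(values) / len(values) if values else None


# Toy data (hypothetical identifiers): g4 and g5 match but lie on different
# chromosomes, so they are not grouped.
hits = [("g1", "g2"), ("g2", "g3"), ("g4", "g5")]
chromosome_of = {"g1": "1", "g2": "1", "g3": "1", "g4": "2", "g5": "3"}
expression = {"g1": [1, 2, 3], "g2": [2, 4, 6], "g3": [3, 2, 1]}

groups = group_duplicates(hits, chromosome_of)
coexpression = {tuple(g): mean_coexpression(g, expression) for g in groups}
```

A correlation is reported only when expression data exist for at least one pair of genes in a group, mirroring the "when the relevant information was available" condition in the abstract.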
format Online
Article
Text
id pubmed-3508997
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3508997 2012-12-03 PLoS One Research Article
Public Library of Science 2012-11-28 /pmc/articles/PMC3508997/ /pubmed/23209799 http://dx.doi.org/10.1371/journal.pone.0050653 Text en © 2012 Ouedraogo et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title The Duplicated Genes Database: Identification and Functional Annotation of Co-Localised Duplicated Genes across Genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3508997/
https://www.ncbi.nlm.nih.gov/pubmed/23209799
http://dx.doi.org/10.1371/journal.pone.0050653