Cargando…

Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes

BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into s...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Fengming, Peng, Yong, Xu, Lijia, Xiao, Peigen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4092219/
https://www.ncbi.nlm.nih.gov/pubmed/24950957
http://dx.doi.org/10.1186/1471-2164-15-504
_version_ 1782325466011533312
author Han, Fengming
Peng, Yong
Xu, Lijia
Xiao, Peigen
author_facet Han, Fengming
Peng, Yong
Xu, Lijia
Xiao, Peigen
author_sort Han, Fengming
collection PubMed
description BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes. RESULTS: We identified single copy genes across 29 angiosperm genomes. A significant negative correlation was found between the number of duplicate blocks and the number of single copy genes. We found that a considerable number of single copy genes are located in organelles, showing a preference for binding and catalytic activity. The analysis of effective number of codons (Nc) illustrates that single copy genes have a stronger codon bias than non-single copy genes in eudicots. The relative high expression level of single copy genes was partially confirmed by the RNA-seq data, rather than the Codon Adaptation Index (CAI). Unlike in most other species, a strongly negatively correlation occurs between Nc and GC3 among single copy genes in grass genomes. When compared to all non-single copy genes, single copy genes indicate more conservation (as indicated by Ka and Ks values). But our alternative splicing (AS) results reveal that selective constraints are weaker in single copy genes than in low copy family genes (1–10 in-paralogs) and stronger than high copy family genes (>10 in-paralogs). Using concatenated shared single copy genes, we obtained a well-resolved phylogenetic tree. With the addition of intron sequences, the branch support is improved, but striking incongruences are also evident. Therefore, it is noteworthy that inclusion of intron sequences seems more appropriate for the phylogenetic reconstruction at lower taxonomic levels. CONCLUSIONS: Our analysis provides insight into the evolutionary characteristics of single copy genes across 29 angiosperm genomes. The results suggest that there are key differences in evolutionary constraints between single copy genes and non-single copy genes. And to some extent, these evolutionary constraints show some species-specific differences, especially between eudicots and monocots. Our preliminary evidence also suggests that the concatenated shared single copy genes are well suited for use in resolving phylogenetic relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-504) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4092219
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40922192014-07-21 Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes Han, Fengming Peng, Yong Xu, Lijia Xiao, Peigen BMC Genomics Research Article BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes. RESULTS: We identified single copy genes across 29 angiosperm genomes. A significant negative correlation was found between the number of duplicate blocks and the number of single copy genes. We found that a considerable number of single copy genes are located in organelles, showing a preference for binding and catalytic activity. The analysis of effective number of codons (Nc) illustrates that single copy genes have a stronger codon bias than non-single copy genes in eudicots. The relative high expression level of single copy genes was partially confirmed by the RNA-seq data, rather than the Codon Adaptation Index (CAI). Unlike in most other species, a strongly negatively correlation occurs between Nc and GC3 among single copy genes in grass genomes. When compared to all non-single copy genes, single copy genes indicate more conservation (as indicated by Ka and Ks values). But our alternative splicing (AS) results reveal that selective constraints are weaker in single copy genes than in low copy family genes (1–10 in-paralogs) and stronger than high copy family genes (>10 in-paralogs). Using concatenated shared single copy genes, we obtained a well-resolved phylogenetic tree. With the addition of intron sequences, the branch support is improved, but striking incongruences are also evident. Therefore, it is noteworthy that inclusion of intron sequences seems more appropriate for the phylogenetic reconstruction at lower taxonomic levels. CONCLUSIONS: Our analysis provides insight into the evolutionary characteristics of single copy genes across 29 angiosperm genomes. The results suggest that there are key differences in evolutionary constraints between single copy genes and non-single copy genes. And to some extent, these evolutionary constraints show some species-specific differences, especially between eudicots and monocots. Our preliminary evidence also suggests that the concatenated shared single copy genes are well suited for use in resolving phylogenetic relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-504) contains supplementary material, which is available to authorized users. BioMed Central 2014-06-21 /pmc/articles/PMC4092219/ /pubmed/24950957 http://dx.doi.org/10.1186/1471-2164-15-504 Text en © Han et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Han, Fengming
Peng, Yong
Xu, Lijia
Xiao, Peigen
Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title_full Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title_fullStr Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title_full_unstemmed Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title_short Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
title_sort identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4092219/
https://www.ncbi.nlm.nih.gov/pubmed/24950957
http://dx.doi.org/10.1186/1471-2164-15-504
work_keys_str_mv AT hanfengming identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes
AT pengyong identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes
AT xulijia identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes
AT xiaopeigen identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes