Cargando…
Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes
BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4092219/ https://www.ncbi.nlm.nih.gov/pubmed/24950957 http://dx.doi.org/10.1186/1471-2164-15-504 |
_version_ | 1782325466011533312 |
---|---|
author | Han, Fengming Peng, Yong Xu, Lijia Xiao, Peigen |
author_facet | Han, Fengming Peng, Yong Xu, Lijia Xiao, Peigen |
author_sort | Han, Fengming |
collection | PubMed |
description | BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes. RESULTS: We identified single copy genes across 29 angiosperm genomes. A significant negative correlation was found between the number of duplicate blocks and the number of single copy genes. We found that a considerable number of single copy genes are located in organelles, showing a preference for binding and catalytic activity. The analysis of effective number of codons (Nc) illustrates that single copy genes have a stronger codon bias than non-single copy genes in eudicots. The relative high expression level of single copy genes was partially confirmed by the RNA-seq data, rather than the Codon Adaptation Index (CAI). Unlike in most other species, a strongly negatively correlation occurs between Nc and GC3 among single copy genes in grass genomes. When compared to all non-single copy genes, single copy genes indicate more conservation (as indicated by Ka and Ks values). But our alternative splicing (AS) results reveal that selective constraints are weaker in single copy genes than in low copy family genes (1–10 in-paralogs) and stronger than high copy family genes (>10 in-paralogs). Using concatenated shared single copy genes, we obtained a well-resolved phylogenetic tree. With the addition of intron sequences, the branch support is improved, but striking incongruences are also evident. Therefore, it is noteworthy that inclusion of intron sequences seems more appropriate for the phylogenetic reconstruction at lower taxonomic levels. CONCLUSIONS: Our analysis provides insight into the evolutionary characteristics of single copy genes across 29 angiosperm genomes. The results suggest that there are key differences in evolutionary constraints between single copy genes and non-single copy genes. And to some extent, these evolutionary constraints show some species-specific differences, especially between eudicots and monocots. Our preliminary evidence also suggests that the concatenated shared single copy genes are well suited for use in resolving phylogenetic relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-504) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4092219 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40922192014-07-21 Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes Han, Fengming Peng, Yong Xu, Lijia Xiao, Peigen BMC Genomics Research Article BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes. RESULTS: We identified single copy genes across 29 angiosperm genomes. A significant negative correlation was found between the number of duplicate blocks and the number of single copy genes. We found that a considerable number of single copy genes are located in organelles, showing a preference for binding and catalytic activity. The analysis of effective number of codons (Nc) illustrates that single copy genes have a stronger codon bias than non-single copy genes in eudicots. The relative high expression level of single copy genes was partially confirmed by the RNA-seq data, rather than the Codon Adaptation Index (CAI). Unlike in most other species, a strongly negatively correlation occurs between Nc and GC3 among single copy genes in grass genomes. When compared to all non-single copy genes, single copy genes indicate more conservation (as indicated by Ka and Ks values). But our alternative splicing (AS) results reveal that selective constraints are weaker in single copy genes than in low copy family genes (1–10 in-paralogs) and stronger than high copy family genes (>10 in-paralogs). Using concatenated shared single copy genes, we obtained a well-resolved phylogenetic tree. With the addition of intron sequences, the branch support is improved, but striking incongruences are also evident. Therefore, it is noteworthy that inclusion of intron sequences seems more appropriate for the phylogenetic reconstruction at lower taxonomic levels. CONCLUSIONS: Our analysis provides insight into the evolutionary characteristics of single copy genes across 29 angiosperm genomes. The results suggest that there are key differences in evolutionary constraints between single copy genes and non-single copy genes. And to some extent, these evolutionary constraints show some species-specific differences, especially between eudicots and monocots. Our preliminary evidence also suggests that the concatenated shared single copy genes are well suited for use in resolving phylogenetic relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-504) contains supplementary material, which is available to authorized users. BioMed Central 2014-06-21 /pmc/articles/PMC4092219/ /pubmed/24950957 http://dx.doi.org/10.1186/1471-2164-15-504 Text en © Han et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Han, Fengming Peng, Yong Xu, Lijia Xiao, Peigen Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title | Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title_full | Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title_fullStr | Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title_full_unstemmed | Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title_short | Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
title_sort | identification, characterization, and utilization of single copy genes in 29 angiosperm genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4092219/ https://www.ncbi.nlm.nih.gov/pubmed/24950957 http://dx.doi.org/10.1186/1471-2164-15-504 |
work_keys_str_mv | AT hanfengming identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes AT pengyong identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes AT xulijia identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes AT xiaopeigen identificationcharacterizationandutilizationofsinglecopygenesin29angiospermgenomes |