Cargando…
Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual prote...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875946/ https://www.ncbi.nlm.nih.gov/pubmed/26833483 http://dx.doi.org/10.1007/s00438-016-1170-7 |
_version_ | 1782433164593987584 |
---|---|
author | Pang, Erli Wu, Xiaomei Lin, Kui |
author_facet | Pang, Erli Wu, Xiaomei Lin, Kui |
author_sort | Pang, Erli |
collection | PubMed |
description | Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00438-016-1170-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4875946 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-48759462016-06-21 Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences Pang, Erli Wu, Xiaomei Lin, Kui Mol Genet Genomics Original Article Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00438-016-1170-7) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2016-01-30 2016 /pmc/articles/PMC4875946/ /pubmed/26833483 http://dx.doi.org/10.1007/s00438-016-1170-7 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Original Article Pang, Erli Wu, Xiaomei Lin, Kui Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title | Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title_full | Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title_fullStr | Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title_full_unstemmed | Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title_short | Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences |
title_sort | different evolutionary patterns of snps between domains and unassigned regions in human protein-coding sequences |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875946/ https://www.ncbi.nlm.nih.gov/pubmed/26833483 http://dx.doi.org/10.1007/s00438-016-1170-7 |
work_keys_str_mv | AT pangerli differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences AT wuxiaomei differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences AT linkui differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences |