Cargando…

Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences

Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual prote...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Erli, Wu, Xiaomei, Lin, Kui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875946/
https://www.ncbi.nlm.nih.gov/pubmed/26833483
http://dx.doi.org/10.1007/s00438-016-1170-7
_version_ 1782433164593987584
author Pang, Erli
Wu, Xiaomei
Lin, Kui
author_facet Pang, Erli
Wu, Xiaomei
Lin, Kui
author_sort Pang, Erli
collection PubMed
description Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00438-016-1170-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4875946
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-48759462016-06-21 Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences Pang, Erli Wu, Xiaomei Lin, Kui Mol Genet Genomics Original Article Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00438-016-1170-7) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2016-01-30 2016 /pmc/articles/PMC4875946/ /pubmed/26833483 http://dx.doi.org/10.1007/s00438-016-1170-7 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Pang, Erli
Wu, Xiaomei
Lin, Kui
Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title_full Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title_fullStr Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title_full_unstemmed Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title_short Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences
title_sort different evolutionary patterns of snps between domains and unassigned regions in human protein-coding sequences
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875946/
https://www.ncbi.nlm.nih.gov/pubmed/26833483
http://dx.doi.org/10.1007/s00438-016-1170-7
work_keys_str_mv AT pangerli differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences
AT wuxiaomei differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences
AT linkui differentevolutionarypatternsofsnpsbetweendomainsandunassignedregionsinhumanproteincodingsequences