Cargando…

Natural protein sequences are more intrinsically disordered than random sequences

Most natural protein sequences have resulted from millions or even billions of years of evolution. How they differ from random sequences is not fully understood. Previous computational and experimental studies of random proteins generated from noncoding regions yielded inclusive results due to speci...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Jia-Feng, Cao, Zanxia, Yang, Yuedong, Wang, Chun-Ling, Su, Zhen-Dong, Zhao, Ya-Wei, Wang, Ji-Hua, Zhou, Yaoqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937073/
https://www.ncbi.nlm.nih.gov/pubmed/26801222
http://dx.doi.org/10.1007/s00018-016-2138-9
_version_ 1782441646041858048
author Yu, Jia-Feng
Cao, Zanxia
Yang, Yuedong
Wang, Chun-Ling
Su, Zhen-Dong
Zhao, Ya-Wei
Wang, Ji-Hua
Zhou, Yaoqi
author_facet Yu, Jia-Feng
Cao, Zanxia
Yang, Yuedong
Wang, Chun-Ling
Su, Zhen-Dong
Zhao, Ya-Wei
Wang, Ji-Hua
Zhou, Yaoqi
author_sort Yu, Jia-Feng
collection PubMed
description Most natural protein sequences have resulted from millions or even billions of years of evolution. How they differ from random sequences is not fully understood. Previous computational and experimental studies of random proteins generated from noncoding regions yielded inclusive results due to species-dependent codon biases and GC contents. Here, we approach this problem by investigating 10,000 sequences randomized at the amino acid level. Using well-established predictors for protein intrinsic disorder, we found that natural sequences have more long disordered regions than random sequences, even when random and natural sequences have the same overall composition of amino acid residues. We also showed that random sequences are as structured as natural sequences according to contents and length distributions of predicted secondary structure, although the structures from random sequences may be in a molten globular-like state, according to molecular dynamics simulations. The bias of natural sequences toward more intrinsic disorder suggests that natural sequences are created and evolved to avoid protein aggregation and increase functional diversity.
format Online
Article
Text
id pubmed-4937073
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-49370732016-07-19 Natural protein sequences are more intrinsically disordered than random sequences Yu, Jia-Feng Cao, Zanxia Yang, Yuedong Wang, Chun-Ling Su, Zhen-Dong Zhao, Ya-Wei Wang, Ji-Hua Zhou, Yaoqi Cell Mol Life Sci Original Article Most natural protein sequences have resulted from millions or even billions of years of evolution. How they differ from random sequences is not fully understood. Previous computational and experimental studies of random proteins generated from noncoding regions yielded inclusive results due to species-dependent codon biases and GC contents. Here, we approach this problem by investigating 10,000 sequences randomized at the amino acid level. Using well-established predictors for protein intrinsic disorder, we found that natural sequences have more long disordered regions than random sequences, even when random and natural sequences have the same overall composition of amino acid residues. We also showed that random sequences are as structured as natural sequences according to contents and length distributions of predicted secondary structure, although the structures from random sequences may be in a molten globular-like state, according to molecular dynamics simulations. The bias of natural sequences toward more intrinsic disorder suggests that natural sequences are created and evolved to avoid protein aggregation and increase functional diversity. Springer International Publishing 2016-01-22 2016 /pmc/articles/PMC4937073/ /pubmed/26801222 http://dx.doi.org/10.1007/s00018-016-2138-9 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Yu, Jia-Feng
Cao, Zanxia
Yang, Yuedong
Wang, Chun-Ling
Su, Zhen-Dong
Zhao, Ya-Wei
Wang, Ji-Hua
Zhou, Yaoqi
Natural protein sequences are more intrinsically disordered than random sequences
title Natural protein sequences are more intrinsically disordered than random sequences
title_full Natural protein sequences are more intrinsically disordered than random sequences
title_fullStr Natural protein sequences are more intrinsically disordered than random sequences
title_full_unstemmed Natural protein sequences are more intrinsically disordered than random sequences
title_short Natural protein sequences are more intrinsically disordered than random sequences
title_sort natural protein sequences are more intrinsically disordered than random sequences
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937073/
https://www.ncbi.nlm.nih.gov/pubmed/26801222
http://dx.doi.org/10.1007/s00018-016-2138-9
work_keys_str_mv AT yujiafeng naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT caozanxia naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT yangyuedong naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT wangchunling naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT suzhendong naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT zhaoyawei naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT wangjihua naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences
AT zhouyaoqi naturalproteinsequencesaremoreintrinsicallydisorderedthanrandomsequences