Cargando…

Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy

We designed a novel strategy to define codon usage bias (CUB) in 6 specific small cohorts of human genes. We calculated codon usage (CU) values in 29 non-disease-causing (NDC) and 31 disease-causing (DC) human genes which are highly expressed in 3 distinct tissues, kidney, muscle, and skin. We appli...

Descripción completa

Detalles Bibliográficos
Autores principales: Rossi, Rachele, Fang, Mingyan, Zhu, Lin, Jiang, Chongyi, Yu, Cong, Flesia, Cristina, Nie, Chao, Li, Wenyan, Ferlini, Alessandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8970475/
https://www.ncbi.nlm.nih.gov/pubmed/35358230
http://dx.doi.org/10.1371/journal.pone.0265469
_version_ 1784679464013135872
author Rossi, Rachele
Fang, Mingyan
Zhu, Lin
Jiang, Chongyi
Yu, Cong
Flesia, Cristina
Nie, Chao
Li, Wenyan
Ferlini, Alessandra
author_facet Rossi, Rachele
Fang, Mingyan
Zhu, Lin
Jiang, Chongyi
Yu, Cong
Flesia, Cristina
Nie, Chao
Li, Wenyan
Ferlini, Alessandra
author_sort Rossi, Rachele
collection PubMed
description We designed a novel strategy to define codon usage bias (CUB) in 6 specific small cohorts of human genes. We calculated codon usage (CU) values in 29 non-disease-causing (NDC) and 31 disease-causing (DC) human genes which are highly expressed in 3 distinct tissues, kidney, muscle, and skin. We applied our strategy to the same selected genes annotated in 15 mammalian species. We obtained CUB hierarchical clusters for each gene cohort which showed tissue-specific and disease-specific CUB fingerprints. We showed that DC genes (especially those expressed in muscle) display a low CUB, well recognizable in codon hierarchical clustering. We defined the extremely biased codons as “zero codons” and found that their number is significantly higher in all DC genes, all tissues, and that this trend is conserved across mammals. Based on this calculation in different gene cohorts, we identified 5 codons which are more differentially used across genes and mammals, underlining that some genes have favorite synonymous codons in use. Since of the muscle genes clear clusters, and, among these, dystrophin gene surprisingly does not show any “zero codon” we adopted a novel approach to study CUB, we called “mapping-on-codons”. We positioned 2828 dystrophin missense and nonsense pathogenic variations on their respective codon, highlighting that its frequency and occurrence is not dependent on the CU values. We conclude our strategy consents to identify a hierarchical clustering of CU values in a gene cohort-specific fingerprints, with recognizable trend across mammals. In DC muscle genes also a disease-related fingerprint can be observed, allowing discrimination between DC and NDC genes. We propose that using our strategy which studies CU in specific gene cohorts, as rare disease genes, and tissue specific genes, may provide novel information about the CUB role in human and medical genetics, with implications on synonymous variations interpretation and codon optimization algorithms.
format Online
Article
Text
id pubmed-8970475
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89704752022-04-01 Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy Rossi, Rachele Fang, Mingyan Zhu, Lin Jiang, Chongyi Yu, Cong Flesia, Cristina Nie, Chao Li, Wenyan Ferlini, Alessandra PLoS One Research Article We designed a novel strategy to define codon usage bias (CUB) in 6 specific small cohorts of human genes. We calculated codon usage (CU) values in 29 non-disease-causing (NDC) and 31 disease-causing (DC) human genes which are highly expressed in 3 distinct tissues, kidney, muscle, and skin. We applied our strategy to the same selected genes annotated in 15 mammalian species. We obtained CUB hierarchical clusters for each gene cohort which showed tissue-specific and disease-specific CUB fingerprints. We showed that DC genes (especially those expressed in muscle) display a low CUB, well recognizable in codon hierarchical clustering. We defined the extremely biased codons as “zero codons” and found that their number is significantly higher in all DC genes, all tissues, and that this trend is conserved across mammals. Based on this calculation in different gene cohorts, we identified 5 codons which are more differentially used across genes and mammals, underlining that some genes have favorite synonymous codons in use. Since of the muscle genes clear clusters, and, among these, dystrophin gene surprisingly does not show any “zero codon” we adopted a novel approach to study CUB, we called “mapping-on-codons”. We positioned 2828 dystrophin missense and nonsense pathogenic variations on their respective codon, highlighting that its frequency and occurrence is not dependent on the CU values. We conclude our strategy consents to identify a hierarchical clustering of CU values in a gene cohort-specific fingerprints, with recognizable trend across mammals. In DC muscle genes also a disease-related fingerprint can be observed, allowing discrimination between DC and NDC genes. We propose that using our strategy which studies CU in specific gene cohorts, as rare disease genes, and tissue specific genes, may provide novel information about the CUB role in human and medical genetics, with implications on synonymous variations interpretation and codon optimization algorithms. Public Library of Science 2022-03-31 /pmc/articles/PMC8970475/ /pubmed/35358230 http://dx.doi.org/10.1371/journal.pone.0265469 Text en © 2022 Rossi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Rossi, Rachele
Fang, Mingyan
Zhu, Lin
Jiang, Chongyi
Yu, Cong
Flesia, Cristina
Nie, Chao
Li, Wenyan
Ferlini, Alessandra
Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title_full Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title_fullStr Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title_full_unstemmed Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title_short Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
title_sort calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8970475/
https://www.ncbi.nlm.nih.gov/pubmed/35358230
http://dx.doi.org/10.1371/journal.pone.0265469
work_keys_str_mv AT rossirachele calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT fangmingyan calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT zhulin calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT jiangchongyi calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT yucong calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT flesiacristina calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT niechao calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT liwenyan calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy
AT ferlinialessandra calculatingandcomparingcodonusagevaluesinrarediseasegeneshighlightscodonclusteringwithdiseaseandtissuespecifichierarchy