Cargando…
Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus
BACKGROUND: Hepatitis B virus (HBV) as an infectious disease that has nine genotypes (A - I) and a ‘putative’ genotype J. OBJECTIVES: The aim of this study was to identify the rare codon clusters (RCC) in the HBV genome and to evaluate these RCCs in the HBV proteins structure. METHODS: For detection...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Kowsar
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5116127/ https://www.ncbi.nlm.nih.gov/pubmed/27882067 http://dx.doi.org/10.5812/hepatmon.39909 |
_version_ | 1782468618945036288 |
---|---|
author | Mortazavi, Mojtaba Zarenezhad, Mohammad Gholamzadeh, Saeid Alavian, Seyed Moayed Ghorbani, Mohammad Dehghani, Reza Malekpour, Abdorrasoul Meshkibaf, Mohammadhasan Fakhrzad, Ali |
author_facet | Mortazavi, Mojtaba Zarenezhad, Mohammad Gholamzadeh, Saeid Alavian, Seyed Moayed Ghorbani, Mohammad Dehghani, Reza Malekpour, Abdorrasoul Meshkibaf, Mohammadhasan Fakhrzad, Ali |
author_sort | Mortazavi, Mojtaba |
collection | PubMed |
description | BACKGROUND: Hepatitis B virus (HBV) as an infectious disease that has nine genotypes (A - I) and a ‘putative’ genotype J. OBJECTIVES: The aim of this study was to identify the rare codon clusters (RCC) in the HBV genome and to evaluate these RCCs in the HBV proteins structure. METHODS: For detection of protein family accession numbers (Pfam) in HBV proteins, the UniProt database and Pfam search tool were used. Protein family accession numbers is a comprehensive and accurate collection of protein domains and families. It contains annotation of each family in the form of textual descriptions, links to other resources and literature references. Genome projects have used Pfam extensively for large-scale functional annotation of genomic data; Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). The Pfam search tools are databases that identify Pfam of proteins. These Pfam IDs were analyzed in Sherlocc program and the location of RCCs in HBV genome and proteins were detected and reported as translated EMBL nucleotide sequence data library (TrEMBL) entries. The TrEMBL is a computer-annotated supplement of SWISS-PROT that contains all the translations of European molecular biology laboratory (EMBL) nucleotide sequence entries not yet integrated in SWISS-PROT. Furthermore, the structures of TrEMBL entries proteins were studied in the PDB database and 3D structures of the HBV proteins and locations of RCCs were visualized and studied using Swiss PDB Viewer software®. RESULTS: The Pfam search tool found nine protein families in three frames. Results of Pfams studies in the Sherlocc program showed that this program has not identified RCCs in the external core antigen (PF08290) and truncated HBeAg gene (PF08290) of HBV. By contrast, the RCCs were identified in gene of hepatitis core antigen (PF00906 and the residues 224 - 234 and 251 - 255), large envelope protein S (PF00695 and the residues 53-56 and 70 - 84), X protein (PF00739 and the residues 10 - 24, 29 - 83, 95 - 99. 122 - 129, 139 - 143), DNA polymerase (viral) N-terminal domain (PF00242 and the residues 59 - 62, 214 - 217, 407 - 413) and protein P (Pf00336 and the residues 225 - 228). In HBV genome, seven RCCs were identified in the gene area of hepatitis core antigen, large envelope protein S and DNA polymerase, while protein structures of TrEMBL entries sequences found in Sherlocc program outputs were not complete. CONCLUSIONS: Based on the location of detected RCCs in the structure of HBV proteins, it was found that these RCCs may have a critical role in correct folding of HBV proteins and can be considered as drug targets. The results of this study provide new and deep perspectives about structure of HBV proteins for further researches and designing new drugs for treatment of HBV. |
format | Online Article Text |
id | pubmed-5116127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Kowsar |
record_format | MEDLINE/PubMed |
spelling | pubmed-51161272016-11-23 Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus Mortazavi, Mojtaba Zarenezhad, Mohammad Gholamzadeh, Saeid Alavian, Seyed Moayed Ghorbani, Mohammad Dehghani, Reza Malekpour, Abdorrasoul Meshkibaf, Mohammadhasan Fakhrzad, Ali Hepat Mon Research Article BACKGROUND: Hepatitis B virus (HBV) as an infectious disease that has nine genotypes (A - I) and a ‘putative’ genotype J. OBJECTIVES: The aim of this study was to identify the rare codon clusters (RCC) in the HBV genome and to evaluate these RCCs in the HBV proteins structure. METHODS: For detection of protein family accession numbers (Pfam) in HBV proteins, the UniProt database and Pfam search tool were used. Protein family accession numbers is a comprehensive and accurate collection of protein domains and families. It contains annotation of each family in the form of textual descriptions, links to other resources and literature references. Genome projects have used Pfam extensively for large-scale functional annotation of genomic data; Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). The Pfam search tools are databases that identify Pfam of proteins. These Pfam IDs were analyzed in Sherlocc program and the location of RCCs in HBV genome and proteins were detected and reported as translated EMBL nucleotide sequence data library (TrEMBL) entries. The TrEMBL is a computer-annotated supplement of SWISS-PROT that contains all the translations of European molecular biology laboratory (EMBL) nucleotide sequence entries not yet integrated in SWISS-PROT. Furthermore, the structures of TrEMBL entries proteins were studied in the PDB database and 3D structures of the HBV proteins and locations of RCCs were visualized and studied using Swiss PDB Viewer software®. RESULTS: The Pfam search tool found nine protein families in three frames. Results of Pfams studies in the Sherlocc program showed that this program has not identified RCCs in the external core antigen (PF08290) and truncated HBeAg gene (PF08290) of HBV. By contrast, the RCCs were identified in gene of hepatitis core antigen (PF00906 and the residues 224 - 234 and 251 - 255), large envelope protein S (PF00695 and the residues 53-56 and 70 - 84), X protein (PF00739 and the residues 10 - 24, 29 - 83, 95 - 99. 122 - 129, 139 - 143), DNA polymerase (viral) N-terminal domain (PF00242 and the residues 59 - 62, 214 - 217, 407 - 413) and protein P (Pf00336 and the residues 225 - 228). In HBV genome, seven RCCs were identified in the gene area of hepatitis core antigen, large envelope protein S and DNA polymerase, while protein structures of TrEMBL entries sequences found in Sherlocc program outputs were not complete. CONCLUSIONS: Based on the location of detected RCCs in the structure of HBV proteins, it was found that these RCCs may have a critical role in correct folding of HBV proteins and can be considered as drug targets. The results of this study provide new and deep perspectives about structure of HBV proteins for further researches and designing new drugs for treatment of HBV. Kowsar 2016-10-04 /pmc/articles/PMC5116127/ /pubmed/27882067 http://dx.doi.org/10.5812/hepatmon.39909 Text en Copyright © 2016, Kowsar Corp http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/) which permits copy and redistribute the material just in noncommercial usages, provided the original work is properly cited. |
spellingShingle | Research Article Mortazavi, Mojtaba Zarenezhad, Mohammad Gholamzadeh, Saeid Alavian, Seyed Moayed Ghorbani, Mohammad Dehghani, Reza Malekpour, Abdorrasoul Meshkibaf, Mohammadhasan Fakhrzad, Ali Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title | Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title_full | Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title_fullStr | Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title_full_unstemmed | Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title_short | Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus |
title_sort | bioinformatic identification of rare codon clusters (rccs) in hbv genome and evaluation of rccs in proteins structure of hepatitis b virus |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5116127/ https://www.ncbi.nlm.nih.gov/pubmed/27882067 http://dx.doi.org/10.5812/hepatmon.39909 |
work_keys_str_mv | AT mortazavimojtaba bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT zarenezhadmohammad bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT gholamzadehsaeid bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT alavianseyedmoayed bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT ghorbanimohammad bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT dehghanireza bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT malekpourabdorrasoul bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT meshkibafmohammadhasan bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus AT fakhrzadali bioinformaticidentificationofrarecodonclustersrccsinhbvgenomeandevaluationofrccsinproteinsstructureofhepatitisbvirus |