Systematic search for putative new domain families in Mycoplasma gallisepticum genome

Bibliographic Details
Main Authors: Reddy, Chilamakuri CS; Rani, Sane Sudha; Offmann, Bernard; Sowdhamini, R
Format: Text
Language: English
Published: BioMed Central 2010
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865477/
https://www.ncbi.nlm.nih.gov/pubmed/20384986
http://dx.doi.org/10.1186/1756-0500-3-98
_version_ 1782180841000009728
author Reddy, Chilamakuri CS
Rani, Sane Sudha
Offmann, Bernard
Sowdhamini, R
author_facet Reddy, Chilamakuri CS
Rani, Sane Sudha
Offmann, Bernard
Sowdhamini, R
author_sort Reddy, Chilamakuri CS
collection PubMed
description BACKGROUND: Protein domains are the fundamental units of protein structure, function and evolution. Delineating the domains in a protein is important for classification and for understanding structure, function and evolution. The delineation of protein domains within a polypeptide chain, particularly at the genome scale, can be achieved in several ways but remains problematic in many instances. Difficulties in identifying the domain content of a given sequence arise when the query sequence has no homologues of experimentally determined structure and searches against sequence domain databases yield only insignificant matches. Identifying domains when sequence identity is low and structural homologues are lacking is therefore crucial, especially at the genomic scale. FINDINGS: We have developed a new method for the identification of domains in unassigned regions through indirect connections and scaled up its application to the analysis of 434 unassigned regions in 726 protein sequences of the Mycoplasma gallisepticum genome. We could establish 71 new domain relationships and 63 putative new domain families through intermediate sequences in the unassigned regions, which represents an overall 10% increase in PfamA domain annotation over direct assignment in this genome. CONCLUSIONS: The systematic analysis of the unassigned regions in the Mycoplasma gallisepticum genome has provided some insight into possible new domain relationships and putative new domain families. Further investigation of these predicted new domains may prove beneficial in improving existing domain prediction algorithms.
format Text
id pubmed-2865477
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2865477 2010-05-07 Systematic search for putative new domain families in Mycoplasma gallisepticum genome Reddy, Chilamakuri CS Rani, Sane Sudha Offmann, Bernard Sowdhamini, R BMC Res Notes Short Report BACKGROUND: Protein domains are the fundamental units of protein structure, function and evolution. Delineating the domains in a protein is important for classification and for understanding structure, function and evolution. The delineation of protein domains within a polypeptide chain, particularly at the genome scale, can be achieved in several ways but remains problematic in many instances. Difficulties in identifying the domain content of a given sequence arise when the query sequence has no homologues of experimentally determined structure and searches against sequence domain databases yield only insignificant matches. Identifying domains when sequence identity is low and structural homologues are lacking is therefore crucial, especially at the genomic scale. FINDINGS: We have developed a new method for the identification of domains in unassigned regions through indirect connections and scaled up its application to the analysis of 434 unassigned regions in 726 protein sequences of the Mycoplasma gallisepticum genome. We could establish 71 new domain relationships and 63 putative new domain families through intermediate sequences in the unassigned regions, which represents an overall 10% increase in PfamA domain annotation over direct assignment in this genome. CONCLUSIONS: The systematic analysis of the unassigned regions in the Mycoplasma gallisepticum genome has provided some insight into possible new domain relationships and putative new domain families. Further investigation of these predicted new domains may prove beneficial in improving existing domain prediction algorithms.
BioMed Central 2010-04-12 /pmc/articles/PMC2865477/ /pubmed/20384986 http://dx.doi.org/10.1186/1756-0500-3-98 Text en Copyright ©2010 Offmann et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Reddy, Chilamakuri CS
Rani, Sane Sudha
Offmann, Bernard
Sowdhamini, R
Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title_full Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title_fullStr Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title_full_unstemmed Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title_short Systematic search for putative new domain families in Mycoplasma gallisepticum genome
title_sort systematic search for putative new domain families in mycoplasma gallisepticum genome
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865477/
https://www.ncbi.nlm.nih.gov/pubmed/20384986
http://dx.doi.org/10.1186/1756-0500-3-98
work_keys_str_mv AT reddychilamakurics systematicsearchforputativenewdomainfamiliesinmycoplasmagallisepticumgenome
AT ranisanesudha systematicsearchforputativenewdomainfamiliesinmycoplasmagallisepticumgenome
AT offmannbernard systematicsearchforputativenewdomainfamiliesinmycoplasmagallisepticumgenome
AT sowdhaminir systematicsearchforputativenewdomainfamiliesinmycoplasmagallisepticumgenome