Cargando…

A Database of Domain Definitions for Proteins with Complex Interdomain Geometry

Protein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries...

Descripción completa

Detalles Bibliográficos
Autores principales: Majumdar, Indraneel, Kinch, Lisa N., Grishin, Nick V.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2662426/
https://www.ncbi.nlm.nih.gov/pubmed/19352501
http://dx.doi.org/10.1371/journal.pone.0005084
_version_ 1782165851016790016
author Majumdar, Indraneel
Kinch, Lisa N.
Grishin, Nick V.
author_facet Majumdar, Indraneel
Kinch, Lisa N.
Grishin, Nick V.
author_sort Majumdar, Indraneel
collection PubMed
description Protein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries are not available. Here, we present a novel database of manually defined structural domains for a representative set of proteins from the SCOP “multi-domain proteins” class. (http://prodata.swmed.edu/multidom/). We consider our domains as mobile evolutionary units, which may rearrange during protein evolution. Additionally, they may be visualized as structurally compact and possibly independently folding units. We also found that representing domains as evolutionary and folding units do not always lead to a unique domain definition. However, unlike existing databases, we retain and refine these “alternate” domain definitions after careful inspection of structural similarity, functional sites and automated domain definition methods. We provide domain definitions, including actual residue boundaries, for proteins that well known databases like SCOP and CATH do not attempt to split. Our alternate domain definitions are suitable for sequence and structure searches by automated methods. Additionally, the database can be used for training and testing domain delineation algorithms. Since our domains represent structurally compact evolutionary units, the database may be useful for studying domain properties and evolution.
format Text
id pubmed-2662426
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26624262009-04-08 A Database of Domain Definitions for Proteins with Complex Interdomain Geometry Majumdar, Indraneel Kinch, Lisa N. Grishin, Nick V. PLoS One Research Article Protein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries are not available. Here, we present a novel database of manually defined structural domains for a representative set of proteins from the SCOP “multi-domain proteins” class. (http://prodata.swmed.edu/multidom/). We consider our domains as mobile evolutionary units, which may rearrange during protein evolution. Additionally, they may be visualized as structurally compact and possibly independently folding units. We also found that representing domains as evolutionary and folding units do not always lead to a unique domain definition. However, unlike existing databases, we retain and refine these “alternate” domain definitions after careful inspection of structural similarity, functional sites and automated domain definition methods. We provide domain definitions, including actual residue boundaries, for proteins that well known databases like SCOP and CATH do not attempt to split. Our alternate domain definitions are suitable for sequence and structure searches by automated methods. Additionally, the database can be used for training and testing domain delineation algorithms. Since our domains represent structurally compact evolutionary units, the database may be useful for studying domain properties and evolution. Public Library of Science 2009-04-08 /pmc/articles/PMC2662426/ /pubmed/19352501 http://dx.doi.org/10.1371/journal.pone.0005084 Text en Majumdar et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Majumdar, Indraneel
Kinch, Lisa N.
Grishin, Nick V.
A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title_full A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title_fullStr A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title_full_unstemmed A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title_short A Database of Domain Definitions for Proteins with Complex Interdomain Geometry
title_sort database of domain definitions for proteins with complex interdomain geometry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2662426/
https://www.ncbi.nlm.nih.gov/pubmed/19352501
http://dx.doi.org/10.1371/journal.pone.0005084
work_keys_str_mv AT majumdarindraneel adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT kinchlisan adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT grishinnickv adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT majumdarindraneel databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT kinchlisan databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT grishinnickv databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry