Cargando…

SEQATOMS: a web tool for identifying missing regions in PDB in sequence context

With over 46 000 proteins, the Protein Data Bank (PDB) is the most important database with structural information of biological macromolecules. PDB files contain sequence and coordinate information. Residues present in the sequence can be absent from the coordinate section, which means their positio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Brandt, Bernd W., Heringa, Jaap, Leunissen, Jack A. M.
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2008
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447787/ https://www.ncbi.nlm.nih.gov/pubmed/18463137 http://dx.doi.org/10.1093/nar/gkn237

_version_	1782156999402717184
author	Brandt, Bernd W. Heringa, Jaap Leunissen, Jack A. M.
author_facet	Brandt, Bernd W. Heringa, Jaap Leunissen, Jack A. M.
author_sort	Brandt, Bernd W.
collection	PubMed
description	With over 46 000 proteins, the Protein Data Bank (PDB) is the most important database with structural information of biological macromolecules. PDB files contain sequence and coordinate information. Residues present in the sequence can be absent from the coordinate section, which means their position in space is unknown. Similarity searches are routinely carried out against sequences taken from PDB SEQRES. However, there no distinction is made between residues that have a known or unknown position in the 3D protein structure. We present a FASTA sequence database that is produced by combining the sequence and coordinate information. All residues absent from the PDB coordinate section are masked with lower-case letters, thereby providing a view of these residues in the context of the entire protein sequence, which facilitates inspecting ‘missing’ regions. We also provide a masked version of the CATH domain database. A user-friendly BLAST interface is available for similarity searching. In contrast to standard (stand-alone) BLAST output, which only contains upper-case letters, our output retains the lower-case letters of the masked regions. Thus, our server can be used to perform BLAST searching case-sensitively. Here, we have applied it to the study of missing regions in their sequence context. SEQATOMS is available at http://www.bioinformatics.nl/tools/seqatoms/.
format	Text
id	pubmed-2447787
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-24477872008-07-09 SEQATOMS: a web tool for identifying missing regions in PDB in sequence context Brandt, Bernd W. Heringa, Jaap Leunissen, Jack A. M. Nucleic Acids Res Articles With over 46 000 proteins, the Protein Data Bank (PDB) is the most important database with structural information of biological macromolecules. PDB files contain sequence and coordinate information. Residues present in the sequence can be absent from the coordinate section, which means their position in space is unknown. Similarity searches are routinely carried out against sequences taken from PDB SEQRES. However, there no distinction is made between residues that have a known or unknown position in the 3D protein structure. We present a FASTA sequence database that is produced by combining the sequence and coordinate information. All residues absent from the PDB coordinate section are masked with lower-case letters, thereby providing a view of these residues in the context of the entire protein sequence, which facilitates inspecting ‘missing’ regions. We also provide a masked version of the CATH domain database. A user-friendly BLAST interface is available for similarity searching. In contrast to standard (stand-alone) BLAST output, which only contains upper-case letters, our output retains the lower-case letters of the masked regions. Thus, our server can be used to perform BLAST searching case-sensitively. Here, we have applied it to the study of missing regions in their sequence context. SEQATOMS is available at http://www.bioinformatics.nl/tools/seqatoms/. Oxford University Press 2008-07-01 2008-05-07 /pmc/articles/PMC2447787/ /pubmed/18463137 http://dx.doi.org/10.1093/nar/gkn237 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Articles Brandt, Bernd W. Heringa, Jaap Leunissen, Jack A. M. SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title	SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title_full	SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title_fullStr	SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title_full_unstemmed	SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title_short	SEQATOMS: a web tool for identifying missing regions in PDB in sequence context
title_sort	seqatoms: a web tool for identifying missing regions in pdb in sequence context
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447787/ https://www.ncbi.nlm.nih.gov/pubmed/18463137 http://dx.doi.org/10.1093/nar/gkn237
work_keys_str_mv	AT brandtberndw seqatomsawebtoolforidentifyingmissingregionsinpdbinsequencecontext AT heringajaap seqatomsawebtoolforidentifyingmissingregionsinpdbinsequencecontext AT leunissenjackam seqatomsawebtoolforidentifyingmissingregionsinpdbinsequencecontext

SEQATOMS: a web tool for identifying missing regions in PDB in sequence context

Ejemplares similares