Cargando…

CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures

Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and appl...

Descripción completa

Detalles Bibliográficos
Autores principales: Bostan, Hamed, Salim, Naomie, Hussein, Zeti Azura, Klappa, Peter, Shamsir, Mohd Shahir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474208/
https://www.ncbi.nlm.nih.gov/pubmed/23091487
http://dx.doi.org/10.1155/2012/849830
_version_ 1782246778589937664
author Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Shamsir, Mohd Shahir
author_facet Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Shamsir, Mohd Shahir
author_sort Bostan, Hamed
collection PubMed
description Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD) is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.
format Online
Article
Text
id pubmed-3474208
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-34742082012-10-22 CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures Bostan, Hamed Salim, Naomie Hussein, Zeti Azura Klappa, Peter Shamsir, Mohd Shahir Adv Bioinformatics Research Article Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD) is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition. Hindawi Publishing Corporation 2012 2012-10-10 /pmc/articles/PMC3474208/ /pubmed/23091487 http://dx.doi.org/10.1155/2012/849830 Text en Copyright © 2012 Hamed Bostan et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Shamsir, Mohd Shahir
CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title_full CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title_fullStr CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title_full_unstemmed CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title_short CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures
title_sort cmd: a database to store the bonding states of cysteine motifs with secondary structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3474208/
https://www.ncbi.nlm.nih.gov/pubmed/23091487
http://dx.doi.org/10.1155/2012/849830
work_keys_str_mv AT bostanhamed cmdadatabasetostorethebondingstatesofcysteinemotifswithsecondarystructures
AT salimnaomie cmdadatabasetostorethebondingstatesofcysteinemotifswithsecondarystructures
AT husseinzetiazura cmdadatabasetostorethebondingstatesofcysteinemotifswithsecondarystructures
AT klappapeter cmdadatabasetostorethebondingstatesofcysteinemotifswithsecondarystructures
AT shamsirmohdshahir cmdadatabasetostorethebondingstatesofcysteinemotifswithsecondarystructures