Cargando…

GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies

Since proteins evolve by divergent evolution, proteins with distant homology to each other may or may not bear similar functions. Improved computational approaches are required to recognize distant homologues that are functionally similar. One of the methods of assigning function to sequences is to...

Descripción completa

Detalles Bibliográficos
Autores principales: Iyer, Meenakshi S, Bhargava, Kartik, Pavalam, Murugavel, Sowdhamini, Ramanathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6446967/
https://www.ncbi.nlm.nih.gov/pubmed/30943284
http://dx.doi.org/10.1093/database/baz042
_version_ 1783408445528997888
author Iyer, Meenakshi S
Bhargava, Kartik
Pavalam, Murugavel
Sowdhamini, Ramanathan
author_facet Iyer, Meenakshi S
Bhargava, Kartik
Pavalam, Murugavel
Sowdhamini, Ramanathan
author_sort Iyer, Meenakshi S
collection PubMed
description Since proteins evolve by divergent evolution, proteins with distant homology to each other may or may not bear similar functions. Improved computational approaches are required to recognize distant homologues that are functionally similar. One of the methods of assigning function to sequences is to use profiles derived from sequences of known structure. We describe an update of the Genomic Distribution of protein structural domain Superfamilies (GenDiS) database, namely GenDiS+, which provides a projection of SCOP superfamily members on the sequence space (NR database, NCBI). The sequences are validated using structure-based sequence alignment profiles and domain and full-length sequence alignments. GenDiS+ is a `tour de force’ for detecting homologues within around 160 000 taxonomic identifiers, starting from nearly 11 000 domains of known structure. Features, like full-sequence alignment and phylogeny, domain sequence alignment and phylogeny, list of associated structural and sequence domains with strength of interactions, links to databases like Pfam, UniProt and ModBase and list of sequences with a PDB structure, are provided.
format Online
Article
Text
id pubmed-6446967
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64469672019-04-09 GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies Iyer, Meenakshi S Bhargava, Kartik Pavalam, Murugavel Sowdhamini, Ramanathan Database (Oxford) Database Update Since proteins evolve by divergent evolution, proteins with distant homology to each other may or may not bear similar functions. Improved computational approaches are required to recognize distant homologues that are functionally similar. One of the methods of assigning function to sequences is to use profiles derived from sequences of known structure. We describe an update of the Genomic Distribution of protein structural domain Superfamilies (GenDiS) database, namely GenDiS+, which provides a projection of SCOP superfamily members on the sequence space (NR database, NCBI). The sequences are validated using structure-based sequence alignment profiles and domain and full-length sequence alignments. GenDiS+ is a `tour de force’ for detecting homologues within around 160 000 taxonomic identifiers, starting from nearly 11 000 domains of known structure. Features, like full-sequence alignment and phylogeny, domain sequence alignment and phylogeny, list of associated structural and sequence domains with strength of interactions, links to databases like Pfam, UniProt and ModBase and list of sequences with a PDB structure, are provided. Oxford University Press 2019-04-03 /pmc/articles/PMC6446967/ /pubmed/30943284 http://dx.doi.org/10.1093/database/baz042 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Update
Iyer, Meenakshi S
Bhargava, Kartik
Pavalam, Murugavel
Sowdhamini, Ramanathan
GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title_full GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title_fullStr GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title_full_unstemmed GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title_short GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
title_sort gendis database update with improved approach and features to recognize homologous sequences of protein domain superfamilies
topic Database Update
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6446967/
https://www.ncbi.nlm.nih.gov/pubmed/30943284
http://dx.doi.org/10.1093/database/baz042
work_keys_str_mv AT iyermeenakshis gendisdatabaseupdatewithimprovedapproachandfeaturestorecognizehomologoussequencesofproteindomainsuperfamilies
AT bhargavakartik gendisdatabaseupdatewithimprovedapproachandfeaturestorecognizehomologoussequencesofproteindomainsuperfamilies
AT pavalammurugavel gendisdatabaseupdatewithimprovedapproachandfeaturestorecognizehomologoussequencesofproteindomainsuperfamilies
AT sowdhaminiramanathan gendisdatabaseupdatewithimprovedapproachandfeaturestorecognizehomologoussequencesofproteindomainsuperfamilies