Cargando…

SeQuery: an interactive graph database for visualizing the GPCR superfamily

The rate at which new protein and gene sequences are being discovered has grown explosively in the omics era, which has increasingly complicated the efficient characterization and analysis of their biological properties. In this study, we propose a web-based graphical database tool, SeQuery, for int...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Geng-Ming, Secario, M K, Chen, Chi-Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6591535/
https://www.ncbi.nlm.nih.gov/pubmed/31236561
http://dx.doi.org/10.1093/database/baz073
_version_ 1783429751282597888
author Hu, Geng-Ming
Secario, M K
Chen, Chi-Ming
author_facet Hu, Geng-Ming
Secario, M K
Chen, Chi-Ming
author_sort Hu, Geng-Ming
collection PubMed
description The rate at which new protein and gene sequences are being discovered has grown explosively in the omics era, which has increasingly complicated the efficient characterization and analysis of their biological properties. In this study, we propose a web-based graphical database tool, SeQuery, for intuitively visualizing proteome/genome networks by integrating the sequential, structural and functional information of sequences. As a demonstration of our tool’s effectiveness, we constructed a graph database of G protein-coupled receptor (GPCR) sequences by integrating data from the UniProt, GPCRdb and RCSB PDB databases. Our tool attempts to achieve two goals: (i) given the sequence of a query protein, correctly and efficiently identify whether the protein is a GPCR, and, if so, define its sequential and functional roles in the GPCR superfamily; and (ii) present a panoramic view of the GPCR superfamily and its network centralities that allows users to explore the superfamily at various resolutions. Such a bottom-up-to-top-down view can provide the users with a comprehensive understanding of the GPCR superfamily through interactive navigation of the graph database. A test of SeQuery with the GPCR2841 dataset shows that it correctly identifies 99 out of 100 queried protein sequences. The developed tool is readily applicable to other biological networks, and we aim to expand SeQuery by including additional biological databases in the near future.
format Online
Article
Text
id pubmed-6591535
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65915352019-07-01 SeQuery: an interactive graph database for visualizing the GPCR superfamily Hu, Geng-Ming Secario, M K Chen, Chi-Ming Database (Oxford) Original Article The rate at which new protein and gene sequences are being discovered has grown explosively in the omics era, which has increasingly complicated the efficient characterization and analysis of their biological properties. In this study, we propose a web-based graphical database tool, SeQuery, for intuitively visualizing proteome/genome networks by integrating the sequential, structural and functional information of sequences. As a demonstration of our tool’s effectiveness, we constructed a graph database of G protein-coupled receptor (GPCR) sequences by integrating data from the UniProt, GPCRdb and RCSB PDB databases. Our tool attempts to achieve two goals: (i) given the sequence of a query protein, correctly and efficiently identify whether the protein is a GPCR, and, if so, define its sequential and functional roles in the GPCR superfamily; and (ii) present a panoramic view of the GPCR superfamily and its network centralities that allows users to explore the superfamily at various resolutions. Such a bottom-up-to-top-down view can provide the users with a comprehensive understanding of the GPCR superfamily through interactive navigation of the graph database. A test of SeQuery with the GPCR2841 dataset shows that it correctly identifies 99 out of 100 queried protein sequences. The developed tool is readily applicable to other biological networks, and we aim to expand SeQuery by including additional biological databases in the near future. Oxford University Press 2019-06-25 /pmc/articles/PMC6591535/ /pubmed/31236561 http://dx.doi.org/10.1093/database/baz073 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Hu, Geng-Ming
Secario, M K
Chen, Chi-Ming
SeQuery: an interactive graph database for visualizing the GPCR superfamily
title SeQuery: an interactive graph database for visualizing the GPCR superfamily
title_full SeQuery: an interactive graph database for visualizing the GPCR superfamily
title_fullStr SeQuery: an interactive graph database for visualizing the GPCR superfamily
title_full_unstemmed SeQuery: an interactive graph database for visualizing the GPCR superfamily
title_short SeQuery: an interactive graph database for visualizing the GPCR superfamily
title_sort sequery: an interactive graph database for visualizing the gpcr superfamily
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6591535/
https://www.ncbi.nlm.nih.gov/pubmed/31236561
http://dx.doi.org/10.1093/database/baz073
work_keys_str_mv AT hugengming sequeryaninteractivegraphdatabaseforvisualizingthegpcrsuperfamily
AT secariomk sequeryaninteractivegraphdatabaseforvisualizingthegpcrsuperfamily
AT chenchiming sequeryaninteractivegraphdatabaseforvisualizingthegpcrsuperfamily