Cargando…

Sequence‐based searching of custom proteome and transcriptome databases

A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. W...

Descripción completa

Detalles Bibliográficos
Autores principales: Medvar, Barbara, Sarkar, Abhijit, Knepper, Mark, Pisitkun, Trairak
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144439/
https://www.ncbi.nlm.nih.gov/pubmed/30230259
http://dx.doi.org/10.14814/phy2.13846
_version_ 1783356103283703808
author Medvar, Barbara
Sarkar, Abhijit
Knepper, Mark
Pisitkun, Trairak
author_facet Medvar, Barbara
Sarkar, Abhijit
Knepper, Mark
Pisitkun, Trairak
author_sort Medvar, Barbara
collection PubMed
description A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. We have curated a list of proteins expressed in the rat renal inner medullary collecting duct (IMCD) from proteomic data from 18 different publications. The database has been posted as a public resource at https://hpcwebapps.cit.nih.gov/ ESBL/Database/IMCD_Proteome_Database/. It includes 8956 different proteins. To search the IMCD Proteomic Database efficiently, we have created a Java‐based program called curated database Basic Local Alignment Search Tool (cdbBLAST), which uses the NCBI BLAST kernel to search for specific amino acid sequences corresponding to proteins in the database. cdbBLAST reports information on the matched protein and identifies proteins in the database that have similar sequences. We have also adapted cdbBLAST to interrogate our previously published IMCD Transcriptome Database. We have made the cdbBLAST program available for use either as a web application or a downloadable .jar file at https://hpcwebapps.cit.nih.gov/ ESBL/Database/cdbBLAST/. Database searching based on protein sequence removes ambiguities arising from the standard search method based on official gene symbols and allows the user efficient identification of related proteins that may fulfill the same functional roles.
format Online
Article
Text
id pubmed-6144439
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-61444392018-09-24 Sequence‐based searching of custom proteome and transcriptome databases Medvar, Barbara Sarkar, Abhijit Knepper, Mark Pisitkun, Trairak Physiol Rep Original Research A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. We have curated a list of proteins expressed in the rat renal inner medullary collecting duct (IMCD) from proteomic data from 18 different publications. The database has been posted as a public resource at https://hpcwebapps.cit.nih.gov/ ESBL/Database/IMCD_Proteome_Database/. It includes 8956 different proteins. To search the IMCD Proteomic Database efficiently, we have created a Java‐based program called curated database Basic Local Alignment Search Tool (cdbBLAST), which uses the NCBI BLAST kernel to search for specific amino acid sequences corresponding to proteins in the database. cdbBLAST reports information on the matched protein and identifies proteins in the database that have similar sequences. We have also adapted cdbBLAST to interrogate our previously published IMCD Transcriptome Database. We have made the cdbBLAST program available for use either as a web application or a downloadable .jar file at https://hpcwebapps.cit.nih.gov/ ESBL/Database/cdbBLAST/. Database searching based on protein sequence removes ambiguities arising from the standard search method based on official gene symbols and allows the user efficient identification of related proteins that may fulfill the same functional roles. John Wiley and Sons Inc. 2018-09-19 /pmc/articles/PMC6144439/ /pubmed/30230259 http://dx.doi.org/10.14814/phy2.13846 Text en Published 2018. This article is a U.S. Government work and is in the public domain in the USA. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Medvar, Barbara
Sarkar, Abhijit
Knepper, Mark
Pisitkun, Trairak
Sequence‐based searching of custom proteome and transcriptome databases
title Sequence‐based searching of custom proteome and transcriptome databases
title_full Sequence‐based searching of custom proteome and transcriptome databases
title_fullStr Sequence‐based searching of custom proteome and transcriptome databases
title_full_unstemmed Sequence‐based searching of custom proteome and transcriptome databases
title_short Sequence‐based searching of custom proteome and transcriptome databases
title_sort sequence‐based searching of custom proteome and transcriptome databases
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144439/
https://www.ncbi.nlm.nih.gov/pubmed/30230259
http://dx.doi.org/10.14814/phy2.13846
work_keys_str_mv AT medvarbarbara sequencebasedsearchingofcustomproteomeandtranscriptomedatabases
AT sarkarabhijit sequencebasedsearchingofcustomproteomeandtranscriptomedatabases
AT kneppermark sequencebasedsearchingofcustomproteomeandtranscriptomedatabases
AT pisitkuntrairak sequencebasedsearchingofcustomproteomeandtranscriptomedatabases