Cargando…
Sequence‐based searching of custom proteome and transcriptome databases
A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. W...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144439/ https://www.ncbi.nlm.nih.gov/pubmed/30230259 http://dx.doi.org/10.14814/phy2.13846 |
_version_ | 1783356103283703808 |
---|---|
author | Medvar, Barbara Sarkar, Abhijit Knepper, Mark Pisitkun, Trairak |
author_facet | Medvar, Barbara Sarkar, Abhijit Knepper, Mark Pisitkun, Trairak |
author_sort | Medvar, Barbara |
collection | PubMed |
description | A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. We have curated a list of proteins expressed in the rat renal inner medullary collecting duct (IMCD) from proteomic data from 18 different publications. The database has been posted as a public resource at https://hpcwebapps.cit.nih.gov/ ESBL/Database/IMCD_Proteome_Database/. It includes 8956 different proteins. To search the IMCD Proteomic Database efficiently, we have created a Java‐based program called curated database Basic Local Alignment Search Tool (cdbBLAST), which uses the NCBI BLAST kernel to search for specific amino acid sequences corresponding to proteins in the database. cdbBLAST reports information on the matched protein and identifies proteins in the database that have similar sequences. We have also adapted cdbBLAST to interrogate our previously published IMCD Transcriptome Database. We have made the cdbBLAST program available for use either as a web application or a downloadable .jar file at https://hpcwebapps.cit.nih.gov/ ESBL/Database/cdbBLAST/. Database searching based on protein sequence removes ambiguities arising from the standard search method based on official gene symbols and allows the user efficient identification of related proteins that may fulfill the same functional roles. |
format | Online Article Text |
id | pubmed-6144439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-61444392018-09-24 Sequence‐based searching of custom proteome and transcriptome databases Medvar, Barbara Sarkar, Abhijit Knepper, Mark Pisitkun, Trairak Physiol Rep Original Research A long‐term goal in renal physiology is to understand the mechanisms involved in collecting duct function and regulation at a cellular and molecular level. The first step in modeling of these mechanisms, which can provide a guide to experimentation, is the generation of a list of model components. We have curated a list of proteins expressed in the rat renal inner medullary collecting duct (IMCD) from proteomic data from 18 different publications. The database has been posted as a public resource at https://hpcwebapps.cit.nih.gov/ ESBL/Database/IMCD_Proteome_Database/. It includes 8956 different proteins. To search the IMCD Proteomic Database efficiently, we have created a Java‐based program called curated database Basic Local Alignment Search Tool (cdbBLAST), which uses the NCBI BLAST kernel to search for specific amino acid sequences corresponding to proteins in the database. cdbBLAST reports information on the matched protein and identifies proteins in the database that have similar sequences. We have also adapted cdbBLAST to interrogate our previously published IMCD Transcriptome Database. We have made the cdbBLAST program available for use either as a web application or a downloadable .jar file at https://hpcwebapps.cit.nih.gov/ ESBL/Database/cdbBLAST/. Database searching based on protein sequence removes ambiguities arising from the standard search method based on official gene symbols and allows the user efficient identification of related proteins that may fulfill the same functional roles. John Wiley and Sons Inc. 2018-09-19 /pmc/articles/PMC6144439/ /pubmed/30230259 http://dx.doi.org/10.14814/phy2.13846 Text en Published 2018. This article is a U.S. Government work and is in the public domain in the USA. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Medvar, Barbara Sarkar, Abhijit Knepper, Mark Pisitkun, Trairak Sequence‐based searching of custom proteome and transcriptome databases |
title | Sequence‐based searching of custom proteome and transcriptome databases |
title_full | Sequence‐based searching of custom proteome and transcriptome databases |
title_fullStr | Sequence‐based searching of custom proteome and transcriptome databases |
title_full_unstemmed | Sequence‐based searching of custom proteome and transcriptome databases |
title_short | Sequence‐based searching of custom proteome and transcriptome databases |
title_sort | sequence‐based searching of custom proteome and transcriptome databases |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144439/ https://www.ncbi.nlm.nih.gov/pubmed/30230259 http://dx.doi.org/10.14814/phy2.13846 |
work_keys_str_mv | AT medvarbarbara sequencebasedsearchingofcustomproteomeandtranscriptomedatabases AT sarkarabhijit sequencebasedsearchingofcustomproteomeandtranscriptomedatabases AT kneppermark sequencebasedsearchingofcustomproteomeandtranscriptomedatabases AT pisitkuntrairak sequencebasedsearchingofcustomproteomeandtranscriptomedatabases |