Cargando…

NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection

NrichD (http://proline.biochem.iisc.ernet.in/NRICHD/) is a database of computationally designed protein-like sequences, augmented into natural sequence databases that can perform hops in protein sequence space to assist in the detection of remote relationships. Establishing protein relationships in...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mudgal, Richa, Sandhya, Sankaran, Kumar, Gayatri, Sowdhamini, Ramanathan, Chandra, Nagasuma R., Srinivasan, Narayanaswamy
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2014
Materias:	Database Issue
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384005/ https://www.ncbi.nlm.nih.gov/pubmed/25262355 http://dx.doi.org/10.1093/nar/gku888

_version_	1782364831158894592
author	Mudgal, Richa Sandhya, Sankaran Kumar, Gayatri Sowdhamini, Ramanathan Chandra, Nagasuma R. Srinivasan, Narayanaswamy
author_facet	Mudgal, Richa Sandhya, Sankaran Kumar, Gayatri Sowdhamini, Ramanathan Chandra, Nagasuma R. Srinivasan, Narayanaswamy
author_sort	Mudgal, Richa
collection	PubMed
description	NrichD (http://proline.biochem.iisc.ernet.in/NRICHD/) is a database of computationally designed protein-like sequences, augmented into natural sequence databases that can perform hops in protein sequence space to assist in the detection of remote relationships. Establishing protein relationships in the absence of structural evidence or natural ‘intermediately related sequences’ is a challenging task. Recently, we have demonstrated that the computational design of artificial intermediary sequences/linkers is an effective approach to fill naturally occurring voids in protein sequence space. Through a large-scale assessment we have demonstrated that such sequences can be plugged into commonly employed search databases to improve the performance of routinely used sequence search methods in detecting remote relationships. Since it is anticipated that such data sets will be employed to establish protein relationships, two databases that have already captured these relationships at the structural and functional domain level, namely, the SCOP database and the Pfam database, have been ‘enriched’ with these artificial intermediary sequences. NrichD database currently contains 3 611 010 artificial sequences that have been generated between 27 882 pairs of families from 374 SCOP folds. The data sets are freely available for download. Additional features include the design of artificial sequences between any two protein families of interest to the user.
format	Online Article Text
id	pubmed-4384005
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-43840052015-04-08 NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection Mudgal, Richa Sandhya, Sankaran Kumar, Gayatri Sowdhamini, Ramanathan Chandra, Nagasuma R. Srinivasan, Narayanaswamy Nucleic Acids Res Database Issue NrichD (http://proline.biochem.iisc.ernet.in/NRICHD/) is a database of computationally designed protein-like sequences, augmented into natural sequence databases that can perform hops in protein sequence space to assist in the detection of remote relationships. Establishing protein relationships in the absence of structural evidence or natural ‘intermediately related sequences’ is a challenging task. Recently, we have demonstrated that the computational design of artificial intermediary sequences/linkers is an effective approach to fill naturally occurring voids in protein sequence space. Through a large-scale assessment we have demonstrated that such sequences can be plugged into commonly employed search databases to improve the performance of routinely used sequence search methods in detecting remote relationships. Since it is anticipated that such data sets will be employed to establish protein relationships, two databases that have already captured these relationships at the structural and functional domain level, namely, the SCOP database and the Pfam database, have been ‘enriched’ with these artificial intermediary sequences. NrichD database currently contains 3 611 010 artificial sequences that have been generated between 27 882 pairs of families from 374 SCOP folds. The data sets are freely available for download. Additional features include the design of artificial sequences between any two protein families of interest to the user. Oxford University Press 2014-09-27 2015-01-28 /pmc/articles/PMC4384005/ /pubmed/25262355 http://dx.doi.org/10.1093/nar/gku888 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Database Issue Mudgal, Richa Sandhya, Sankaran Kumar, Gayatri Sowdhamini, Ramanathan Chandra, Nagasuma R. Srinivasan, Narayanaswamy NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title	NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title_full	NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title_fullStr	NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title_full_unstemmed	NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title_short	NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
title_sort	nrichd database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection
topic	Database Issue
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384005/ https://www.ncbi.nlm.nih.gov/pubmed/25262355 http://dx.doi.org/10.1093/nar/gku888
work_keys_str_mv	AT mudgalricha nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection AT sandhyasankaran nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection AT kumargayatri nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection AT sowdhaminiramanathan nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection AT chandranagasumar nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection AT srinivasannarayanaswamy nrichddatabasesequencedatabasesenrichedwithcomputationallydesignedproteinlikesequencesaidinremotehomologydetection

NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection

Ejemplares similares