Cargando…

Master Blaster: an approach to sensitive identification of remotely related proteins

Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to de...

Descripción completa

Detalles Bibliográficos
Autores principales: Janaki, Chintalapati, Gowri, Venkatraman S., Srinivasan, Narayanaswamy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062480/
https://www.ncbi.nlm.nih.gov/pubmed/33888741
http://dx.doi.org/10.1038/s41598-021-87833-4
_version_ 1783681773554630656
author Janaki, Chintalapati
Gowri, Venkatraman S.
Srinivasan, Narayanaswamy
author_facet Janaki, Chintalapati
Gowri, Venkatraman S.
Srinivasan, Narayanaswamy
author_sort Janaki, Chintalapati
collection PubMed
description Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, ‘Master Blaster’, which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against that of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46-folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters indicating that for detecting remote homologues, use of multiple methods employing a combination of different approaches can be more effective in detecting remote homologs. Master Blaster stand-alone code is available for download in the supplementary archive.
format Online
Article
Text
id pubmed-8062480
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80624802021-04-23 Master Blaster: an approach to sensitive identification of remotely related proteins Janaki, Chintalapati Gowri, Venkatraman S. Srinivasan, Narayanaswamy Sci Rep Article Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, ‘Master Blaster’, which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against that of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46-folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters indicating that for detecting remote homologues, use of multiple methods employing a combination of different approaches can be more effective in detecting remote homologs. Master Blaster stand-alone code is available for download in the supplementary archive. Nature Publishing Group UK 2021-04-22 /pmc/articles/PMC8062480/ /pubmed/33888741 http://dx.doi.org/10.1038/s41598-021-87833-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Janaki, Chintalapati
Gowri, Venkatraman S.
Srinivasan, Narayanaswamy
Master Blaster: an approach to sensitive identification of remotely related proteins
title Master Blaster: an approach to sensitive identification of remotely related proteins
title_full Master Blaster: an approach to sensitive identification of remotely related proteins
title_fullStr Master Blaster: an approach to sensitive identification of remotely related proteins
title_full_unstemmed Master Blaster: an approach to sensitive identification of remotely related proteins
title_short Master Blaster: an approach to sensitive identification of remotely related proteins
title_sort master blaster: an approach to sensitive identification of remotely related proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062480/
https://www.ncbi.nlm.nih.gov/pubmed/33888741
http://dx.doi.org/10.1038/s41598-021-87833-4
work_keys_str_mv AT janakichintalapati masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins
AT gowrivenkatramans masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins
AT srinivasannarayanaswamy masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins