Cargando…
Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling
The protein sequence-structure gap results from the contrast between rapid, low-cost deep sequencing, and slow, expensive experimental structure determination techniques. Comparative homology modelling may have the potential to close this gap by predicting protein structure in target sequences using...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7110651/ https://www.ncbi.nlm.nih.gov/pubmed/31377535 http://dx.doi.org/10.1016/j.jmgm.2019.07.014 |
_version_ | 1783513096341422080 |
---|---|
author | Mead, Dylan J.T. Lunagomez, Simón Gatherer, Derek |
author_facet | Mead, Dylan J.T. Lunagomez, Simón Gatherer, Derek |
author_sort | Mead, Dylan J.T. |
collection | PubMed |
description | The protein sequence-structure gap results from the contrast between rapid, low-cost deep sequencing, and slow, expensive experimental structure determination techniques. Comparative homology modelling may have the potential to close this gap by predicting protein structure in target sequences using existing experimentally solved structures as templates. This paper presents the first use of force-directed graphs for the visualization of sequence space in two dimensions, and applies them to the choice of suitable RNA-dependent RNA polymerase (RdRP) target-template pairs within human-infective RNA virus genera. Measures of centrality in protein sequence space for each genus were also derived and used to identify centroid nearest-neighbour sequences (CNNs) potentially useful for production of homology models most representative of their genera. Homology modelling was then carried out for target-template pairs in different species, different genera and different families, and model quality assessed using several metrics. Reconstructed ancestral RdRP sequences for individual genera were also used as templates for the production of ancestral RdRP homology models. High quality ancestral RdRP models were consistently produced, as were good quality models for target-template pairs in the same genus. Homology modelling between genera in the same family produced mixed results and inter-family modelling was unreliable. We present a protocol for the production of optimal RdRP homology models for use in further experiments, e.g. docking to discover novel anti-viral compounds. (219 words) |
format | Online Article Text |
id | pubmed-7110651 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Elsevier Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-71106512020-04-02 Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling Mead, Dylan J.T. Lunagomez, Simón Gatherer, Derek J Mol Graph Model Article The protein sequence-structure gap results from the contrast between rapid, low-cost deep sequencing, and slow, expensive experimental structure determination techniques. Comparative homology modelling may have the potential to close this gap by predicting protein structure in target sequences using existing experimentally solved structures as templates. This paper presents the first use of force-directed graphs for the visualization of sequence space in two dimensions, and applies them to the choice of suitable RNA-dependent RNA polymerase (RdRP) target-template pairs within human-infective RNA virus genera. Measures of centrality in protein sequence space for each genus were also derived and used to identify centroid nearest-neighbour sequences (CNNs) potentially useful for production of homology models most representative of their genera. Homology modelling was then carried out for target-template pairs in different species, different genera and different families, and model quality assessed using several metrics. Reconstructed ancestral RdRP sequences for individual genera were also used as templates for the production of ancestral RdRP homology models. High quality ancestral RdRP models were consistently produced, as were good quality models for target-template pairs in the same genus. Homology modelling between genera in the same family produced mixed results and inter-family modelling was unreliable. We present a protocol for the production of optimal RdRP homology models for use in further experiments, e.g. docking to discover novel anti-viral compounds. (219 words) Elsevier Inc. 2019-11 2019-07-26 /pmc/articles/PMC7110651/ /pubmed/31377535 http://dx.doi.org/10.1016/j.jmgm.2019.07.014 Text en © 2019 Elsevier Inc. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Mead, Dylan J.T. Lunagomez, Simón Gatherer, Derek Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title | Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title_full | Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title_fullStr | Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title_full_unstemmed | Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title_short | Visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
title_sort | visualization of protein sequence space with force-directed graphs, and their application to the choice of target-template pairs for homology modelling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7110651/ https://www.ncbi.nlm.nih.gov/pubmed/31377535 http://dx.doi.org/10.1016/j.jmgm.2019.07.014 |
work_keys_str_mv | AT meaddylanjt visualizationofproteinsequencespacewithforcedirectedgraphsandtheirapplicationtothechoiceoftargettemplatepairsforhomologymodelling AT lunagomezsimon visualizationofproteinsequencespacewithforcedirectedgraphsandtheirapplicationtothechoiceoftargettemplatepairsforhomologymodelling AT gathererderek visualizationofproteinsequencespacewithforcedirectedgraphsandtheirapplicationtothechoiceoftargettemplatepairsforhomologymodelling |