Cargando…

SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources

Proteins isolated from natural sources can be composed of a mixture of isoforms with similar physicochemical properties that coexist in the final steps of purification. Yet, even where unverified, the assumed sequence is enforced throughout the structural studies. Herein, we propose a novel perspect...

Descripción completa

Detalles Bibliográficos
Autores principales: Borges, Rafael J, Salvador, Guilherme H M, Pimenta, Daniel C, dos Santos, Lucilene D, Fontes, Marcos R M, Usón, Isabel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122596/
https://www.ncbi.nlm.nih.gov/pubmed/35104880
http://dx.doi.org/10.1093/nar/gkac029
_version_ 1784711379380338688
author Borges, Rafael J
Salvador, Guilherme H M
Pimenta, Daniel C
dos Santos, Lucilene D
Fontes, Marcos R M
Usón, Isabel
author_facet Borges, Rafael J
Salvador, Guilherme H M
Pimenta, Daniel C
dos Santos, Lucilene D
Fontes, Marcos R M
Usón, Isabel
author_sort Borges, Rafael J
collection PubMed
description Proteins isolated from natural sources can be composed of a mixture of isoforms with similar physicochemical properties that coexist in the final steps of purification. Yet, even where unverified, the assumed sequence is enforced throughout the structural studies. Herein, we propose a novel perspective to address the usually neglected sequence heterogeneity of natural products by integrating biophysical, genetic and structural data in our program SEQUENCE SLIDER. The aim is to assess the evidence supporting chemical composition in structure determination. Locally, we interrogate the experimental map to establish which side chains are supported by the structural data, and the genetic information relating sequence conservation is integrated into this statistic. Hence, we build a constrained peptide database, containing most probable sequences to interpret mass spectrometry data (MS). In parallel, we perform MS de novo sequencing with genomic-based algorithms to detect point mutations. We calibrated SLIDER with Gallus gallus lysozyme, whose sequence is unequivocally established and numerous natural isoforms are reported. We used SLIDER to characterize a metalloproteinase and a phospholipase A(2)-like protein from the venom of Bothrops moojeni and a crotoxin from Crotalus durissus collilineatus. This integrated approach offers a more realistic structural descriptor to characterize macromolecules isolated from natural sources.
format Online
Article
Text
id pubmed-9122596
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91225962022-05-23 SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources Borges, Rafael J Salvador, Guilherme H M Pimenta, Daniel C dos Santos, Lucilene D Fontes, Marcos R M Usón, Isabel Nucleic Acids Res Methods Online Proteins isolated from natural sources can be composed of a mixture of isoforms with similar physicochemical properties that coexist in the final steps of purification. Yet, even where unverified, the assumed sequence is enforced throughout the structural studies. Herein, we propose a novel perspective to address the usually neglected sequence heterogeneity of natural products by integrating biophysical, genetic and structural data in our program SEQUENCE SLIDER. The aim is to assess the evidence supporting chemical composition in structure determination. Locally, we interrogate the experimental map to establish which side chains are supported by the structural data, and the genetic information relating sequence conservation is integrated into this statistic. Hence, we build a constrained peptide database, containing most probable sequences to interpret mass spectrometry data (MS). In parallel, we perform MS de novo sequencing with genomic-based algorithms to detect point mutations. We calibrated SLIDER with Gallus gallus lysozyme, whose sequence is unequivocally established and numerous natural isoforms are reported. We used SLIDER to characterize a metalloproteinase and a phospholipase A(2)-like protein from the venom of Bothrops moojeni and a crotoxin from Crotalus durissus collilineatus. This integrated approach offers a more realistic structural descriptor to characterize macromolecules isolated from natural sources. Oxford University Press 2022-02-01 /pmc/articles/PMC9122596/ /pubmed/35104880 http://dx.doi.org/10.1093/nar/gkac029 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Borges, Rafael J
Salvador, Guilherme H M
Pimenta, Daniel C
dos Santos, Lucilene D
Fontes, Marcos R M
Usón, Isabel
SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title_full SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title_fullStr SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title_full_unstemmed SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title_short SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources
title_sort sequence slider: integration of structural and genetic data to characterize isoforms from natural sources
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122596/
https://www.ncbi.nlm.nih.gov/pubmed/35104880
http://dx.doi.org/10.1093/nar/gkac029
work_keys_str_mv AT borgesrafaelj sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources
AT salvadorguilhermehm sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources
AT pimentadanielc sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources
AT dossantoslucilened sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources
AT fontesmarcosrm sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources
AT usonisabel sequencesliderintegrationofstructuralandgeneticdatatocharacterizeisoformsfromnaturalsources