Cargando…

SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data

A protein’s structure is the key to its function. As protein structure can vary with environment, it is important to be able to determine it over a wide range of concentrations, temperatures, formulation vehicles, and states. Robust reproducible validated methods are required for applications includ...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinto Corujo, Marco, Olamoyesan, Adewale, Tukova, Anastasiia, Ang, Dale, Goormaghtigh, Erik, Peterson, Jason, Sharov, Victor, Chmel, Nikola, Rodger, Alison
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830495/
https://www.ncbi.nlm.nih.gov/pubmed/35155377
http://dx.doi.org/10.3389/fchem.2021.784625
_version_ 1784648285365993472
author Pinto Corujo, Marco
Olamoyesan, Adewale
Tukova, Anastasiia
Ang, Dale
Goormaghtigh, Erik
Peterson, Jason
Sharov, Victor
Chmel, Nikola
Rodger, Alison
author_facet Pinto Corujo, Marco
Olamoyesan, Adewale
Tukova, Anastasiia
Ang, Dale
Goormaghtigh, Erik
Peterson, Jason
Sharov, Victor
Chmel, Nikola
Rodger, Alison
author_sort Pinto Corujo, Marco
collection PubMed
description A protein’s structure is the key to its function. As protein structure can vary with environment, it is important to be able to determine it over a wide range of concentrations, temperatures, formulation vehicles, and states. Robust reproducible validated methods are required for applications including batch-batch comparisons of biopharmaceutical products. Circular dichroism is widely used for this purpose, but an alternative is required for concentrations above 10 mg/mL or for solutions with chiral buffer components that absorb far UV light. Infrared (IR) protein absorbance spectra of the Amide I region (1,600–1700 cm(−1)) contain information about secondary structure and require higher concentrations than circular dichroism often with complementary spectral windows. In this paper, we consider a number of approaches to extract structural information from a protein infrared spectrum and determine their reliability for regulatory and research purpose. In particular, we compare direct and second derivative band-fitting with a self-organising map (SOM) approach applied to a number of different reference sets. The self-organising map (SOM) approach proved significantly more accurate than the band-fitting approaches for solution spectra. As there is no validated benchmark method available for infrared structure fitting, SOMSpec was implemented in a leave-one-out validation (LOOV) approach for solid-state transmission and thin-film attenuated total reflectance (ATR) reference sets. We then tested SOMSpec and the thin-film ATR reference set against 68 solution spectra and found the average prediction error for helix (α + 3(10)) and β-sheet was less than 6% for proteins with less than 40% helix. This is quantitatively better than other available approaches. The visual output format of SOMSpec aids identification of poor predictions. We also demonstrated how to convert aqueous ATR spectra to and from transmission spectra for structure fitting. Fourier self-deconvolution did not improve the average structure predictions.
format Online
Article
Text
id pubmed-8830495
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88304952022-02-11 SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data Pinto Corujo, Marco Olamoyesan, Adewale Tukova, Anastasiia Ang, Dale Goormaghtigh, Erik Peterson, Jason Sharov, Victor Chmel, Nikola Rodger, Alison Front Chem Chemistry A protein’s structure is the key to its function. As protein structure can vary with environment, it is important to be able to determine it over a wide range of concentrations, temperatures, formulation vehicles, and states. Robust reproducible validated methods are required for applications including batch-batch comparisons of biopharmaceutical products. Circular dichroism is widely used for this purpose, but an alternative is required for concentrations above 10 mg/mL or for solutions with chiral buffer components that absorb far UV light. Infrared (IR) protein absorbance spectra of the Amide I region (1,600–1700 cm(−1)) contain information about secondary structure and require higher concentrations than circular dichroism often with complementary spectral windows. In this paper, we consider a number of approaches to extract structural information from a protein infrared spectrum and determine their reliability for regulatory and research purpose. In particular, we compare direct and second derivative band-fitting with a self-organising map (SOM) approach applied to a number of different reference sets. The self-organising map (SOM) approach proved significantly more accurate than the band-fitting approaches for solution spectra. As there is no validated benchmark method available for infrared structure fitting, SOMSpec was implemented in a leave-one-out validation (LOOV) approach for solid-state transmission and thin-film attenuated total reflectance (ATR) reference sets. We then tested SOMSpec and the thin-film ATR reference set against 68 solution spectra and found the average prediction error for helix (α + 3(10)) and β-sheet was less than 6% for proteins with less than 40% helix. This is quantitatively better than other available approaches. The visual output format of SOMSpec aids identification of poor predictions. We also demonstrated how to convert aqueous ATR spectra to and from transmission spectra for structure fitting. Fourier self-deconvolution did not improve the average structure predictions. Frontiers Media S.A. 2022-01-27 /pmc/articles/PMC8830495/ /pubmed/35155377 http://dx.doi.org/10.3389/fchem.2021.784625 Text en Copyright © 2022 Pinto Corujo, Olamoyesan, Tukova, Ang, Goormaghtigh, Peterson, Sharov, Chmel and Rodger. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Chemistry
Pinto Corujo, Marco
Olamoyesan, Adewale
Tukova, Anastasiia
Ang, Dale
Goormaghtigh, Erik
Peterson, Jason
Sharov, Victor
Chmel, Nikola
Rodger, Alison
SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title_full SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title_fullStr SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title_full_unstemmed SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title_short SOMSpec as a General Purpose Validated Self-Organising Map Tool for Rapid Protein Secondary Structure Prediction From Infrared Absorbance Data
title_sort somspec as a general purpose validated self-organising map tool for rapid protein secondary structure prediction from infrared absorbance data
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830495/
https://www.ncbi.nlm.nih.gov/pubmed/35155377
http://dx.doi.org/10.3389/fchem.2021.784625
work_keys_str_mv AT pintocorujomarco somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT olamoyesanadewale somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT tukovaanastasiia somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT angdale somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT goormaghtigherik somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT petersonjason somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT sharovvictor somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT chmelnikola somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata
AT rodgeralison somspecasageneralpurposevalidatedselforganisingmaptoolforrapidproteinsecondarystructurepredictionfrominfraredabsorbancedata