Cargando…
Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation
Collagen is the most abundant structural protein in humans, providing crucial mechanical properties, including high strength and toughness, in tissues. Collagen-based biomaterials are, therefore, used for tissue repair and regeneration. Utilizing collagen effectively during materials processing ex v...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546622/ https://www.ncbi.nlm.nih.gov/pubmed/36161946 http://dx.doi.org/10.1073/pnas.2209524119 |
_version_ | 1784805082932445184 |
---|---|
author | Khare, Eesha Yu, Chi-Hua Gonzalez Obeso, Constancio Milazzo, Mario Kaplan, David L. Buehler, Markus J. |
author_facet | Khare, Eesha Yu, Chi-Hua Gonzalez Obeso, Constancio Milazzo, Mario Kaplan, David L. Buehler, Markus J. |
author_sort | Khare, Eesha |
collection | PubMed |
description | Collagen is the most abundant structural protein in humans, providing crucial mechanical properties, including high strength and toughness, in tissues. Collagen-based biomaterials are, therefore, used for tissue repair and regeneration. Utilizing collagen effectively during materials processing ex vivo and subsequent function in vivo requires stability over wide temperature ranges to avoid denaturation and loss of structure, measured as melting temperature (T(m)). Although significant research has been conducted on understanding how collagen primary amino acid sequences correspond to T(m) values, a robust framework to facilitate the design of collagen sequences with specific T(m) remains a challenge. Here, we develop a general model using a genetic algorithm within a deep learning framework to design collagen sequences with specific T(m) values. We report 1,000 de novo collagen sequences, and we show that we can efficiently use this model to generate collagen sequences and verify their T(m) values using both experimental and computational methods. We find that the model accurately predicts T(m) values within a few degrees centigrade. Further, using this model, we conduct a high-throughput study to identify the most frequently occurring collagen triplets that can be directly incorporated into collagen. We further discovered that the number of hydrogen bonds within collagen calculated with molecular dynamics (MD) is directly correlated to the experimental measurement of triple-helical quality. Ultimately, we see this work as a critical step to helping researchers develop collagen sequences with specific T(m) values for intended materials manufacturing methods and biomedical applications, realizing a mechanistic materials by design paradigm. |
format | Online Article Text |
id | pubmed-9546622 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-95466222023-03-26 Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation Khare, Eesha Yu, Chi-Hua Gonzalez Obeso, Constancio Milazzo, Mario Kaplan, David L. Buehler, Markus J. Proc Natl Acad Sci U S A Physical Sciences Collagen is the most abundant structural protein in humans, providing crucial mechanical properties, including high strength and toughness, in tissues. Collagen-based biomaterials are, therefore, used for tissue repair and regeneration. Utilizing collagen effectively during materials processing ex vivo and subsequent function in vivo requires stability over wide temperature ranges to avoid denaturation and loss of structure, measured as melting temperature (T(m)). Although significant research has been conducted on understanding how collagen primary amino acid sequences correspond to T(m) values, a robust framework to facilitate the design of collagen sequences with specific T(m) remains a challenge. Here, we develop a general model using a genetic algorithm within a deep learning framework to design collagen sequences with specific T(m) values. We report 1,000 de novo collagen sequences, and we show that we can efficiently use this model to generate collagen sequences and verify their T(m) values using both experimental and computational methods. We find that the model accurately predicts T(m) values within a few degrees centigrade. Further, using this model, we conduct a high-throughput study to identify the most frequently occurring collagen triplets that can be directly incorporated into collagen. We further discovered that the number of hydrogen bonds within collagen calculated with molecular dynamics (MD) is directly correlated to the experimental measurement of triple-helical quality. Ultimately, we see this work as a critical step to helping researchers develop collagen sequences with specific T(m) values for intended materials manufacturing methods and biomedical applications, realizing a mechanistic materials by design paradigm. National Academy of Sciences 2022-09-26 2022-10-04 /pmc/articles/PMC9546622/ /pubmed/36161946 http://dx.doi.org/10.1073/pnas.2209524119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Khare, Eesha Yu, Chi-Hua Gonzalez Obeso, Constancio Milazzo, Mario Kaplan, David L. Buehler, Markus J. Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title | Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title_full | Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title_fullStr | Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title_full_unstemmed | Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title_short | Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
title_sort | discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546622/ https://www.ncbi.nlm.nih.gov/pubmed/36161946 http://dx.doi.org/10.1073/pnas.2209524119 |
work_keys_str_mv | AT khareeesha discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation AT yuchihua discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation AT gonzalezobesoconstancio discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation AT milazzomario discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation AT kaplandavidl discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation AT buehlermarkusj discoveringdesignprinciplesofcollagenmolecularstabilityusingageneticalgorithmdeeplearningandexperimentalvalidation |