Cargando…

Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets

Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impa...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoffmann, Jordan, Bar-Sinai, Yohai, Lee, Lisa M., Andrejevic, Jovana, Mishra, Shruti, Rubinstein, Shmuel M., Rycroft, Chris H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486215/
https://www.ncbi.nlm.nih.gov/pubmed/31032399
http://dx.doi.org/10.1126/sciadv.aau6792
_version_ 1783414331576156160
author Hoffmann, Jordan
Bar-Sinai, Yohai
Lee, Lisa M.
Andrejevic, Jovana
Mishra, Shruti
Rubinstein, Shmuel M.
Rycroft, Chris H.
author_facet Hoffmann, Jordan
Bar-Sinai, Yohai
Lee, Lisa M.
Andrejevic, Jovana
Mishra, Shruti
Rubinstein, Shmuel M.
Rycroft, Chris H.
author_sort Hoffmann, Jordan
collection PubMed
description Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce.
format Online
Article
Text
id pubmed-6486215
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-64862152019-04-27 Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets Hoffmann, Jordan Bar-Sinai, Yohai Lee, Lisa M. Andrejevic, Jovana Mishra, Shruti Rubinstein, Shmuel M. Rycroft, Chris H. Sci Adv Research Articles Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce. American Association for the Advancement of Science 2019-04-26 /pmc/articles/PMC6486215/ /pubmed/31032399 http://dx.doi.org/10.1126/sciadv.aau6792 Text en Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (http://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Research Articles
Hoffmann, Jordan
Bar-Sinai, Yohai
Lee, Lisa M.
Andrejevic, Jovana
Mishra, Shruti
Rubinstein, Shmuel M.
Rycroft, Chris H.
Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title_full Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title_fullStr Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title_full_unstemmed Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title_short Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
title_sort machine learning in a data-limited regime: augmenting experiments with synthetic data uncovers order in crumpled sheets
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486215/
https://www.ncbi.nlm.nih.gov/pubmed/31032399
http://dx.doi.org/10.1126/sciadv.aau6792
work_keys_str_mv AT hoffmannjordan machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT barsinaiyohai machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT leelisam machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT andrejevicjovana machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT mishrashruti machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT rubinsteinshmuelm machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets
AT rycroftchrish machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets