Cargando…
Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impa...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Association for the Advancement of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486215/ https://www.ncbi.nlm.nih.gov/pubmed/31032399 http://dx.doi.org/10.1126/sciadv.aau6792 |
_version_ | 1783414331576156160 |
---|---|
author | Hoffmann, Jordan Bar-Sinai, Yohai Lee, Lisa M. Andrejevic, Jovana Mishra, Shruti Rubinstein, Shmuel M. Rycroft, Chris H. |
author_facet | Hoffmann, Jordan Bar-Sinai, Yohai Lee, Lisa M. Andrejevic, Jovana Mishra, Shruti Rubinstein, Shmuel M. Rycroft, Chris H. |
author_sort | Hoffmann, Jordan |
collection | PubMed |
description | Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce. |
format | Online Article Text |
id | pubmed-6486215 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | American Association for the Advancement of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64862152019-04-27 Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets Hoffmann, Jordan Bar-Sinai, Yohai Lee, Lisa M. Andrejevic, Jovana Mishra, Shruti Rubinstein, Shmuel M. Rycroft, Chris H. Sci Adv Research Articles Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce. American Association for the Advancement of Science 2019-04-26 /pmc/articles/PMC6486215/ /pubmed/31032399 http://dx.doi.org/10.1126/sciadv.aau6792 Text en Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (http://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited. |
spellingShingle | Research Articles Hoffmann, Jordan Bar-Sinai, Yohai Lee, Lisa M. Andrejevic, Jovana Mishra, Shruti Rubinstein, Shmuel M. Rycroft, Chris H. Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title | Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title_full | Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title_fullStr | Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title_full_unstemmed | Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title_short | Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets |
title_sort | machine learning in a data-limited regime: augmenting experiments with synthetic data uncovers order in crumpled sheets |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486215/ https://www.ncbi.nlm.nih.gov/pubmed/31032399 http://dx.doi.org/10.1126/sciadv.aau6792 |
work_keys_str_mv | AT hoffmannjordan machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT barsinaiyohai machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT leelisam machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT andrejevicjovana machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT mishrashruti machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT rubinsteinshmuelm machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets AT rycroftchrish machinelearninginadatalimitedregimeaugmentingexperimentswithsyntheticdatauncoversorderincrumpledsheets |