Cargando…

Deep learning for real-time single-pixel video

Single-pixel cameras capture images without the requirement for a multi-pixel sensor, enabling the use of state-of-the-art detector technologies and providing a potentially low-cost solution for sensing beyond the visible spectrum. One limitation of single-pixel cameras is the inherent trade-off bet...

Descripción completa

Detalles Bibliográficos
Autores principales: Higham, Catherine F., Murray-Smith, Roderick, Padgett, Miles J., Edgar, Matthew P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5799195/
https://www.ncbi.nlm.nih.gov/pubmed/29403059
http://dx.doi.org/10.1038/s41598-018-20521-y
_version_ 1783297944007475200
author Higham, Catherine F.
Murray-Smith, Roderick
Padgett, Miles J.
Edgar, Matthew P.
author_facet Higham, Catherine F.
Murray-Smith, Roderick
Padgett, Miles J.
Edgar, Matthew P.
author_sort Higham, Catherine F.
collection PubMed
description Single-pixel cameras capture images without the requirement for a multi-pixel sensor, enabling the use of state-of-the-art detector technologies and providing a potentially low-cost solution for sensing beyond the visible spectrum. One limitation of single-pixel cameras is the inherent trade-off between image resolution and frame rate, with current compressive (compressed) sensing techniques being unable to support real-time video. In this work we demonstrate the application of deep learning with convolutional auto-encoder networks to recover real-time 128 × 128 pixel video at 30 frames-per-second from a single-pixel camera sampling at a compression ratio of 2%. In addition, by training the network on a large database of images we are able to optimise the first layer of the convolutional network, equivalent to optimising the basis used for scanning the image intensities. This work develops and implements a novel approach to solving the inverse problem for single-pixel cameras efficiently and represents a significant step towards real-time operation of computational imagers. By learning from examples in a particular context, our approach opens up the possibility of high resolution for task-specific adaptation, with importance for applications in gas sensing, 3D imaging and metrology.
format Online
Article
Text
id pubmed-5799195
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-57991952018-02-14 Deep learning for real-time single-pixel video Higham, Catherine F. Murray-Smith, Roderick Padgett, Miles J. Edgar, Matthew P. Sci Rep Article Single-pixel cameras capture images without the requirement for a multi-pixel sensor, enabling the use of state-of-the-art detector technologies and providing a potentially low-cost solution for sensing beyond the visible spectrum. One limitation of single-pixel cameras is the inherent trade-off between image resolution and frame rate, with current compressive (compressed) sensing techniques being unable to support real-time video. In this work we demonstrate the application of deep learning with convolutional auto-encoder networks to recover real-time 128 × 128 pixel video at 30 frames-per-second from a single-pixel camera sampling at a compression ratio of 2%. In addition, by training the network on a large database of images we are able to optimise the first layer of the convolutional network, equivalent to optimising the basis used for scanning the image intensities. This work develops and implements a novel approach to solving the inverse problem for single-pixel cameras efficiently and represents a significant step towards real-time operation of computational imagers. By learning from examples in a particular context, our approach opens up the possibility of high resolution for task-specific adaptation, with importance for applications in gas sensing, 3D imaging and metrology. Nature Publishing Group UK 2018-02-05 /pmc/articles/PMC5799195/ /pubmed/29403059 http://dx.doi.org/10.1038/s41598-018-20521-y Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Higham, Catherine F.
Murray-Smith, Roderick
Padgett, Miles J.
Edgar, Matthew P.
Deep learning for real-time single-pixel video
title Deep learning for real-time single-pixel video
title_full Deep learning for real-time single-pixel video
title_fullStr Deep learning for real-time single-pixel video
title_full_unstemmed Deep learning for real-time single-pixel video
title_short Deep learning for real-time single-pixel video
title_sort deep learning for real-time single-pixel video
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5799195/
https://www.ncbi.nlm.nih.gov/pubmed/29403059
http://dx.doi.org/10.1038/s41598-018-20521-y
work_keys_str_mv AT highamcatherinef deeplearningforrealtimesinglepixelvideo
AT murraysmithroderick deeplearningforrealtimesinglepixelvideo
AT padgettmilesj deeplearningforrealtimesinglepixelvideo
AT edgarmatthewp deeplearningforrealtimesinglepixelvideo