Cargando…
A multichannel optical computing architecture for advanced machine vision
Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385649/ https://www.ncbi.nlm.nih.gov/pubmed/35977940 http://dx.doi.org/10.1038/s41377-022-00945-y |
_version_ | 1784769635602661376 |
---|---|
author | Xu, Zhihao Yuan, Xiaoyun Zhou, Tiankuang Fang, Lu |
author_facet | Xu, Zhihao Yuan, Xiaoyun Zhou, Tiankuang Fang, Lu |
author_sort | Xu, Zhihao |
collection | PubMed |
description | Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving simple tasks such as hand-written digit classification, saliency detection, etc. The limited computing capacity and scalability of single-channel ONNs restrict the optical implementation of advanced machine vision. Herein, we develop Monet: a multichannel optical neural network architecture for a universal multiple-input multiple-channel optical computing based on a novel projection-interference-prediction framework where the inter- and intra- channel connections are mapped to optical interference and diffraction. In our Monet, optical interference patterns are generated by projecting and interfering the multichannel inputs in a shared domain. These patterns encoding the correspondences together with feature embeddings are iteratively produced through the projection-interference process to predict the final output optically. For the first time, Monet validates that multichannel processing properties can be optically implemented with high-efficiency, enabling real-world intelligent multichannel-processing tasks solved via optical computing, including 3D/motion detections. Extensive experiments on different scenarios demonstrate the effectiveness of Monet in handling advanced machine vision tasks with comparative accuracy as the electronic counterparts yet achieving a ten-fold improvement in computing efficiency. For intelligent computing, the trends of dealing with real-world advanced tasks are irreversible. Breaking the capacity and scalability limitations of single-channel ONN and further exploring the multichannel processing potential of wave optics, we anticipate that the proposed technique will accelerate the development of more powerful optical AI as critical support for modern advanced machine vision. |
format | Online Article Text |
id | pubmed-9385649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-93856492022-08-19 A multichannel optical computing architecture for advanced machine vision Xu, Zhihao Yuan, Xiaoyun Zhou, Tiankuang Fang, Lu Light Sci Appl Article Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving simple tasks such as hand-written digit classification, saliency detection, etc. The limited computing capacity and scalability of single-channel ONNs restrict the optical implementation of advanced machine vision. Herein, we develop Monet: a multichannel optical neural network architecture for a universal multiple-input multiple-channel optical computing based on a novel projection-interference-prediction framework where the inter- and intra- channel connections are mapped to optical interference and diffraction. In our Monet, optical interference patterns are generated by projecting and interfering the multichannel inputs in a shared domain. These patterns encoding the correspondences together with feature embeddings are iteratively produced through the projection-interference process to predict the final output optically. For the first time, Monet validates that multichannel processing properties can be optically implemented with high-efficiency, enabling real-world intelligent multichannel-processing tasks solved via optical computing, including 3D/motion detections. Extensive experiments on different scenarios demonstrate the effectiveness of Monet in handling advanced machine vision tasks with comparative accuracy as the electronic counterparts yet achieving a ten-fold improvement in computing efficiency. For intelligent computing, the trends of dealing with real-world advanced tasks are irreversible. Breaking the capacity and scalability limitations of single-channel ONN and further exploring the multichannel processing potential of wave optics, we anticipate that the proposed technique will accelerate the development of more powerful optical AI as critical support for modern advanced machine vision. Nature Publishing Group UK 2022-08-18 /pmc/articles/PMC9385649/ /pubmed/35977940 http://dx.doi.org/10.1038/s41377-022-00945-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Xu, Zhihao Yuan, Xiaoyun Zhou, Tiankuang Fang, Lu A multichannel optical computing architecture for advanced machine vision |
title | A multichannel optical computing architecture for advanced machine vision |
title_full | A multichannel optical computing architecture for advanced machine vision |
title_fullStr | A multichannel optical computing architecture for advanced machine vision |
title_full_unstemmed | A multichannel optical computing architecture for advanced machine vision |
title_short | A multichannel optical computing architecture for advanced machine vision |
title_sort | multichannel optical computing architecture for advanced machine vision |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385649/ https://www.ncbi.nlm.nih.gov/pubmed/35977940 http://dx.doi.org/10.1038/s41377-022-00945-y |
work_keys_str_mv | AT xuzhihao amultichannelopticalcomputingarchitectureforadvancedmachinevision AT yuanxiaoyun amultichannelopticalcomputingarchitectureforadvancedmachinevision AT zhoutiankuang amultichannelopticalcomputingarchitectureforadvancedmachinevision AT fanglu amultichannelopticalcomputingarchitectureforadvancedmachinevision AT xuzhihao multichannelopticalcomputingarchitectureforadvancedmachinevision AT yuanxiaoyun multichannelopticalcomputingarchitectureforadvancedmachinevision AT zhoutiankuang multichannelopticalcomputingarchitectureforadvancedmachinevision AT fanglu multichannelopticalcomputingarchitectureforadvancedmachinevision |