Cargando…

A multichannel optical computing architecture for advanced machine vision

Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Zhihao, Yuan, Xiaoyun, Zhou, Tiankuang, Fang, Lu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385649/
https://www.ncbi.nlm.nih.gov/pubmed/35977940
http://dx.doi.org/10.1038/s41377-022-00945-y
_version_ 1784769635602661376
author Xu, Zhihao
Yuan, Xiaoyun
Zhou, Tiankuang
Fang, Lu
author_facet Xu, Zhihao
Yuan, Xiaoyun
Zhou, Tiankuang
Fang, Lu
author_sort Xu, Zhihao
collection PubMed
description Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving simple tasks such as hand-written digit classification, saliency detection, etc. The limited computing capacity and scalability of single-channel ONNs restrict the optical implementation of advanced machine vision. Herein, we develop Monet: a multichannel optical neural network architecture for a universal multiple-input multiple-channel optical computing based on a novel projection-interference-prediction framework where the inter- and intra- channel connections are mapped to optical interference and diffraction. In our Monet, optical interference patterns are generated by projecting and interfering the multichannel inputs in a shared domain. These patterns encoding the correspondences together with feature embeddings are iteratively produced through the projection-interference process to predict the final output optically. For the first time, Monet validates that multichannel processing properties can be optically implemented with high-efficiency, enabling real-world intelligent multichannel-processing tasks solved via optical computing, including 3D/motion detections. Extensive experiments on different scenarios demonstrate the effectiveness of Monet in handling advanced machine vision tasks with comparative accuracy as the electronic counterparts yet achieving a ten-fold improvement in computing efficiency. For intelligent computing, the trends of dealing with real-world advanced tasks are irreversible. Breaking the capacity and scalability limitations of single-channel ONN and further exploring the multichannel processing potential of wave optics, we anticipate that the proposed technique will accelerate the development of more powerful optical AI as critical support for modern advanced machine vision.
format Online
Article
Text
id pubmed-9385649
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93856492022-08-19 A multichannel optical computing architecture for advanced machine vision Xu, Zhihao Yuan, Xiaoyun Zhou, Tiankuang Fang, Lu Light Sci Appl Article Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving simple tasks such as hand-written digit classification, saliency detection, etc. The limited computing capacity and scalability of single-channel ONNs restrict the optical implementation of advanced machine vision. Herein, we develop Monet: a multichannel optical neural network architecture for a universal multiple-input multiple-channel optical computing based on a novel projection-interference-prediction framework where the inter- and intra- channel connections are mapped to optical interference and diffraction. In our Monet, optical interference patterns are generated by projecting and interfering the multichannel inputs in a shared domain. These patterns encoding the correspondences together with feature embeddings are iteratively produced through the projection-interference process to predict the final output optically. For the first time, Monet validates that multichannel processing properties can be optically implemented with high-efficiency, enabling real-world intelligent multichannel-processing tasks solved via optical computing, including 3D/motion detections. Extensive experiments on different scenarios demonstrate the effectiveness of Monet in handling advanced machine vision tasks with comparative accuracy as the electronic counterparts yet achieving a ten-fold improvement in computing efficiency. For intelligent computing, the trends of dealing with real-world advanced tasks are irreversible. Breaking the capacity and scalability limitations of single-channel ONN and further exploring the multichannel processing potential of wave optics, we anticipate that the proposed technique will accelerate the development of more powerful optical AI as critical support for modern advanced machine vision. Nature Publishing Group UK 2022-08-18 /pmc/articles/PMC9385649/ /pubmed/35977940 http://dx.doi.org/10.1038/s41377-022-00945-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Xu, Zhihao
Yuan, Xiaoyun
Zhou, Tiankuang
Fang, Lu
A multichannel optical computing architecture for advanced machine vision
title A multichannel optical computing architecture for advanced machine vision
title_full A multichannel optical computing architecture for advanced machine vision
title_fullStr A multichannel optical computing architecture for advanced machine vision
title_full_unstemmed A multichannel optical computing architecture for advanced machine vision
title_short A multichannel optical computing architecture for advanced machine vision
title_sort multichannel optical computing architecture for advanced machine vision
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385649/
https://www.ncbi.nlm.nih.gov/pubmed/35977940
http://dx.doi.org/10.1038/s41377-022-00945-y
work_keys_str_mv AT xuzhihao amultichannelopticalcomputingarchitectureforadvancedmachinevision
AT yuanxiaoyun amultichannelopticalcomputingarchitectureforadvancedmachinevision
AT zhoutiankuang amultichannelopticalcomputingarchitectureforadvancedmachinevision
AT fanglu amultichannelopticalcomputingarchitectureforadvancedmachinevision
AT xuzhihao multichannelopticalcomputingarchitectureforadvancedmachinevision
AT yuanxiaoyun multichannelopticalcomputingarchitectureforadvancedmachinevision
AT zhoutiankuang multichannelopticalcomputingarchitectureforadvancedmachinevision
AT fanglu multichannelopticalcomputingarchitectureforadvancedmachinevision