Cargando…

Towards artificial general intelligence via a multimodal foundation model

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligen...

Descripción completa

Detalles Bibliográficos
Autores principales: Fei, Nanyi, Lu, Zhiwu, Gao, Yizhao, Yang, Guoxing, Huo, Yuqi, Wen, Jingyuan, Lu, Haoyu, Song, Ruihua, Gao, Xin, Xiang, Tao, Sun, Hao, Wen, Ji-Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9163040/
https://www.ncbi.nlm.nih.gov/pubmed/35655064
http://dx.doi.org/10.1038/s41467-022-30761-2
_version_ 1784719842620735488
author Fei, Nanyi
Lu, Zhiwu
Gao, Yizhao
Yang, Guoxing
Huo, Yuqi
Wen, Jingyuan
Lu, Haoyu
Song, Ruihua
Gao, Xin
Xiang, Tao
Sun, Hao
Wen, Ji-Rong
author_facet Fei, Nanyi
Lu, Zhiwu
Gao, Yizhao
Yang, Guoxing
Huo, Yuqi
Wen, Jingyuan
Lu, Haoyu
Song, Ruihua
Gao, Xin
Xiang, Tao
Sun, Hao
Wen, Ji-Rong
author_sort Fei, Nanyi
collection PubMed
description The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of “weak or narrow AI” to that of “strong or generalized AI”.
format Online
Article
Text
id pubmed-9163040
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91630402022-06-05 Towards artificial general intelligence via a multimodal foundation model Fei, Nanyi Lu, Zhiwu Gao, Yizhao Yang, Guoxing Huo, Yuqi Wen, Jingyuan Lu, Haoyu Song, Ruihua Gao, Xin Xiang, Tao Sun, Hao Wen, Ji-Rong Nat Commun Article The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of “weak or narrow AI” to that of “strong or generalized AI”. Nature Publishing Group UK 2022-06-02 /pmc/articles/PMC9163040/ /pubmed/35655064 http://dx.doi.org/10.1038/s41467-022-30761-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Fei, Nanyi
Lu, Zhiwu
Gao, Yizhao
Yang, Guoxing
Huo, Yuqi
Wen, Jingyuan
Lu, Haoyu
Song, Ruihua
Gao, Xin
Xiang, Tao
Sun, Hao
Wen, Ji-Rong
Towards artificial general intelligence via a multimodal foundation model
title Towards artificial general intelligence via a multimodal foundation model
title_full Towards artificial general intelligence via a multimodal foundation model
title_fullStr Towards artificial general intelligence via a multimodal foundation model
title_full_unstemmed Towards artificial general intelligence via a multimodal foundation model
title_short Towards artificial general intelligence via a multimodal foundation model
title_sort towards artificial general intelligence via a multimodal foundation model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9163040/
https://www.ncbi.nlm.nih.gov/pubmed/35655064
http://dx.doi.org/10.1038/s41467-022-30761-2
work_keys_str_mv AT feinanyi towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT luzhiwu towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT gaoyizhao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT yangguoxing towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT huoyuqi towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT wenjingyuan towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT luhaoyu towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT songruihua towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT gaoxin towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT xiangtao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT sunhao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel
AT wenjirong towardsartificialgeneralintelligenceviaamultimodalfoundationmodel