Cargando…
Towards artificial general intelligence via a multimodal foundation model
The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligen...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9163040/ https://www.ncbi.nlm.nih.gov/pubmed/35655064 http://dx.doi.org/10.1038/s41467-022-30761-2 |
_version_ | 1784719842620735488 |
---|---|
author | Fei, Nanyi Lu, Zhiwu Gao, Yizhao Yang, Guoxing Huo, Yuqi Wen, Jingyuan Lu, Haoyu Song, Ruihua Gao, Xin Xiang, Tao Sun, Hao Wen, Ji-Rong |
author_facet | Fei, Nanyi Lu, Zhiwu Gao, Yizhao Yang, Guoxing Huo, Yuqi Wen, Jingyuan Lu, Haoyu Song, Ruihua Gao, Xin Xiang, Tao Sun, Hao Wen, Ji-Rong |
author_sort | Fei, Nanyi |
collection | PubMed |
description | The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of “weak or narrow AI” to that of “strong or generalized AI”. |
format | Online Article Text |
id | pubmed-9163040 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-91630402022-06-05 Towards artificial general intelligence via a multimodal foundation model Fei, Nanyi Lu, Zhiwu Gao, Yizhao Yang, Guoxing Huo, Yuqi Wen, Jingyuan Lu, Haoyu Song, Ruihua Gao, Xin Xiang, Tao Sun, Hao Wen, Ji-Rong Nat Commun Article The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of “weak or narrow AI” to that of “strong or generalized AI”. Nature Publishing Group UK 2022-06-02 /pmc/articles/PMC9163040/ /pubmed/35655064 http://dx.doi.org/10.1038/s41467-022-30761-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Fei, Nanyi Lu, Zhiwu Gao, Yizhao Yang, Guoxing Huo, Yuqi Wen, Jingyuan Lu, Haoyu Song, Ruihua Gao, Xin Xiang, Tao Sun, Hao Wen, Ji-Rong Towards artificial general intelligence via a multimodal foundation model |
title | Towards artificial general intelligence via a multimodal foundation model |
title_full | Towards artificial general intelligence via a multimodal foundation model |
title_fullStr | Towards artificial general intelligence via a multimodal foundation model |
title_full_unstemmed | Towards artificial general intelligence via a multimodal foundation model |
title_short | Towards artificial general intelligence via a multimodal foundation model |
title_sort | towards artificial general intelligence via a multimodal foundation model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9163040/ https://www.ncbi.nlm.nih.gov/pubmed/35655064 http://dx.doi.org/10.1038/s41467-022-30761-2 |
work_keys_str_mv | AT feinanyi towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT luzhiwu towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT gaoyizhao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT yangguoxing towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT huoyuqi towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT wenjingyuan towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT luhaoyu towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT songruihua towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT gaoxin towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT xiangtao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT sunhao towardsartificialgeneralintelligenceviaamultimodalfoundationmodel AT wenjirong towardsartificialgeneralintelligenceviaamultimodalfoundationmodel |