Cargando…

Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review

This paper provides a comprehensive review of the literature concerning the utilization of Natural Language Processing (NLP) techniques, with a particular focus on transformer-based large language models (LLMs) trained using Big Code, within the domain of AI-assisted programming tasks. LLMs, augment...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Man-Fai, Guo, Shangxin, Hang, Ching-Nam, Ho, Siu-Wai, Tan, Chee-Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10297336/
https://www.ncbi.nlm.nih.gov/pubmed/37372232
http://dx.doi.org/10.3390/e25060888
_version_ 1785063859838517248
author Wong, Man-Fai
Guo, Shangxin
Hang, Ching-Nam
Ho, Siu-Wai
Tan, Chee-Wei
author_facet Wong, Man-Fai
Guo, Shangxin
Hang, Ching-Nam
Ho, Siu-Wai
Tan, Chee-Wei
author_sort Wong, Man-Fai
collection PubMed
description This paper provides a comprehensive review of the literature concerning the utilization of Natural Language Processing (NLP) techniques, with a particular focus on transformer-based large language models (LLMs) trained using Big Code, within the domain of AI-assisted programming tasks. LLMs, augmented with software naturalness, have played a crucial role in facilitating AI-assisted programming applications, including code generation, code completion, code translation, code refinement, code summarization, defect detection, and clone detection. Notable examples of such applications include the GitHub Copilot powered by OpenAI’s Codex and DeepMind AlphaCode. This paper presents an overview of the major LLMs and their applications in downstream tasks related to AI-assisted programming. Furthermore, it explores the challenges and opportunities associated with incorporating NLP techniques with software naturalness in these applications, with a discussion on extending AI-assisted programming capabilities to Apple’s Xcode for mobile software development. This paper also presents the challenges of and opportunities for incorporating NLP techniques with software naturalness, empowering developers with advanced coding assistance and streamlining the software development process.
format Online
Article
Text
id pubmed-10297336
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102973362023-06-28 Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review Wong, Man-Fai Guo, Shangxin Hang, Ching-Nam Ho, Siu-Wai Tan, Chee-Wei Entropy (Basel) Review This paper provides a comprehensive review of the literature concerning the utilization of Natural Language Processing (NLP) techniques, with a particular focus on transformer-based large language models (LLMs) trained using Big Code, within the domain of AI-assisted programming tasks. LLMs, augmented with software naturalness, have played a crucial role in facilitating AI-assisted programming applications, including code generation, code completion, code translation, code refinement, code summarization, defect detection, and clone detection. Notable examples of such applications include the GitHub Copilot powered by OpenAI’s Codex and DeepMind AlphaCode. This paper presents an overview of the major LLMs and their applications in downstream tasks related to AI-assisted programming. Furthermore, it explores the challenges and opportunities associated with incorporating NLP techniques with software naturalness in these applications, with a discussion on extending AI-assisted programming capabilities to Apple’s Xcode for mobile software development. This paper also presents the challenges of and opportunities for incorporating NLP techniques with software naturalness, empowering developers with advanced coding assistance and streamlining the software development process. MDPI 2023-06-01 /pmc/articles/PMC10297336/ /pubmed/37372232 http://dx.doi.org/10.3390/e25060888 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Wong, Man-Fai
Guo, Shangxin
Hang, Ching-Nam
Ho, Siu-Wai
Tan, Chee-Wei
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title_full Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title_fullStr Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title_full_unstemmed Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title_short Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
title_sort natural language generation and understanding of big code for ai-assisted programming: a review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10297336/
https://www.ncbi.nlm.nih.gov/pubmed/37372232
http://dx.doi.org/10.3390/e25060888
work_keys_str_mv AT wongmanfai naturallanguagegenerationandunderstandingofbigcodeforaiassistedprogrammingareview
AT guoshangxin naturallanguagegenerationandunderstandingofbigcodeforaiassistedprogrammingareview
AT hangchingnam naturallanguagegenerationandunderstandingofbigcodeforaiassistedprogrammingareview
AT hosiuwai naturallanguagegenerationandunderstandingofbigcodeforaiassistedprogrammingareview
AT tancheewei naturallanguagegenerationandunderstandingofbigcodeforaiassistedprogrammingareview