OpenAI releases 'Codex', an AI system that automatically generates code from natural-language input



OpenAI, an artificial intelligence (AI) research organization whose co-founders include Tesla and SpaceX CEO Elon Musk, has released a private beta API for Codex, an AI system that translates natural language into code.

OpenAI Codex
https://openai.com/blog/openai-codex/



OpenAI launches Codex, an API for translating natural language into code | VentureBeat
https://venturebeat.com/2021/08/10/openai-launches-codex-an-api-for-translating-natural-language-into-code/

OpenAI can translate English into code with its new machine learning software Codex - The Verge
https://www.theverge.com/2021/8/10/22618128/openai-codex-natural-language-into-code-api-beta-access

Codex is the model behind 'GitHub Copilot', a feature that automatically completes the continuation of source code, which OpenAI built and released in partnership with GitHub in June 2021; the version released this time is an improved one. Hrithwik Bharadwaj, an engineer who actually used GitHub Copilot, summarized its advantages and disadvantages: 'It's great for writing some functions, but the code it produces may not always be efficient,' and 'It's not a substitute for humans; it's a co-pilot that helps programmers work.'

'GitHub Copilot', a feature that automatically completes the 'continuation' of source code, has appeared on GitHub with the cooperation of OpenAI - GIGAZINE



Codex supports more than 12 programming languages and is an AI system that can interpret simple commands in natural language and execute them on behalf of the user. Because commands can be given in plain English, it can support not only professional programmers but also amateurs who are coding for the first time. Greg Brockman, CTO of OpenAI, described it to The Verge as a 'tool to multiply programmers.'

In practice, you simply type an instruction in English such as 'Create a web page with a menu on the side and a title at the top,' and Codex outputs the corresponding website code. The following video shows a demonstration of Codex producing websites and simple games from English input.

OpenAI Codex Live Demo - YouTube


Whereas GitHub Copilot outputs the continuation of existing code, Codex can output code from scratch, so it can be described as a far more sophisticated and flexible system. Because Codex is built on GPT-3, OpenAI's language generation model, it can analyze natural language and output code that matches the user's intention.

It can also suggest entire lines of code in a development environment such as Microsoft Visual Studio. In addition, because Codex is trained on billions of lines of public code, OpenAI says that it works with a wide range of frameworks and languages and can adapt its edits to the developer's coding style.

According to OpenAI, the Codex available via the API is most capable in Python. However, it also supports programming languages such as JavaScript, Go, Perl, PHP, Ruby, Swift, TypeScript, and Shell, and it can carry out programming tasks such as code conversion, code explanation, and code refactoring while taking contextual information into account.

'Codex helps computers better understand human intent, which makes computers even more accessible to everyone,' OpenAI explains.



However, tech media VentureBeat pointed out potential problems: 'Codex is very capable, but related papers published by OpenAI suggest that Codex may have significant limitations, such as bias and sample inefficiency.' According to VentureBeat, the researchers note that the model can suggest code that is syntactically incorrect or undefined, and can call variables and attributes that are undefined or outside the scope of the codebase. A further concern is that Codex sometimes suggests a solution that looks correct on the surface but does not actually perform the intended task. For example, when asked to create an encryption key, Codex selects 'obviously insecure' configuration parameters in 'a significant percentage' of cases, and it may recommend compromised packages as dependencies.

In addition, because Codex produces responses that are as similar as possible to the data used for training, it may output 'obfuscated code that looks good but does something that isn't really desirable.' In fact, OpenAI reports that using Codex to output code related to certain words can result in racist or harmful code. Specifically, the word 'Islam' is more often associated with words such as 'terrorist' and 'violent' than other religious terms are.

With these issues in mind, OpenAI says of Codex, 'These issues can be avoided through careful documentation, user interface design, code review, and content controls.'
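
As a rough illustration of what an automated part of such a code review might look like, the following Python sketch screens a generated snippet for the two problems mentioned above: code that does not parse, and names that are used but never defined. It is not part of Codex or of OpenAI's tooling. The function name check_suggestion is hypothetical, and the check is a deliberately crude heuristic that ignores scoping rules and relies only on Python's standard ast module.

```python
import ast
import builtins


def check_suggestion(source: str) -> list:
    """Return a list of problems found in a code suggestion (empty if none).

    Hypothetical helper for illustration only; it ignores scoping, so it can
    miss real errors and is no replacement for human review or testing.
    """
    try:
        tree = ast.parse(source)
    except SyntaxError as exc:
        return [f"syntax error on line {exc.lineno}: {exc.msg}"]

    # Collect every name the snippet binds anywhere, plus Python's builtins.
    defined = set(dir(builtins))
    for node in ast.walk(tree):
        if isinstance(node, ast.Name) and isinstance(node.ctx, (ast.Store, ast.Del)):
            defined.add(node.id)
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            defined.add(node.name)
        elif isinstance(node, ast.arg):
            defined.add(node.arg)
        elif isinstance(node, (ast.Import, ast.ImportFrom)):
            for alias in node.names:
                defined.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, ast.ExceptHandler) and node.name:
            defined.add(node.name)

    # Report names that are read but never bound anywhere in the snippet.
    problems = []
    for node in ast.walk(tree):
        if isinstance(node, ast.Name) and isinstance(node.ctx, ast.Load):
            if node.id not in defined:
                problems.append(f"line {node.lineno}: name '{node.id}' is never defined")
    return sorted(problems)


if __name__ == "__main__":
    # A made-up suggestion that uses 'pi' without importing or defining it.
    suggestion = "def area(r):\n    return pi * r * r\n"
    for problem in check_suggestion(suggestion):
        print(problem)
```

A check like this only catches surface-level mistakes. It cannot tell whether code that parses and resolves all of its names actually performs the intended task, or whether it uses insecure parameters, which is why a human reviewer is still needed.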



If you want to use the Codex API, you can join the waitlist from the following page.

OpenAI API Waitlist - OpenAI
https://share.hsforms.com/1Lfc7WtPLRk2ppXhPjcYY-A4sk30

in Software,   Video, Posted by logu_ii