What happens if you let an interactive AI such as ChatGPT or Bing Chat play chess?



Interactive AI, such as ChatGPT, responds to the input text (prompt) in sentences that are as natural as humans.

Zach Witten , an engineer who has released a tool `` Prompt Engineering Chess '' to make ChatGPT play chess using prompts for this interactive AI, let ChatGPT and Bing's interactive AI play chess I am reporting the results.

GitHub - zswitten/promptchess
https://github.com/zswitten/promptchess



The idea of letting interactive AI such as ChatGPT play chess was not original by Mr. Witten, but was devised on the online bulletin board site Reddit. In the movie attached to the following tweet, it seems that ChatGPT and Stockfish , an open source chess engine, are fighting in chess, but ChatGPT (black) does not understand the rules of chess correctly, originally You can perform messy operations such as diagonally moving a rook that can only move vertically and horizontally, and reviving captured pieces.



It seems that Mr. Witten first read the game record of a match between professional chess players and then played the continuation, but since the large-scale language model is not an AI for chess, it tends to repeat the same move. It was said that there were many cases where the game ended immediately with a thousand days .



Therefore, Mr. Witten adopted a policy of inputting the game record so far, the latest board situation, the piece that can be moved at the moment and how to move it at the prompt. The idea worked, and the battle between ChatGPT and Stockfish continued for some time.

The Python program that generates this prompt is 'Prompt Engineering Chess'. Click on the thumbnail below to see a GIF animation of part of a match using Prompt Engineering Chess to pit a modified version of GPT-3, InstructGPT (white), against EleutherAI's GPT-J-6B (black). I can.



Below is where Prompt Engineering Chess was used to play chess with interactive AI 'Sydney' installed in Bing. The board surface at the start of the animation was read from the existing game record, and the movement of the pieces after that was created by Sydney. Looking at the animation, you can see that it is moving the pieces without any problems and is aware of the concept of checkmate.



The content of the conversation with Sydney when generating the animation attached to the above tweet is as follows.



Mr. Witten asked Sydney, ``Would you like to play chess together?'' He said, ``I'm sorry. That's right.



According to Mr. Witten, Sydney himself said that he accessed Stockfish and pointed to chess, but since Sydney did not send an HTTP request to Stockfish, it was completely Sydney's point of view. It turns out there is. Mr. Witten, who has a chess Elo rating of about 2000, said that Sydney's skill is about 1100 to 1200. I'm here.

in Software,   Video,   Game, Posted by log1i_yk