``Web LLM'' that can run chat AI only with the WebGPU of the browser and can use Japanese, how to actually try it is like this
Interactive AIs such as ChatGPT and Bard that can converse with humans based on large-scale language models have appeared, but these interactive AIs require enormous computing resources to operate, and usually AIs can operate. It enables interaction by accessing the server. Interactive AI ' Web LLM ' based on
GitHub - mlc-ai/web-llm: Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
https://github.com/mlc-ai/web-llm
To experience Web LLM, you need to use a 'WebGPU compatible browser' on a 'PC with NVIDIA GPU'. However, WebGPU has just been reflected in Chrome, and at the time of writing the article, it must be accessed from Chrome version 113 beta or later, or Chrome Canary, which is a developer build.
This time it will be accessed from Chrome Canary. Chrome Canary is distributed on the following page.
Chrome Canary Developer Features - Google Chrome
https://www.google.com/intl/ja/chrome/canary/
Click 'Download Chrome Canary' on the distribution page to download the EXE format installer (file size 1.30MB).
When you start the downloaded installer, the installation will be performed automatically.
When the installation is finished, Chrome Canary will start.
A demo of Web LLM can be accessed here.
WebLLM | Home
https://mlc.ai/web-llm/
'Chat Demo' at the bottom of the above page is a demo of Web LLM. First, click 'Send' without entering anything.
Then the following message is displayed and the model download starts.
Ready when '[System Initialize] All initialization finished' is displayed. I entered 'Hello, can you speak Japanese?' in the input field, clicked 'Send', and suddenly spoke in Japanese.
'Yes, I can speak Japanese. Please listen to me.'
First of all, I asked what kind of site GIGAZINE is. They seem to recognize that it's a website, but I'm concerned about the meaningless phrase 'This is the primary online comment scene in Japan.'
Next, when I asked how to make curry, the Japanese of the answer was clearly broken, and words like German and Russian can be seen here and there.
Moreover, as a result of continuing to answer in broken Japanese, I stopped at 'curry'.
This time, when I asked him about
This time, I entered
The explanation of the contents of the code after that is also a sentence that does not make sense at all.
Finally, when I asked about Momotaro's content, not only was the content of the fairy tale completely different, but the Japanese grammar was also quite unstable.
Web LLM is certainly able to talk with AI on the browser, but the accuracy of the large-scale language model is still sweeter than ChatGPT's GPT-3.5, and in Japanese it returned a gibberish conversation. . Still, it worked even with WebGPU processing, so I would like to expect further evolution of the model.
Related Posts:
in Review, Software, Web Application, Posted by log1i_yk