'Feels Like Paper!': An intuitive AI interface that uses paper
Unlike computers, paper is said to be good for reading books and leaving simple notes and doodles. Designer
'Feels Like Paper!' · Interfacing Artificial Intelligence through Paper
https://lukasmoro.com/paper
'Feels like Paper!' is a series consisting of three prototypes: 'Maths & Questions,' 'Mark & Comment,' and 'Draw & Dream.'
◆Maths & Questions
Maths & Questions is an application that allows users to write mathematical expressions on paper while wearing a mixed reality (MR) headset, and the application displays the results. The application uses a wrapper using Apple's Vision Framework API to automatically recognize equal signs and question marks, and then transmits the read mathematical expressions to a large-scale language model to calculate the results.
Apple's Math Notes feature, but on real paper.
— Lukas Moro (@lukas_moro) August 30, 2024
Could camera access on mixed reality headsets in combination with OCR & LLMs lead to a renaissance of physical paper as a productivity tool? pic.twitter.com/z9vlx2UGN1
All of the pseudo-characters were handwritten by Morrow on a computer in advance, and Morrow said, 'Each number and letter is animated as if it were being written by a friendly ghost.'
◆Mark & Comment
'Mark & Comment' allows users to highlight text on paper and have the same text highlighted on the computer in real time.
Another prototype about interfacing artificial intelligence through paper with Quest 3. Highlight and comment text on physical paper, it synchronizes the highlights and spoken comments with a digital version automatically.
https://t.co/DsYI170AFD pic.twitter.com/LlptksaP64 — Lukas Moro (@lukas_moro) September 23, 2024
In addition, if you hold the paper with your pinky finger and speak, your voice will be automatically recorded and entered into the transcription AI Whisper. The transcribed comment will automatically be displayed above the highlight.
In addition, the Mark & Comment process converts the text into a string of character codes using an OpenCV optical character recognition pipeline, and then forwards the prompt to GPT-4o: 'If highlighted text appears, please answer with text only. Otherwise, return an empty answer.' As a result, it becomes possible to recognize only the highlighted text and digitally highlight it.
◆Draw & Dream
'Draw & Dream' is an application that converts the act of drawing on paper into another video in real time. The way it works is that the user draws on paper and automatically inputs it into the image generation AI
Real-time image diffusion as a muse while painting on physical paper. Prototype using StreamDiffusion and Quest 3. https://t.co/zbnIywxsTc pic.twitter.com/2EqJf0GXAs
— Lukas Moro (@lukas_moro) October 18, 2024
According to Morrow, he implemented an image processing pipeline on Python to feed the drawn image to StreamDiffusion. He also introduced OpenCV to detect the four sides of the paper and input the image to StreamDiffusion. In addition, by using a ' Brute-Force matcher ' that matches features between consecutive frames and homography that allows you to always see the image from a bird's-eye view, it is possible to properly input even if the paper is partially obscured by a hand, pen, or brush.
Speaking about these prototypes, Morrow said: 'These three approaches frame the relationship between humans and AI in different ways. Maths & Questions proposes integrating or 'merging' with AI to complement our own intelligence. Mark & Comment harnesses the capabilities of AI to use it as an assistant to understand the world. Draw & Dream uses generative AI not just as a tool, but to help you create your own work and provide interactions that further inspire your creativity.'
Related Posts:
in Software, Posted by log1r_ut