Google is reportedly close to launching an AI agent called Project Jarvis, designed to operate within web browsers to automate everyday tasks. According to a report from The Information, Jarvis will be able to respond to user commands by taking frequent screenshots of the computer screen, interpreting them, and performing actions like clicking buttons or filling out forms. This tool will primarily work with Chrome and is intended to assist with tasks such as research, shopping, and flight bookings.
This development aligns with Google’s ongoing enhancements to its Gemini AI, which is set to unveil its next-gen model in December. Recently, Gemini Live, Google’s AI chatbot, has added support for numerous languages and integrated into various applications like Google Meet and Photos.
This news follows a similar announcement from Anthropic, which has introduced a feature for its Claude AI that allows it to use standard software and tools, currently available in public beta.