Google Teases Computer Vision, Conversational Capabilities of Gemini AI Ahead of Google I/O Event




Google shared a video on its social media platforms on Monday showcasing new capabilities of Gemini, its artificial intelligence (AI) chatbot. The video was released just a day before the company's annual developer-focused Google I/O event, where the tech giant is expected to make several AI announcements, including new features and possibly new AI models. Android 15 and Wear OS 5 are also likely to take center stage and could be unveiled during the event.

In a short video posted on X (formerly known as Twitter), Google's official account showed off the new capabilities of its in-house AI chatbot. The 50-second video highlighted notable improvements in its speech, giving Gemini a more emotive voice with modulations that make it sound more human. The video also highlighted new computer vision capabilities: the AI could pick up images on the screen and analyze them.

Gemini could also access the smartphone's camera, a capability it doesn't currently have. In the video, the user moved the camera around the space and asked the AI to describe what it saw. With almost no lag, the chatbot described the setup as a stage and, when prompted, even recognized the Google I/O logo and shared information about it.

The video didn't share more details about the AI, instead asking people to watch the event to learn more. The event could answer a few questions, such as whether Google is using a new large language model (LLM) for computer vision or an updated version of Gemini 1.5 Pro. Google may also reveal what else the AI can do with its computer vision. In particular, there are rumors that the tech giant could introduce Gems, said to be chatbot agents that can be designed for particular tasks, similar to OpenAI's GPTs.

While Google's event is expected to introduce new features to Gemini, OpenAI held its Spring Update event on Monday and unveiled its latest AI model, GPT-4o, which adds features to ChatGPT similar to those in the video shared by Google. The new model supports conversational speech, computer vision, real-time language translation and more.


