Moshi AI Chatbot With Real-Time Voice Features Launched by Kyutai Labs as GPT-4o Rival



Kyutai Labs on Wednesday launched Moshi AI, an artificial intelligence (AI) chatbot that responds verbally in real time. The French AI firm said that Moshi's entire audio language model was developed in-house. It can also modulate its voice to express emotions and respond in various speech styles. The AI model can be accessed by the public for free, though conversations are currently limited to five minutes. Interestingly, OpenAI announced similar speech features alongside GPT-4o, but those features have not been released yet.

Features of Moshi AI

The company claims the AI model was developed in six months by a team of eight people. Presenting the model at an event in Paris, Kyutai Labs said that Moshi is not an AI assistant but a prototype that can be used to develop tools for different use cases. The company has also made the chatbot publicly available. Users can enter their email address and join the queue, although Gadgets 360 staff members were able to access the platform immediately, without any waiting time.

The interface of the platform is quite minimalistic. A simplified graphic lets users check how loud their voice is while speaking. A text box shows only the AI's responses. Another box near the top shows technical details such as audio duration, latency, and dropped audio.

At the top, there is a button to disconnect the call. Currently, calls are limited to five minutes. The description page highlights that Moshi can think, talk, and listen at the same time to maximize the flow of conversation.

Gadgets 360 found that latency is extremely low and the AI often responds instantly. In some cases, however, the response delay exceeded 10-15 seconds, which may have been due to heavy server load. At other times, verbal prompts did not register at all, even when the volume meter was three-quarters full.

Moshi AI interface
Photo credit: Kyutai Labs

Gadgets 360 also found that the AI model can respond with an emotive voice and can speak in different styles with various voice modulations. The model is also connected to the Internet and can answer queries that require searching the web. Notably, the chatbot does not accept text input; voice is the only means of interacting with it.

Kyutai Labs has stated that the AI model will be open source. However, the company has yet to host the weights and model code on a portal. Once they are available, users will be able to download and install the model locally and run it on an offline device.
