Google has reportedly secured some of the top talent behind the AI voice startup Hume AI, marking another significant move in the competitive world of artificial intelligence. According to Wired, Google DeepMind has reached a new licensing agreement that will bring Hume AI’s CEO, Alan Cowen, along with several of its leading engineers, into the fold to work on enhancing Gemini’s voice capabilities.
While Hume AI will continue to license its technology to other AI companies, the exact financial details of this deal have not been disclosed. TechCrunch has reached out to both Google and Hume AI for confirmation, but the sources close to the matter suggest that the move is strategic, aiming to strengthen Google’s AI voice offerings in a rapidly growing market.
Around seven engineers from Hume AI will now collaborate with DeepMind, following a growing trend among tech giants: acquiring startup teams rather than the company itself. This approach, often referred to as an “acqui-hire,” enables leading AI firms to bring on specialized talent while avoiding potential regulatory scrutiny associated with larger acquisitions. Last year, Google acquired the CEO and other key researchers from AI coding startup Windsurf, and OpenAI has similarly integrated several startup teams in recent months, including those from Convogo and Roi. The Federal Trade Commission has indicated that it will be closely monitoring such deals to ensure competitive practices in the AI sector.
This latest move also underscores how voice technology is emerging as the next frontier in AI. Hume AI has carved a niche for itself with its unique models capable of detecting a user’s emotions and mood purely through their voice. This emotional intelligence in AI was brought to life in 2024 with the launch of Hume AI’s Empathetic Voice Interface, a conversational AI designed to engage users in a more human-like and emotionally aware manner. According to PitchBook, Hume AI has raised close to $80 million in funding to date and is projected to generate around $100 million in revenue this year, reflecting both the market demand and the value of its technology.
Voice-focused AI is not limited to Hume AI. Google has been steadily improving its Gemini Live feature, which enables users to have interactive conversations with its chatbot. Last month, Google introduced a new native audio model for the Live API, improving the AI’s ability to handle complex workflows and respond naturally in real-time interactions. This ongoing development shows Google’s commitment to building AI systems that feel more conversational, intuitive, and human-like, moving beyond simple text-based interactions.
Other major players in the industry are also making significant investments in voice technology. OpenAI, for instance, is reportedly revamping its audio models in preparation for an audio-first personal device, developed in collaboration with designer Jonny Ive’s firm, io. Leaks suggest the device could resemble earbuds and focus heavily on voice interaction, signaling a broader trend toward AI-powered wearable technology. Similarly, Meta has accelerated its AI audio efforts by acquiring Play AI last year. The social media giant has integrated voice and audio capabilities into its Ray-Ban smart glasses, allowing hands-free control for calls, texts, music, and photos, as well as assisting users in hearing conversations more clearly in noisy environments.
Investors and industry experts have highlighted the growing importance of voice as an input mode, particularly for wearables. “Voice is the only acceptable input mode for wearables,” investor Vanessa Larco told TechCrunch. “This acquisition will only accelerate the need for voice apps.” This perspective underscores a broader industry trend: as AI and wearables converge, natural, human-like voice interaction is becoming a critical feature for both devices and applications.
The demand for AI-powered voice solutions continues to rise rapidly. Earlier this month, ElevenLabs, a leading AI voice generation startup, reported surpassing $330 million in annual recurring revenue, reflecting the explosive growth and market appetite for advanced voice technology. Hume AI’s acquisition by Google DeepMind places the company in a prime position to influence how AI voices evolve in the next few years, potentially shaping the way humans interact with technology on a daily basis.
In addition to improving conversational AI, these developments also highlight the strategic importance of acquiring talent in a competitive industry. By bringing in Hume AI’s team, Google is not only gaining cutting-edge technology but also a group of experts with deep knowledge of emotional intelligence in AI. Such expertise is invaluable as tech companies race to create AI that can understand not just words, but the subtle cues and emotions behind them.
In conclusion, Google’s move to acquire the Hume AI team signals the increasing focus on voice technology as a key growth area for AI. With emotional intelligence, conversational abilities, and complex workflow management becoming critical features, voice-focused AI is set to become a defining element in the next generation of human-computer interaction. As more companies invest in voice capabilities, users can expect more natural, empathetic, and intuitive AI experiences across a range of devices and applications.
