SAN JOSE, Calif.--(BUSINESS WIRE)--SoundHound Inc.®, the leading innovator in voice enabled AI and conversational intelligence technologies, today unveiled its large vocabulary, hybrid voice and natural language understanding interface for in-vehicle infotainment systems at the NVIDIA GPU Technology Conference (GTC) 2019. The event marks the first time the technology has been shown to the public, and highlights the NVIDIA DRIVE™ ecosystem collaboration between SoundHound Inc. and NVIDIA.
Leveraging the patented Speech-to-Meaning™ and Deep Meaning Understanding™ technologies from SoundHound Inc.’s Houndify Voice AI platform, running on NVIDIA DRIVE IX™, the solution enables real-time responses to voice queries in vehicles, even without Internet connectivity. This is achieved with high speed and accuracy through a hybrid speech recognition system that processes voice requests both in the cloud and locally on the embedded system (for when an internet connection is not available) to return fast responses. The embedded system also enables drivers to control their car’s functions when a connection to the cloud is unavailable including the car’s climate control, window controls, radio, navigation, and more.
NVIDIA DRIVE AGX integrates the high-performance, energy-efficient compute of the NVIDIA Xavier™ system-on-a-chip (SoC) and full stack AV software to monitor surroundings and the driver, localize to an HD map, and plan a safe path forward. Within DRIVE software, NVIDIA DRIVE IX is a framework for the full cockpit experience. It combines the system, tools, and algorithms to enhance the driver’s situational awareness, assist in driving functions and provide intelligent interactions between the vehicle and its occupants. This is the ideal platform for integrating the voice technology that Houndify can provide, enabling the vehicle to seamlessly respond to human voice commands.
“The NVIDIA DRIVE platform has enabled us to create an embedded solution for interacting with cars using voice and natural language,” said Keyvan Mohajer, Founder and CEO, SoundHound Inc. “By using NVIDIA GPUs for deep learning training, and the DRIVE IX platform for embedded computation using the GPU inside the Xavier SoC, we are able to scale to large vocabulary in natural language with the Houndify platform, maintaining speed and accuracy, even without a cloud connection.”
“Low-latency speech recognition is an important aspect of intelligent experiences in the vehicle,” said Danny Shapiro, senior director of automotive at NVIDIA. “SoundHound’s innovative solution on our open DRIVE IX platform will allow carmakers to offer systems that have an enormous vocabulary, understand a wide range of topics, and respond conversationally.”
With Houndify, drivers can now interact with hundreds of domains—programs that provide users with relevant information or actions related to their queries. These include: navigation, weather, stock prices, sports scores, flight status, local business searches, and hotel searches with complex criteria, among others.
SoundHound Inc. had been in stealth mode with its voice technology and natural language understanding research and development for a decade. The company quietly built the technology stack needed to create a complete solution that enables companies to deploy a customized, branded voice experience. The Houndify platform enables developers to use that proprietary technology in their own products. SoundHound Inc. has two consumer apps powered by Houndify: Hound, the voice assistant app, and SoundHound, the popular music discovery and lyrics app, with over 310 million unique downloads globally.
Developers interested in exploring the Houndify platform can visit www.houndify.com to sign up for a free trial and learn more. Additional information on NVIDIA Automotive solutions can be found by visiting http://www.nvidia.com/drive
About SoundHound Inc.:
SoundHound Inc. turns sound into understanding and actionable meaning. We believe in enabling humans to interact with the things around them in the same way we interact with each other: by speaking naturally to mobile phones, cars, TVs, music speakers, coffee machines, and every other part of the emerging ‘connected’ world. Our consumer product, Hound, leverages our Speech-to-Meaning technology to showcase a groundbreaking smartphone experience, and is the first product to leverage and showcase the Houndify platform. Our SoundHound product applies our technology to music, enabling people to discover, explore, and share the music around them, and even find the name of that song stuck in their heads by singing or humming. And through the Houndify platform, we empower developers to be part of the Speech-to-Meaning revolution. Mission: Houndify everything.