CES: Nvidia Avatar Cloud Engine Uses AI for Digital Characters

As part of what it calls “production microservices,” Nvidia is adding an Avatar Cloud Engine (ACE) that lets game developers, as well as those who make tools and middleware, to integrate generative AI models into the digital avatars created for games and applications. The new ACE microservices “let developers build interactive avatars using AI models such as Nvidia Omniverse Audio2Face (A2F), which creates expressive facial animations from audio sources, and Nvidia Riva automatic speech recognition (ASR), for building customizable multilingual speech and translation applications using generative AI,” Nvidia says. Continue reading CES: Nvidia Avatar Cloud Engine Uses AI for Digital Characters

OpenAI Rolls Out Open-Source Speech Recognition System

OpenAI has released a new open source AI speech recognition model called Whisper that can recognize and translate audio at levels it says compare in accuracy and robustness to human abilities. Case uses include transcription of speeches, interviews, podcasts and conversations. “Moreover, it enables transcription in multiple languages, as well as translation from those languages into English,” says OpenAI, which is open-sourcing models and inference code on GitHub “to serve as a foundation for building useful applications and for further research on robust speech processing.” Continue reading OpenAI Rolls Out Open-Source Speech Recognition System