Meta Says Its AI-Compressed Audio Codec Beats MP3 by 10x

Meta Platforms says its vision for the metaverse will rely heavily on compression technology “to deliver high-quality, uninterrupted experiences for everyone.” With that in mind, it’s trained its Fundamental AI Research (FAIR) lab on developing “hypercompression” solutions. First up is EnCodec, an audio technology it says compresses at 64 kbps, with no loss in quality, and at 10 times the efficiency of MP3. The EnCodec protocol has the potential to  greatly improve the sound and reliability of speech over low-bandwidth (like when your mobile phone is only getting one bar). It also works for music. Continue reading Meta Says Its AI-Compressed Audio Codec Beats MP3 by 10x

Google Stadia Adds Party Stream and Resume Live Features

Google is introducing Stadia improvements including Party Stream, which lets players invite up to nine others to participate in a game session directly through the Stadia app, eliminating the need for a third-party intermediary. Friends can be invited to play along or just watch in any combination, limited to a total of 10. Stadia’s Party Stream chat makes voice and emoji reactions available. Party Stream is available beginning this week to desktop users and through the mobile web on Android. Also new, ”resume live stream” lets players switch Stadia games without having to end a live stream. Continue reading Google Stadia Adds Party Stream and Resume Live Features

Google and Amazon Use AI to Improve Speech Recognition

Google’s artificial intelligence researchers made an unexpected discovery with its new SpecAugment data augmentation model for automatic speech recognition. Rather than augmenting input audio waveforms, SpecAugment applies augmentation directly to the audio spectrogram. Researchers discovered, to their surprise, that models trained with SpecAugment out-performed all other speech recognition methods, even without a language model. Amazon also revealed research on improving Alexa’s speech recognition by 15 percent. Continue reading Google and Amazon Use AI to Improve Speech Recognition

Facebook Introduces Open-Source Image Processing Library

Facebook unveiled Spectrum, an open-source image processing library to help improve the quality and reliability of images uploaded through its own apps. Spectrum, which Facebook first showed publicly and launched in beta in November, is now on GitHub, available to the developer community. As higher quality cameras on smartphones have become a key selling point, consumers are dealing with larger image files, which can be a stumbling block since they eat up more device memory and more network bandwidth. Continue reading Facebook Introduces Open-Source Image Processing Library