Companies Turn to AI for New Approaches to Audio Solutions

To understand speech visually, by reading lips, in addition to aurally, is an advantage for which AI has been waiting, according to researchers at Meta Platforms (formerly Facebook). The company says it has developed a framework that learns by watching — Audio-Visual Hidden Unit BERT (AV-HuBERT) — and that it is 75 percent more accurate than competing automated speech recognition systems on several metrics. Meta claims that AV-HuBERT outperforms the former best audiovisual speech recognition system with only one-tenth the inuput, which makes it potentially useful with languages with little or no audio data. Continue reading Companies Turn to AI for New Approaches to Audio Solutions