Microsoft’s VASA-1 Can Generate Talking Faces in Real Time

Microsoft has developed VASA, a framework for generating lifelike virtual characters with vocal capabilities including speaking and singing. The first model, VASA-1, can perform the feat in real time from a single static image and a vocalization clip. The research demo showcases realistic audio-driven faces that can be adjusted to look in different directions or change expression, in video clips of up to one minute at 512 x 512 pixels and up to 40fps “with negligible starting latency,” according to Microsoft. The company says “it paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors.”

Nvidia Unveils New Tools for AI, the Metaverse at SIGGRAPH

Nvidia founder and CEO Jensen Huang shared his vision for a computer graphics industry transformed by AI, the metaverse and digital humans. “The combination of AI and computer graphics will power the metaverse, the next evolution of the Internet,” Huang told attendees at SIGGRAPH 2022 in Vancouver. To support this transformation, Nvidia unveiled the Avatar Cloud Engine (ACE) and discussed plans to build out the Universal Scene Description (USD) industry standard, which Huang called “the language of the metaverse.” New extensions for Omniverse and graphics workflow optimizations using machine learning were also part of the mix.