Microsoft Unveils AI Model That Comprehends Image Content

Microsoft researchers have unveiled Kosmos-1, a new AI model the company says analyzes images for content, performs visual text recognition, solves visual puzzles and passes visual IQ tests. It also understands natural language instructions. The new model is what’s known as multimodal AI, which means it accepts different modes of input, from text to audio and video. Mixing media is considered a key step in building artificial general intelligence (AGI) that can perform tasks in a manner approximating human performance. Examples from a Kosmos-1 research paper show it analyzing images and answering questions about them.

SMPTE Tech Summit: Understanding the Human Vision System

The first Saturday morning session of SMPTE’s Technology Summit On Cinema at NAB focused on factors that could impact the UHD TV rollout, including research on what humans are able to see and observe. During a panel titled “Understanding the Human Vision System,” Dr. Jenny Read of Newcastle University’s Institute of Neuroscience set the stage by discussing four parameters of vision: spatial resolution, temporal resolution, dynamic range, and color perception. Related studies from Dolby, EBU and EPFL were also presented.