By
Paula ParisiOctober 11, 2023
OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability
By
ETCentricSeptember 14, 2023
ETC@USC is hosting its 9th vETC virtual conference at this year’s IBC in Amsterdam, September 15-18. The event — which highlights significant presentations of emerging technologies and their impact on the M&E industry — will explore how generative AI, machine learning, and other compelling new tools help simplify building 3D worlds and tackle today’s computer vision challenges. The sessions will be recorded and posted on ETC’s YouTube channel. For those attending IBC who may be interested in attending the sessions (located in Hall 7 at Booth C28), visit the program guide, which includes a full schedule and speaker bios. Continue reading vETC Coming to IBC 2023: The Path of Sustainable Innovation
By
Paula ParisiAugust 28, 2023
Amazon is using artificial intelligence to change the way viewers experience “Thursday Night Football” on Amazon Prime Video this season. Now in the second year of its 10-year NFL deal, Amazon joins Disney’s ESPN in using AI to change how people experience televised sports by parsing a variety of analytics and using machine learning to interpret 2D video into 3D for a variety of viewpoints on any play. Amazon is auto-generating highlights feeds for each game, so late arrivals can catch up. September 14 marks the debut of the new AI Prime features and the games in 1080p HDR. Continue reading Amazon Integrating AI to Modernize NFL Viewing Experience
By
ETCentricAugust 7, 2023
ETC@USC will host its 8th vETC virtual conference at SIGGRAPH 2023 in Los Angeles, August 8-10. The event – which highlights significant presentations of emerging technologies and their impact on the M&E industry – will explore how generative AI, machine learning, and other compelling new tools help simplify building 3D worlds and tackle today’s computer vision challenges. Three days of sessions will be recorded and posted on ETC’s YouTube channel. For those attending SIGGRAPH who may be interested in attending the sessions (located at Z by HP Booth 215), visit the program guide, which includes a full schedule and speaker bios. Continue reading ETC Will Host Sessions at SIGGRAPH Conference This Week
By
ETCentricAugust 4, 2023
ETC@USC will host its 8th vETC virtual conference at SIGGRAPH 2023 in Los Angeles, August 8-10. The event – which highlights significant presentations of emerging technologies and their impact on the M&E industry – will explore how generative AI, machine learning, and other compelling new tools help simplify building 3D worlds and tackle today’s computer vision challenges. Three days of sessions will be recorded and posted on ETC’s YouTube channel. For those attending SIGGRAPH who may be interested in attending the sessions (located at Z by HP Booth 215), visit the program guide, which includes a full schedule and speaker bios. Continue reading ETC Will Host Sessions at SIGGRAPH Conference Next Week
By
Paula ParisiJune 23, 2023
Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have introduced a computer vision system that combines image recognition and image generation technology into one training model instead of two. The result, MAGE (short for MAsked Generative Encoder) holds promise for a wide variety of use cases and is expected to reduce costs through unified training, according to the team. “To the best of our knowledge, this is the first model that achieves close to state-of-the-art results for both tasks using the same data and training paradigm,” the researchers said. Continue reading MAGE AI Unifies Generative and Recognition Image Training
By
Paula ParisiJune 15, 2023
Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself to learn how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI techto learn more like humans do, with their ability to figure out complex tasks and adapt to new situations. Continue reading Meta Develops Computer Vision AI That Learns Like Humans
By
Paula ParisiMarch 31, 2023
Apple has acquired WaveOne, a Mountain View-based startup that has been developing AI algorithms for video compression. Cupertino has been mum about the purchase, but the deal reportedly closed in January, and WaveOne employees are said to have been absorbed into Apple’s machine learning groups. WaveOne’s codecs use machine learning to squeeze more picture out of less bandwidth, including optimizing for signal interruptions, so the picture doesn’t freeze or disappear, making it ideal for mobile. As Netflix and YouTube tout picture improvements, WaveOne could potentially advantage Apple TV+ and a mixed reality headset. Continue reading Apple Eyes AI Video Compression with WaveOne Acquisition
By
Paula ParisiFebruary 10, 2023
Pinterest grew Q4 year-over-year revenue by 4 percent, to $877 million, while full year sales jumped 9 percent in 2022 totaling $2.8 billion. The company said that global monthly active users also grew by 4 percent in the three month period ending December 31, to a total of 450 million. CEO Bill Ready emphasized on the earnings call the intent to eventually “make every pin shoppable.” Similar to how it is monetizing still images Pinterest is focusing on making videos “more actionable” by applying what it calls “our computer vision technology.” Continue reading Pinterest Grows Its Active Users, Focuses on Video Shopping
By
Rachel Joy VictorJanuary 10, 2023
The design of truly contextual experiences — whether for utility or entertainment — requires a knowledge of both the user and the environment they are in. This becomes especially relevant when we think of what it means to build interesting mixed reality experiences. CES this year showcased a variety of computer vision AI software tools oriented towards understanding environmental context. At Eureka Park in the Venetian, however, MantiSpectra’s chip sensor technology provided a peek into the benefits for user experience enabled by environmental intelligence arising from hardware. Continue reading CES: Encoding Environmental Intelligence with New Chip Tech
By
Paula ParisiSeptember 30, 2022
Google is the latest tech giant to be swayed by the influence of TikTok and Instagram as it reimagines a more visual, discovery-centric type of search. That was major media’s takeaway from the third annual Google Search On event, which continued the trend of trying to find more intuitive ways to search, namely visually and vocally, by snapping a photo or asking your phone a question. Thanks to advances in artificial intelligence, the Alphabet company says it is “going far beyond the search box to create search experiences that work more like our minds.” Continue reading Google Search Reinvention Focuses on Visuals and Discovery
By
Paula ParisiOctober 1, 2021
During its streamed media event this week, Amazon introduced new devices including a wheeled robot named Astro and a sale-by-invitation-only Ring autonomous security drone for the home. While the unusual products added sizzle, the focus was largely on basics like its first smart thermostat, updates to the Echo speaker line and Ring security products. Several of the new products appear to target market share of products already on offer, including through Amazon, and many emphasize synergy among Amazon’s hardware brands. The company’s fee-based premium services were also emphasized. Continue reading New Amazon Devices Include Home Robot, Smart Thermostat
By
Debra KaufmanMay 27, 2021
IBM’s AI research unit debuted Project CodeNet, a dataset to develop machine learning models for software programming. The name is a take-off on ImageNet, the influential dataset of photos that pushed the development of computer vision and deep learning. Creating “AI for code” systems has been challenging since software developers are constantly discovering new problems and exploring different solutions. IBM researchers have taken that into consideration in developing a multi-purpose dataset for Project CodeNet. Continue reading IBM Project CodeNet Employs AI Tools to Program Software
By
Debra KaufmanMay 13, 2021
During its Think conference this week, IBM debuted Project CodeNet, an open-source dataset for benchmarking around AI for code. Project CodeNet consists of 14 million code examples, which makes it about 10 times larger than the most similar dataset, which has 52,000 examples. Project CodeNet also offers 500 million lines of code and 55 programming languages including C++, Java, Python, Go, COBOL, Pascal and Fortran, making it a Rosetta Stone for AI systems to automatically translate code into other programming languages. Continue reading IBM CodeNet Enables AI Translation of Computer Languages
By
Debra KaufmanApril 12, 2021
Facebook released an open-source AI data set of 45,186 videos featuring 3,011 U.S. actors who were paid to participate. The data set is dubbed Casual Conversations because the diverse group was recorded giving unscripted answers to questions about age and gender. Skin tone and lighting conditions were also annotated by humans. Biases have been a problem in AI-enabled technologies such as facial recognition. Facebook is encouraging teams to use the new data set. Most AI data sets comprise people unaware they are being recorded. Continue reading Facebook Counters AI Bias with a Data Set Featuring Actors