Woodpecker: Chinese Researchers Combat AI Hallucinations

The University of Science and Technology of China (USTC) and Tencent YouTu Lab have released a research paper on a new framework called Woodpecker, designed to correct hallucinations in multimodal large language AI models. “Hallucination is a big shadow hanging over the rapidly evolving MLLMs,” writes the group, describing the phenomenon as when MLLMs “output descriptions that are inconsistent with the input image.” Solutions to date focus mainly on “instruction-tuning,” a form of retraining that is data and computation intensive. Woodpecker takes a training-free approach that purports to correct hallucinations from the basis of the generated text. Continue reading Woodpecker: Chinese Researchers Combat AI Hallucinations

ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability

OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability

Yasa-1: Startup Reka Launches New AI Multimodal Assistant

Startup Reka AI is releasing in preview its first artificial intelligence assistant, Yasa-1. The multimodal AI is described as “a language assistant with visual and auditory sensors.” The year-old company says it “trained Yasa-1 from scratch,” including pretraining foundation models “from ground zero,” then aligning them and optimizing to its training and server infrastructures. “Yasa-1 is not just a text assistant, it also understands images, short videos and audio (yes, sounds too),” said Reka AI co-founder and Chief Scientist Yi Tay. Yasa-1 is available via Reka’s APIs and as docker containers for on-site or virtual private cloud deployment. Continue reading Yasa-1: Startup Reka Launches New AI Multimodal Assistant

Yahoo Spins Out Big Data Unit Vespa AI as Independent Firm

Yahoo is spinning out its Vespa platform, which leverages AI and data online at scale. The move is being positioned as an effort to make Vespa more widely available to third parties. After supporting Yahoo’s needs for 16 years, the unit in 2021 began serving external customers including Spotify, Wix and OkCupid for needs such as “searching millions of documents within a global organization, serving better data-driven online ads, or allowing AI-based language apps the ability to scale.” Yahoo says it will continue to invest in Vespa and remain its largest customer even after the split. Continue reading Yahoo Spins Out Big Data Unit Vespa AI as Independent Firm

Tubi Chooses ChatGPT to Power Content Recommendations

Fox Corporation’s Tubi TV video streaming service is rolling out a proprietary movie recommendation app called “Rabbit AI” in a beta test for iOS customers in the U.S., with other platforms to follow. Powered by OpenAI’s GPT-4, currently available only to enterprise and other paying customers, Rabbit AI provides “a new way to navigate” Tubi’s library of more than 200,000 movies and TV episodes, “providing hyper-personalized recommendations based on the contextual meaning of the terms,” the company says. A Rabbit AI plugin for ChatGPT is also now available to OpenAI subscribers, Tubi says. Continue reading Tubi Chooses ChatGPT to Power Content Recommendations

OpenAI’s ChatGPT Upgraded with ‘Talk’ Tech, Image Search

OpenAI is experimenting with new voice and image capabilities in ChatGPT. According to the company, users can now “speak with ChatGPT and have it talk back,” thanks to an intuitive new interface that, in addition to facilitating voice conversations, will allow users to show ChatGPT an image to discuss. “Snap a picture of a landmark while traveling and have a live conversation about what’s interesting about it,” OpenAI explains, alternatively suggesting you “snap pictures of your fridge and pantry to figure out what’s for dinner” or have it help with homework based on pictures of a math problem. Continue reading OpenAI’s ChatGPT Upgraded with ‘Talk’ Tech, Image Search

Spotify Uses AI to Copy Host Voices for Podcast Translations

Spotify is using AI to drive podcast language translation in what sounds like the podcaster’s own voice, which has obvious implications for film and television dubbing. Working with podcast notables including Dax Shepard, Monica Padman and Bill Simmons, Spotify used AI to mimic their voices in Spanish, French and German for several episodes. The proprietary Spotify technology uses OpenAI’s new text-to-speech voice-generation technology as well as its open-source Whisper speech recognition system, which transcribes spoken words into text. The result, Spotify says, is “more authentic” and “more personal and natural” than traditional dubbing. Continue reading Spotify Uses AI to Copy Host Voices for Podcast Translations

Amazon Plans to Invest Up to $4 Billion in AI Startup Anthropic

Amazon has entered into a strategic investment in San Francisco-based Anthropic, founded by former members of OpenAI. The AI startup will train and deploy future models using AWS Trainium and Inferentia chips to train and deploy future foundation models with AWS as its primary cloud provider. In turn, Amazon says it will invest up to $4 billion in Anthropic, as it strives to compete with other technology firms in the race to develop generative AI, seeding growth for what is shaping up to be an entirely new economic and social landscape. Continue reading Amazon Plans to Invest Up to $4 Billion in AI Startup Anthropic

Microsoft Unveils Next-Gen Surface Devices, New AI Features

During its Surface and AI event in New York City on Thursday, Microsoft introduced a pair of new Surface laptops and an array of generative AI upgrades to Bing Chat, Windows Copilot and more. Taking center stage in hardware was the company’s more powerful Surface Laptop Studio 2 and the ultra-portable Surface Laptop Go 3. Also unveiled was the Surface Go 4 for Business, the latest miniature version of its Surface Pro tablet, and the company’s large touchscreen Surface Hub, designed for office use. Beginning this month, Microsoft rolls out Copilot — “your everyday AI companion” — in a free Windows 11 update, followed by Bing, Edge, and Microsoft 365 this fall. Continue reading Microsoft Unveils Next-Gen Surface Devices, New AI Features

OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

OpenAI has released the DALL-E 3 generative AI imaging platform in research preview. The latest iteration features more safety options and integrates with OpenAI’s ChatGPT, currently driven by the now seasoned large language model GPT-4. That is the ChatGPT version to which Plus subscribers and enterprise customers have access — the same who will be able to preview DALL-E 3. The free chatbot is built around GPT-3.5. OpenAI says GPT-4 makes for better contextual understanding by DALL-E, which even in version 2 evidenced some glaring comprehension glitches. Continue reading OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

Google Links Bard AI to Apps Including YouTube, Docs, Drive

Google is implementing a plan to help its Bard AI become more competitive with OpenAI’s ChatGPT. Bard Extensions will allow English-language users to expand the chatbot’s knowledge repository to data from various Google apps, including Gmail, Google Docs, Google Drive, Google Maps, YouTube, and Google Flights and hotels, or even information stored “across multiple apps and services,” Google says. The update boosts search engine capabilities with the travel features, while providing some functionalities of a personal assistant by letting it identify missed emails or summarize the relevant points in a document. Continue reading Google Links Bard AI to Apps Including YouTube, Docs, Drive

UK’s Competition Office Issues Principles for Responsible AI

The UK’s Competition and Markets Authority has issued a report featuring seven proposed principles that aim to “ensure consumer protection and healthy competition are at the heart of responsible development and use of foundation models,” or FMs. Ranging from “accountability” and “diversity” to “transparency,” the principles aim to “spur innovation and growth” while implementing social safety measures amidst rapid adoption of apps including OpenAI’s ChatGPT, Microsoft 365 Copilot, Stability AI’s Stable Diffusion. The transformative properties of FMs can “have a significant impact on people, businesses, and the UK economy,” according to the CMA. Continue reading UK’s Competition Office Issues Principles for Responsible AI

Intuit’s GenOS Spawns Its First Customer AI Product: ‘Assist’

Financial software giant Intuit is adding a customer-facing AI assistant to work with individuals and small businesses. Intuit Assist is being integrated across Intuit products starting with TurboTax and expanding to QuickBooks, Credit Karma and Mailchimp. Assist will be embedded across Intuit’s products via a common user interface, allowing customers to get personalized recommendations via contextual datasets. The generative AI assistant was built using Intuit’s Generative AI Operating System, a proprietary corporate model dubbed GenOS, launched in June. Intuit is working with OpenAI to accelerate GenAI app development on GenOS. Continue reading Intuit’s GenOS Spawns Its First Customer AI Product: ‘Assist’

Demand for AI Chips Drives Nvidia to Revenue Record in Q2

Nvidia announced Q2 revenue of $13.51 billion, a 101 percent year-over-year increase that sets a new company record. The data center division — which accounts for the majority of AI chip sales — also established a new benchmark: $10.32 billion in Q2, a 171 percent leap over the prior fiscal Q2. Nvidia projects that revenue for the current quarter will hit $16 billion — about $3.5 billion above analysts’ expectations. Nvidia chips power OpenAI’s popular ChatGPT and other generative AI and cloud computing apps from companies including Amazon, Google, Meta Platforms, Microsoft and VMWare. Continue reading Demand for AI Chips Drives Nvidia to Revenue Record in Q2

AP Is Latest Org to Issue Guidelines for AI in News Reporting

After announcing a partnership with OpenAI last month, the Associated Press has issued guidelines for using generative AI in news reporting, urging caution in using artificial intelligence. The news agency has also added a new chapter in its widely used AP Stylebook pertaining to coverage of AI, a story that “goes far beyond business and technology” and is “also about politics, entertainment, education, sports, human rights, the economy, equality and inequality, international law, and many other issues,” according to AP, which says stories about AI should “show how these tools are affecting many areas of our lives.” Continue reading AP Is Latest Org to Issue Guidelines for AI in News Reporting