OpenAI Pushes Conversational Agents with Three New Models

OpenAI has debuted three new models for transcription and voice generation — gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. The text-to-speech and speech-to-text AI models are designed to help developers create AI agents with highly customizable voices. OpenAI claims these models will power natural and responsive voice agents, moving AI out of the text-based communications stage and into intuitive spoken conversations. The suite outperforms existing solutions in accuracy and reliability, OpenAI says, especially with “accents, noisy environments, and varying speech speeds,” making them well-suited for customer call centers and meeting notes. Continue reading OpenAI Pushes Conversational Agents with Three New Models

With Hotshot Purchase, xAI to Bring Generative Video to Grok

Elon Musk’s xAI has acquired generative video startup Hotshot to bring motion imaging to Grok 3. Released in February, Grok 3 adds Deep Search and Thinking and improved on its predecessor’s still imaging capabilities, but lacks generative video, a much-requested feature — one that could make Grok a freestanding competitor to OpenAI’s individual offerings: ChatGPT for text, Sora for video, and DALL-E for images. “Cool AI video coming soon!” was Musk’s comment to Hotshot’s acquisition announcement on the networking platform. Hotshot can generate clips of up to 10-seconds at 1280×720 pixels. Continue reading With Hotshot Purchase, xAI to Bring Generative Video to Grok

Baidu Releases New LLMs that Undercut Competition’s Price

Baidu has launched two new AI systems, the native multimodal foundation model Ernie 4.5 and deep-thinking reasoning model Ernie X1. The latter supports features like generative imaging, advanced search and webpage content comprehension. Baidu is touting Ernie X1 as of comparable performance to another Chinese model, DeepSeek-R1, but says it is half the price. Both Baidu models are available to the public, including individual users, through the Ernie website. Baidu, the dominant search engine in China, says its new models mark a milestone in both reasoning and multimodal AI, “offering advanced capabilities at a more accessible price point.” Continue reading Baidu Releases New LLMs that Undercut Competition’s Price

OpenAI and Google Press for Relief on Copyright, State Laws

OpenAI is urging the Trump Administration to declare AI training fair use, seeking unfettered access to copyrighted material for the purpose of educating models. The company is also asking for relief from state AI rules and more permissive AI export rules in a response to President Trump’s call for a U.S. “AI Action Plan.” The deadline to submit responses to the National Science Foundation and Office of Science & Technology Policy (OSTP) request for information (RFI) regarding the plan was Saturday. Google also publicized its response, which largely echoed OpenAI’s points. Continue reading OpenAI and Google Press for Relief on Copyright, State Laws

Snap Launches Generative AI Video Lenses for Platinum Subs

Snapchat has introduced AI Video Lenses for those paying $16 per month for its Platinum tier. Powered by Snap’s custom-built generative video model, the initial three releases are a fox that perches on your shoulder, rambunctious racoons and a large bouquet of flowers with a zoom out effect. After selecting an AI Video Lens and applying it to a Snap, the AI video generates in the background, auto-saving save to Memories while users are free to continue messaging and Snapping on the app. The resulting video can be shared with friends or to Stories and Spotlight. Continue reading Snap Launches Generative AI Video Lenses for Platinum Subs

OpenAI Ramps Up Its Agent Functions as Competition Surges

Feeling the pressure from the “open agent” movement and specifically Chinese startup Butterfly Effect and its new product Manus, OpenAI has expanded the capabilities of its own AI technology, launching new tools to help businesses and developers build their own agents. The company’s new Responses API has the functionality of two earlier tools, the Chat Completions API (facilitating ChatGPT queries and responses) and the Assistants API (for multi-step reasoning and file access). The company is also issuing an Agents SDK, a suite of tools for creating and deploying agents that bundles the Responses API. Continue reading OpenAI Ramps Up Its Agent Functions as Competition Surges

Startup Claims AI Agent Manus Is an Autonomy Breakthrough

Butterfly Effect is the latest Chinese AI firm to get global attention, having drummed up interest in Manus, positioned as a “general agent” that can scour online resources to produce reports. Companies like OpenAI and Google are competing in this space, called deep research. Butterfly Effect says Manus has surpassed OpenAI Deep Research on the GAIA benchmark and the world is listening. The Manus Discord server swelled to more than 138,000 members in the past weeks, and “invite codes” to gain access at this “invitation-only” phase are allegedly going for thousands of dollars on Chinese sales app Xianyu. Continue reading Startup Claims AI Agent Manus Is an Autonomy Breakthrough

Altman’s World Takes on Musk’s X in Race to Everything App

Rivalry between World Network, also known as OpenAI CEO Sam Altman’s “other company,” and Elon Musk’s X is heating up in an escalating race to be first out with an “everything app.” World Network is trying to accelerate adoption of a log-in system that relies on “ocular verification” — mapping the unique pattern of the iris — for “anonymous proof-of-human” validation. World already has a free app for iOS and Android, and recently launched a “mini app store” within it, including functions such as chat, transferring cryptocurrency and shopping for microloans. Continue reading Altman’s World Takes on Musk’s X in Race to Everything App

Amazon Plans an AI Push with Nova Reasoning Model, Agents

Amazon is ramping up its AI activity, reportedly planning to release its own advanced reasoning model as part of the company’s Nova family. The Nova line was introduced in December at re:Invent and the new addition could debut as early as June. Its reasoning prowess is being compared to the abilities of OpenAI’s o3-mini and DeepSeek-R1. But reports say Amazon is taking the hybrid reasoning approach embraced by Anthropic’s Claude 3.7 Sonnet (Amazon has a 10 percent stake in Anthropic). The e-retail giant is also preparing for an agentic AI push, having established a dedicated unit, reports say. Continue reading Amazon Plans an AI Push with Nova Reasoning Model, Agents

Meta Plans Its Own Standalone AI App to Take On ChatGPT

A standalone Meta AI app is in the works for Q2, according to sources familiar with the company’s plans. The move is aligned with Meta Platforms CEO Mark Zuckerberg’s stated intent to propel his company to the forefront of artificial intelligence by year’s end, vaulting ahead of competitors such as OpenAI, Alphabet, Anthropic and xAI. “This is going to be the year when a highly intelligent and personalized AI assistant reaches more than 1 billion people, and I expect Meta AI to be that leading AI assistant,” Zuckerberg said in January during a Q4 earnings call with analysts. Continue reading Meta Plans Its Own Standalone AI App to Take On ChatGPT

OpenAI’s GPT-4.5 Model Sees Patterns and Thinks Creatively

OpenAI is releasing a research preview of what it calls its “largest and best” chat model to date, GPT‑4.5, which scales unsupervised learning in pre-training and post-training. As a result, the new chat model has the ability to recognize patterns, draw connections, and generate creative insights without having to draw on time and energy consuming “reasoning.” GPT‑4.5 is currently available to ChatGPT Pro subscribers ($200 per month) and developers subscribing to OpenAI’s API tier. ChatGPT Plus and ChatGPT Team customers are expected to gain access this week. Continue reading OpenAI’s GPT-4.5 Model Sees Patterns and Thinks Creatively

Amazon’s AI-Powered Alexa+ is Agentic with Computer Vision

Over a year after teasing a next-gen Alexa virtual assistant, Amazon is releasing an AI-powered version called Alexa+. The new personal assistant can do things like order groceries for the household, facilitate event planning, manage smart home utilities and security, and, of course, shop online. “She’s smarter, more conversational, more capable,” according to Amazon SVP of Devices & Services Panos Panay. Strategically priced to entice the AI-curious into Amazon membership, Alexa+ costs $20 per month as a standalone service or comes free with Amazon Prime ($15 per month or $139 per year). Continue reading Amazon’s AI-Powered Alexa+ is Agentic with Computer Vision

Anthropic Introduces a New Claude Hybrid Reasoning Model

Anthropic has released a new frontier model, Claude 3.7 Sonnet, described as the industry’s first “hybrid AI reasoning model.” The new Claude is different in that it can both respond to questions in real time or, alternatively, “think” about a problem for a prolonged period of time — basically as long as a user would like. Users can choose between “near-instant responses or extended, step-by-step thinking that is made visible to the user” by selecting the appropriate “reasoning” capability for Claude, Anthropic says. Along with the new model, Anthropic is also debuting a command line tool for agentic coding, Claude Code. Continue reading Anthropic Introduces a New Claude Hybrid Reasoning Model

Perplexity Deep Research Productivity Tool Offers a Free Tier

“Deep research” is emerging as a model trend, with Perplexity’s Deep Research launching less than three weeks after OpenAI unveiled its own ChatGPT deep research agent, which followed Google’s similar Gemini feature. As its name implies, deep research is a productivity tool, designed to save time by having an AI agent scour materials, compiling data and analysis. Perplexity’s Deep Research “performs dozens of searches, reads hundreds of sources, and reasons through the material to autonomously deliver a comprehensive report,” across topics ranging “from finance and marketing to product research,” the company says. Continue reading Perplexity Deep Research Productivity Tool Offers a Free Tier

xAI Launches Grok 3 as Standalone and for X Premium+ Subs

Elon Musk’s xAI has released its latest AI model Grok 3, which the company is describing as the “smartest AI on Earth.” It includes reasoning capabilities and a new web analysis tool called DeepSearch that returns results “within seconds” and can refine specific sources, according to xAI. Grok 3 was trained with 200,000 Nvidia GPUs, resulting in improved response times and processing power. Future capabilities will include Voice Mode for conversational interaction and audio-to-text conversion. Access to Grok 3 is limited to X Premium+ subscribers or via a SuperGrok plan (that does not include X social features). Continue reading xAI Launches Grok 3 as Standalone and for X Premium+ Subs