OpenAI Adds Codex Software Agent to Some ChatGPT Plans

OpenAI is releasing its Codex agentic coding tool in research preview. Codex lets developers delegate simple, routine programming tasks to software engineering agents that can generate production-ready code, documenting the work as they go. Codex can work on many tasks in parallel, doing things like writing software features, answering questions about a codebase, fixing bugs, and proposing pull requests for review. According to OpenAI, “each task runs in its own cloud sandbox environment,” preloaded with the user’s repository. OpenAI began releasing Codex last week to ChatGPT Pro, Enterprise, and Team users, with support for Plus and Edu coming soon. Continue reading OpenAI Adds Codex Software Agent to Some ChatGPT Plans

Google Simplify App Makes Tough Text Easier to Understand

Google is adding a “Simplify” feature for iOS users that uses AI to translate complex or technical text into language that aims to be easy to understand. Simplify leverages what Google calls “a novel prompt refinement approach developed by Google Research,” drawing on the company’s proprietary AI, Gemini, to make complicated writing “digestible — without losing key details.” Google’s research indicates that people find Simplify’s plainspeak “significantly more helpful than the original complex text” and that it improves retention. “Simplify uses AI to make dense text on the web easier to understand — without leaving a web page,” Google explains. Continue reading Google Simplify App Makes Tough Text Easier to Understand

Google Launches Initiative for Positive Film, TV Views on Tech

Google has quietly launched a film and television production initiative called “100 Zeroes” to fund projects (initially from respected indie studios) that are positive about tech and could help promote a favorable take on Google’s own products and services. Google is teaming with talent management and production company Range Media Partners on the initiative. While product placement is expected to be one element (for example, a movie character uses an Android device rather than an iPhone), Google is reportedly more focused on a broader plan to promote a generally positive view of technology, especially among younger demographics such as Gen Z. Continue reading Google Launches Initiative for Positive Film, TV Views on Tech

Freepik Introduces a Responsibly Trained AI Image Generator

Online graphic design platform Freepik has unveiled F Lite, a text-to-image generator that the company says was trained only on licensed content, making it safe for commercial use. The 10 billion-parameter F Lite, currently available in two openly licensed versions, was developed in partnership with Fal.ai, a San Francisco-based AI startup that uses a proprietary inference engine and APIs to enable fast training, inference, and scaling of image, video, audio, and multimodal AI models. Freepik Head of AI Iván de Prado describes F Lite as “a significant milestone in open, responsible AI.” Continue reading Freepik Introduces a Responsibly Trained AI Image Generator

OpenAI Improves ChatGPT for Shopping with Built-In Pricing

OpenAI is expanding ChatGPT’s shopping capabilities, adding product recommendations to help users discover products and brands. The chatbot’s results for shopping queries will now automatically include things like prices, images and ratings, much like searches using Amazon or Google Shopping. The company says that products it features in shopping search results “are chosen independently and are not ads.” With the company under pressure to turn a profit, a challenge for many AI startups, that could of course change. The company is reportedly already working with partners to ensure pricing is up to date. Continue reading OpenAI Improves ChatGPT for Shopping with Built-In Pricing

OpenAI Introduces New Models That Can Reason with Images

OpenAI has released two new AI models that use images as part of their reasoning process, “thinking with images.” OpenAI o3 and o4-mini “are the smartest models we’ve released to date, representing a step change in ChatGPT’s capabilities for everyone from curious users to advanced researchers,” the company says. The new entries in the “o” series also have agentic capabilities and can independently “use and combine every tool within ChatGPT, including searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images.” Continue reading OpenAI Introduces New Models That Can Reason with Images

Anthropic Adds Deep Research, Google Integration to Claude

Anthropic has upgraded its AI assistant Claude, adding Research, an autonomous capability that integrates with Google Workspace. Claude can now search and reference content in Google Docs as well as communications in Gmail and events in Calendar. “With Research, Claude can search across both your internal work context and the web to help you make decisions and take action faster than before,” Anthropic explains, turning the model into a “true virtual collaborator” for enterprise clients. The expansion puts Anthropic into more direct competition with OpenAI and Microsoft as well as Google with Gemini in the AI productivity space. Continue reading Anthropic Adds Deep Research, Google Integration to Claude

OpenAI Reportedly Has Prototype for Its Own Social Network

OpenAI is working to build a social network that will compete against Elon Musk’s X and Meta’s Instagram, reports say. Though still in the early stages, the project revolves around an internal prototype said to involve a social feed that leverages ChatGPT’s image generator. It’s unclear if an OpenAI social app would be standalone or integrated with ChatGPT, but either way it would most likely heighten the competition between rivals Musk and OpenAI CEO Sam Altman, who recently fended off an unsolicited offer by Musk to purchase his company for $97.4 billion. Continue reading OpenAI Reportedly Has Prototype for Its Own Social Network

OpenAI’s Affordable GPT-4.1 Models Place Focus on Coding

OpenAI has launched a new series of multimodal models dubbed GPT-4.1 that represent what the company says is a leap in small model performance, including longer context windows and improvements in coding and instruction following. Geared to developers and available exclusively via API (not through ChatGPT), the 4.1 series comes in three variations: the flagship GPT‑4.1, GPT‑4.1 mini and GPT‑4.1 nano, OpenAI’s first nano model. Unlike Web-connected models (which have “retrieval-augmented generation,” or RAG, and can access up-to-date information), they are static knowledge models. Continue reading OpenAI’s Affordable GPT-4.1 Models Place Focus on Coding

OpenAI Closes the Largest Private Tech Funding Round Ever

OpenAI has closed a $40 billion funding round, a record for a private tech firm. The infusion gives the nine-year-old San Francisco startup a $300 billion valuation, making it the second most richly appraised private firm in the world, behind only SpaceX at $350 billion and tied with ByteDance, according to CNBC. The round was led by SoftBank Group, whose $30 billion contribution likely gives the Japanese holding company the second largest stake after Microsoft, which is said to have received a commitment for 49 percent of any profits in exchange for nearly $14 billion. Continue reading OpenAI Closes the Largest Private Tech Funding Round Ever

Runway Gen-4 Tackles AI’s Elusive Video Scene Consistency

Runway has introduced a new video generation model, launching a next phase of competition that could transform film production. Notably, its Gen-4 system improves the consistency of characters, locations and objects across multiple scenes, an elusive prospect for most AI video generators. The New York-based startup calls its new development “a step towards Universal Generative Models that understand the world.” The key, Runway says, is to provide a single reference image of the character, item or environment as part of the model’s project material. Runway Gen-4 can generate 5- and 10-second clips at 720p resolution. Continue reading Runway Gen-4 Tackles AI’s Elusive Video Scene Consistency

Amazon’s Nova Model Series Includes Nova Act for AI Agents

Amazon is formally rolling out its new Nova family of foundation models. Teased at the re:Invent conference hosted by AWS, details of the new multimodal series began leaking out this month. As part of the move, Amazon is diving into the agentic AI business with a new model called Nova Act, which is now in research preview. Nova Act is designed to control Web browser actions and independently tackle simple tasks. A Nova Act SDK is also being made available to allow developers to customize their own agents using the general-purpose Nova. The company is pushing for agents to help streamline business productivity. Continue reading Amazon’s Nova Model Series Includes Nova Act for AI Agents

OpenAI Delivers Native GPT-4o Image Generator to ChatGPT

OpenAI has activated the multimodal image generation capabilities of GPT-4o, making them available to ChatGPT users on the Plus, Pro, Team and Free tiers. The feature replaces DALL-E 3 as the default image generator for the popular chatbot. GPT-4o’s accuracy with text, understanding of symbols and precision with prompts, combined with multimodal capabilities that allow the model to take cues from visual material, have transformed its image capabilities from largely unpredictable to “consistent and context-aware,” resulting in “a practical tool with precision and power,” claims OpenAI. Continue reading OpenAI Delivers Native GPT-4o Image Generator to ChatGPT

Canvas and Live Video Add Productivity Features to Gemini AI

Google has added a Canvas feature to its Gemini AI chatbot that provides users with a real-time collaborative space where writing and coding projects can be refined and other ideas iterated and shared. “Canvas is designed for seamless collaboration with Gemini,” according to Gemini Product Director Dave Citron, who notes that Canvas makes Gemini “an even more effective collaborator” in helping bring ideas to life. The move reflects a trend in which AI companies are trying to turn chatbot platforms into turnkey productivity suites. Google is also launching a limited release of Gemini Live Video and bringing the Audio Overview feature of NotebookLM to Gemini. Continue reading Canvas and Live Video Add Productivity Features to Gemini AI

Real-Time Web Access Informs Claude 3.7 Sonnet Responses

Anthropic’s Claude can now search the Internet in real time, allowing it to provide timely and relevant responses that are also more accurate than what the chatbot previously offered, according to the company. Claude incorporates direct citations for its Web-retrieved material, so users can fact-check its sources. “Instead of finding search results yourself, Claude processes and delivers relevant sources in a conversational format.” While this is not exactly groundbreaking — ChatGPT, Grok 3, Copilot, Perplexity and Gemini all have real-time Web retrieval and most include citations — Claude takes a slightly different approach. Continue reading Real-Time Web Access Informs Claude 3.7 Sonnet Responses