TikTok Offering ‘AI Alive’ Image-to-Video Generator in Stories

TikTok AI Alive is a new image-to-video feature that can add sequential expression to selfies and add progressive hues to sunsets. Accessible through the platform’s Story Camera, AI Alive uses intelligent editing tools that give anyone, regardless of experience, “the ability to transform static images into captivating, short-form videos enhanced with movement, atmospheric and creative effects.” TikTok says it is prioritizing safety and transparency by adding a label to AI Alive stories, which will also have C2PA metadata embedded, traveling with the content even when it’s downloaded and shared elsewhere. Continue reading TikTok Offering ‘AI Alive’ Image-to-Video Generator in Stories

Pinterest Aims to Improve Discovery with Its AI Search Tools

Pinterest has added more AI options for visual search, hoping to drive discovery. The idea is to make it easier to segment elements in Pin image searches, building databases of relevant or personally associated products. When users view a Pin, they’ll now also see AI-generated words that can be used to learn more about elements of an image that interest them, including informationally and to facilitate shopping. “We’re breaking down and decoding images so users can quickly search and shop for the details of an outfit — whether it’s an aesthetic, color palette, fit or product category,” according to Pinterest. Continue reading Pinterest Aims to Improve Discovery with Its AI Search Tools

Adobe Launches Its Content Authenticity App in Public Beta

Adobe has released its free Content Authenticity web app in beta. The app is designed to help protect creators’ work and allows them to embed a request that generative AI models don’t use their work for training. Users can apply tags for up to 50 images at once. In addition to applying tags, users can customize and inspect Adobe Content Credentials using the the Adobe Content Authenticity browser extension for Google Chrome. The information is invisible until the inspection tool is opened and can include links to a creator’s social media account, website or other identifying attributes. Continue reading Adobe Launches Its Content Authenticity App in Public Beta

OpenAI Introduces New Models That Can Reason with Images

OpenAI has released two new AI models that use images as part of their reasoning process, “thinking with images.” OpenAI o3 and o4-mini “are the smartest models we’ve released to date, representing a step change in ChatGPT’s capabilities for everyone from curious users to advanced researchers,” the company says. The new entries in the “o” series also have agentic capabilities and can independently “use and combine every tool within ChatGPT, including searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images.” Continue reading OpenAI Introduces New Models That Can Reason with Images

Cohere’s Multimodal Embed Model Organizes Enterprise Data

As enterprises rely more heavily on AI integration to compile research and summarize things like meetings and email threads, the need for contextual search has become increasingly important. AI startup Cohere has released Embed 4 to make the task easier. Embed 4 is a multimodal embedding model that transforms text, images and mixed data (like PDFs, slides or tables) into numerical representations (or “embeddings”) for tasks including semantic search, retrieval-augmented generation (RAG) and classification. Supporting over 100 languages, Embed 4 has an extremely large context window of up to 128,000 tokens. Continue reading Cohere’s Multimodal Embed Model Organizes Enterprise Data

OpenAI Reportedly Has Prototype for Its Own Social Network

OpenAI is working to build a social network that will compete against Elon Musk’s X and Meta’s Instagram, reports say. Though still in the early stages, the project is revolving around an internal prototype that is said to involve a social feed that leverages ChatGPT’s image generator. It’s unclear if an OpenAI social app would be standalone or integrated with ChatGPT, but either way it would most likely heighten the competition between rivals Musk and OpenAI CEO Sam Altman, who recently fended off an unsolicited offer by Musk to purchase his company for $97.4 billion. Continue reading OpenAI Reportedly Has Prototype for Its Own Social Network

Vertex AI Movie Studio Can Create Videos from Start to Score

Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score

Google Firebase Now Full-Stack App Developer in a Browser

Google has turned its Firebase backend-as-a-service (BaaS) platform into a full-stack AI workspace called Firebase Studio that builds custom apps in a browser-based environment. Available to anyone with a Google account during its preview phase, Google says Firebase Studio will be useful to beginners and pros alike, with Gemini-powered AI agents that can be used to automate the process of building, launching and monitoring mobile and web apps and related infrastructure. Firebase Studio “includes everything developers need to create and publish production-quality AI apps quickly, all in one place,” the company announced at Google Cloud Next 2025. Continue reading Google Firebase Now Full-Stack App Developer in a Browser

AWS Updates Nova Reels and Adds Nova Sonic Voice Model

Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model

Midjourney Launches V7 Image Generator with Voice Prompts

Generative AI program Midjourney has issued V7 in alpha, marking its first new model in almost a year. Notable updates include personalization turned on by default, which users must first set up — a process Midjourney says takes 5 minutes — and can then toggle on or off at any time. Another new flagship feature, Draft Mode, lets users render lower resolution images at “half the cost and 10 times the speed,” according to Midjourney, emphasizing “it’s so fast that we change the prompt bar to a ‘conversational mode’ when you’re using it on Web.” Draft Mode also supports voice prompts. Continue reading Midjourney Launches V7 Image Generator with Voice Prompts

Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile

Alibaba Cloud has released Qwen2.5-Omni-7B, a new AI model the company claims is efficient enough to run on edge devices like mobile phones and laptops. Boasting a relatively light 7-billion parameter footprint, Qwen2.5-Omni-7B understands text, images, audio and video and generates real-time responses in text and natural speech. Alibaba says its combination of compact size and multimodal capabilities is “unique,” offering “the perfect foundation for developing agile, cost-effective AI agents that deliver tangible value, especially intelligent voice applications.” One example would be using a phone’s camera to help a vision impaired-person navigate their environment. Continue reading Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile

OpenAI Delivers Native GPT-4o Image Generator to ChatGPT

OpenAI has activated the multimodal image generation capabilities of GPT-4o, making it available to ChatGPT users on the Plus, Pro, Team and Free tiers. It replaces DALL-E 3 as the default image generator for the popular chatbot. GPT-4o’s accuracy with text, understanding of symbols and precision with prompts combined with well multimodal capabilities that allow the model to take cues from visual material have transformed its image capabilities from largely unpredictable to “consistent and context-aware,” resulting in “a practical tool with precision and power,” claims OpenAI. Continue reading OpenAI Delivers Native GPT-4o Image Generator to ChatGPT

Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Roblox Reveals Its Generative AI System Cube for 3D and 4D

San Mateo, California-based game developer Roblox has released a 3D object generator called Cube 3D, the first of several models the company plans to make available. Cube currently generates 3D models and environments from text, and in the future the company plans to add image inputs. Roblox says it is open-sourcing the tool, making it available to users on and off the platform. Cube will serve as the core generative AI system for Roblox’s 3D and 4D plans, the latter referring to interactive responsiveness. The launch coincides with the Game Developers Conference, running through Friday in San Francisco. Continue reading Roblox Reveals Its Generative AI System Cube for 3D and 4D

Baidu Releases New LLMs that Undercut Competition’s Price

Baidu has launched two new AI systems, the native multimodal foundation model Ernie 4.5 and deep-thinking reasoning model Ernie X1. The latter supports features like generative imaging, advanced search and webpage content comprehension. Baidu is touting Ernie X1 as of comparable performance to another Chinese model, DeepSeek-R1, but says it is half the price. Both Baidu models are available to the public, including individual users, through the Ernie website. Baidu, the dominant search engine in China, says its new models mark a milestone in both reasoning and multimodal AI, “offering advanced capabilities at a more accessible price point.” Continue reading Baidu Releases New LLMs that Undercut Competition’s Price