By
Paula ParisiJuly 8, 2025
Google is making it easier to access its Gems customizable Gemini chatbots by bringing them to the side panel in Google Workspace apps, including Google Docs, Slides, Sheets, Drive, and Gmail. The task-specific Gems AI assistants are meant to help with common tasks and eliminate what Google calls “repetitive prompting.” Now they’ll be usable without even prompting Gemini to open. While Google offers pre-made Gems, they can also be customized or individually created to meet specific needs. Both custom and templated Gems can be installed in side panels, leveraging capabilities like @-mentioning or accessing files and folders. Continue reading Google Makes Gems Chatbots Available via Workspace Apps
By
Paula ParisiJuly 3, 2025
Cloudflare, which spent the past year introducing tools to help content providers prevent unwanted AI scraping, is launching a marketplace that lets websites charge for the privilege of using a “pay-per-crawl” model. The Internet infrastructure and security company says it is the first to enable blocking AI crawlers by default, providing access only with permission and, if wanted, compensation. As of July 1, AI companies can use Cloudflare’s marketplace to “clearly state their purpose — if their crawlers are used for training, inference, or search — to help website owners decide which crawlers to allow.” Continue reading Cloudflare Pay-per-Crawl Lets Publishers Monetize Scrapes
By
Paula ParisiJuly 2, 2025
Chinese e-commerce giant Alibaba has released a new multimodal model called Qwen VLo that can understand and generate images. Available for free in preview through Qwen Chat, it can use image or text prompts to generate pictures, and accepts text in multiple languages, including Chinese and English. It can also edit, change backgrounds and switch styles, handling multiple image edits in sequence. An upgrade over January’s Qwen 2.5-VL release, Qwen VLo uses progressive generation, allowing users to see the image creation in progress, and Alibaba says it’s particularly good at making inline adjustments to fine-tune images. Continue reading Alibaba’s Qwen VLo Generative AI Shows Images in Progress
By
Paula ParisiJune 26, 2025
Meta Platforms has announced new generative AI features developed for marketers that produce video advertisements. Announced as part of Meta’s Advantage+ ad suite, marketers can now use up to 20 product stills to create multi-scene video ads with music and text overlay. The company has also added generative AI voices and virtual try-ons to its Advantage+ marketer toolkit. The upgrades were announced at the Cannes Lions International Festival of Creativity, where the topic of Meta using AI not only to create ads but to algorithmically serve them to target audiences was a topic of conversation. Continue reading Meta Unveils New AI Advertising Tools as Part of Advantage+
By
Paula ParisiJune 18, 2025
Meta Platforms is opening its WhatsApp messaging service to advertising. The company revealed that three ad modules will roll out gradually. The ads will be positioned under WhatsApp’s Updates tab, a section discreet from WhatsApp’s users’ message inboxes and private chats. The Updates tab is also the entry point to WhatsApp’s Status feature, which lets users share photos, videos and text that disappear after 24 hours, similar to Instagram Stories. Meta says the Updates tab gets 1.5 billion visitors per day. The company is also seeking to monetize WhatsApp’s Channels feature by offering paid subscriptions and promoted Channels. Continue reading Meta Platforms Is Gradually Bringing Advertising to WhatsApp
By
Paula ParisiJune 10, 2025
Snapchat is offering experimental augmented reality and generative AI tools through its new Lens Studio iOS app and web tool. According to the company, “you can generate your own AI effects, add your dancing Bitmoji to the fun, and express yourself with Lenses that reflect your mood or an inside joke.” The app responds to text prompts, producing filters that can be published to Snapchat. Snap previously offered generative AI capabilities only to professional creators as part of its Lens Studio. The company has also launched an app that lets users read and reply to messages using their Apple Watch. For professional developers, Snap’s Lens Studio has added tools to build Bitmoji games. Continue reading GenAI Powers Snapchat Tools for Creating AR Lenses, Bitmoji
By
Paula ParisiJune 9, 2025
Chatbot platform Character.AI is rolling out its video generator, AvatarFX, in general release after a month in closed beta. It’s also adding a sharing feature called Scenes and Streams that will serve content to Character.AI’s community feed, coming soon to mobile. Users can now tap AvatarFX to create up to five videos per day, starting by uploading a photo, choosing a voice and writing dialogue for the character. Character.AI started as 1:1 text chat in the summer of 2023. Now the company is “expanding into a multi-modal world” with “more ways for creators to build immersive narratives and dynamic experiences.” Continue reading Character.AI Goes Wide with AvatarFX, Adds Mobile Features
By
Paula ParisiJune 9, 2025
China’s Manus AI has unveiled a text-to-video generator it says can transform “prompts into complete stories — structured, sequenced, and ready to watch. With a single prompt, Manus plans each scene, crafts the visuals, and animates your vision,” the company announced last week. Manus generated buzz in March for its agentic approach to AI, and now it is putting that autonomous technology to work on generative AI, promising story generation within minutes. Last month, the firm that developed Manus, Butterfly Effect, reportedly secured $75 million in funding led by U.S.-based Benchmark for a nearly $500 million valuation. Continue reading Manus AI Takes an Agentic Approach with Its Video Generator
By
Paula ParisiMay 12, 2025
Google is adding a “Simplify” feature for iOS users that uses AI to translate complex or technical text into language that aims to be easy to understand. Simplify leverages what Google calls “a novel prompt refinement approach developed by Google Research,” drawing on the company’s proprietary AI, Gemini, to make complicated writing “digestible — without losing key details.” Google’s research indicates people find Simplify’s plainspeak “significantly more helpful than the original complex text” and improved retention. “Simplify uses AI to make dense text on the web easier to understand — without leaving a web page,” Google explains. Continue reading Google Simplify App Makes Tough Text Easier to Understand
By
Paula ParisiApril 14, 2025
Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score
By
Paula ParisiApril 14, 2025
YouTube is rolling out a new tool called Music Assistant to its Creator Music marketplace. Music Assistant uses generative AI to automatically add royalty-free instrumental background music to long-form videos. Accessed via a dedicated tab in Creator Music, Music Assistant provides more control over things like the style, mood and instruments of the desired music, which is requested via text prompts. Creator Music is available in the U.S. to creators enrolled in the YouTube Partner Program. YouTube is also experimenting with a Shorts feature that automatically synchronizes audio for short-form video creators. Continue reading YouTube Brings Generative Music Assistant to Video Creation
By
Paula ParisiApril 10, 2025
Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model
By
Paula ParisiMarch 28, 2025
Alibaba Cloud has released Qwen2.5-Omni-7B, a new AI model the company claims is efficient enough to run on edge devices like mobile phones and laptops. Boasting a relatively light 7-billion parameter footprint, Qwen2.5-Omni-7B understands text, images, audio and video and generates real-time responses in text and natural speech. Alibaba says its combination of compact size and multimodal capabilities is “unique,” offering “the perfect foundation for developing agile, cost-effective AI agents that deliver tangible value, especially intelligent voice applications.” One example would be using a phone’s camera to help a vision impaired-person navigate their environment. Continue reading Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile
By
Paula ParisiMarch 27, 2025
OpenAI has activated the multimodal image generation capabilities of GPT-4o, making it available to ChatGPT users on the Plus, Pro, Team and Free tiers. It replaces DALL-E 3 as the default image generator for the popular chatbot. GPT-4o’s accuracy with text, understanding of symbols and precision with prompts combined with well multimodal capabilities that allow the model to take cues from visual material have transformed its image capabilities from largely unpredictable to “consistent and context-aware,” resulting in “a practical tool with precision and power,” claims OpenAI. Continue reading OpenAI Delivers Native GPT-4o Image Generator to ChatGPT
By
Paula ParisiMarch 27, 2025
Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5