Image Archives - Page 2 of 16

Alibaba’s Qwen VLo Generative AI Shows Images in Progress

By Paula Parisi
July 2, 2025

Chinese e-commerce giant Alibaba has released a new multimodal model called Qwen VLo that can understand and generate images. Available for free in preview through Qwen Chat, it can use image or text prompts to generate pictures, and accepts text in multiple languages, including Chinese and English. It can also edit, change backgrounds and switch styles, handling multiple image edits in sequence. An upgrade over January’s Qwen 2.5-VL release, Qwen VLo uses progressive generation, allowing users to see the image creation in progress, and Alibaba says it’s particularly good at making inline adjustments to fine-tune images. Continue reading Alibaba’s Qwen VLo Generative AI Shows Images in Progress

Google Doppl Lets You Try on Outfits Using Generative Video

By Paula Parisi
July 1, 2025

Google Labs is testing Doppl, an experimental app that uses AI to let you virtually try on clothes. Available on iOS and Android in the U.S., Doppl requires the user to upload a full body photo to which images of outfits can then be applied. It will work with various types of outfit photos, from pictures taken with a smartphone to screen grabs from shopping sites or social media. Doppl can also create AI-generated videos from a static image to give an idea of what the outfit would look like from different angles when worn. While Google hopes Doppl “helps you explore your style in new and exciting ways,” it cautions that the app “is in its early days and it might not always get things right.” Continue reading Google Doppl Lets You Try on Outfits Using Generative Video

Adobe Project Indigo iOS App Improves Smartphone Photos

By Paula Parisi
June 23, 2025

Adobe has released a camera app called Project Indigo that makes smartphone cameras more “SLR-like,” with “full manual controls, a more natural look and the highest image quality that computational photography can provide — in both JPEG and raw formats.” The Project Indigo app is available free, for now, on iOS from Adobe Labs, which quietly announced the product on its research website. The app aims to leverage a decade’s worth of advances in computational photography to help mobile photographers improve low-light and high dynamic range (HDR) image capture. Continue reading Adobe Project Indigo iOS App Improves Smartphone Photos

GenAI Powers Snapchat Tools for Creating AR Lenses, Bitmoji

By Paula Parisi
June 10, 2025

Snapchat is offering experimental augmented reality and generative AI tools through its new Lens Studio iOS app and web tool. According to the company, “you can generate your own AI effects, add your dancing Bitmoji to the fun, and express yourself with Lenses that reflect your mood or an inside joke.” The app responds to text prompts, producing filters that can be published to Snapchat. Snap previously offered generative AI capabilities only to professional creators as part of its Lens Studio. The company has also launched an app that lets users read and reply to messages using their Apple Watch. For professional developers, Snap’s Lens Studio has added tools to build Bitmoji games. Continue reading GenAI Powers Snapchat Tools for Creating AR Lenses, Bitmoji

Character.AI Goes Wide with AvatarFX, Adds Mobile Features

By Paula Parisi
June 9, 2025

Chatbot platform Character.AI is rolling out its video generator, AvatarFX, in general release after a month in closed beta. It’s also adding a sharing feature called Scenes and Streams that will serve content to Character.AI’s community feed, coming soon to mobile. Users can now tap AvatarFX to create up to five videos per day, starting by uploading a photo, choosing a voice and writing dialogue for the character. Character.AI started as 1:1 text chat in the summer of 2023. Now the company is “expanding into a multi-modal world” with “more ways for creators to build immersive narratives and dynamic experiences.” Continue reading Character.AI Goes Wide with AvatarFX, Adds Mobile Features

Google Photos Rolling Out Redesign and New AI Editing Tools

By Paula Parisi
June 3, 2025

Google is celebrating 10 years of Google Photos by introducing a redesign of the Photos editor, including helpful new tools. The Photos editor gets some AI editing features previously available only on Pixel phones as part of its generative AI Magic Editor. The Photos platform is also expanding access to its AI-powered text-to-image Reimagine and automatic framing and related features first introduced with the Pixel 9. The company explains there are currently more than 1.5 billion monthly Photos users that have stored 9+ trillion photos and videos. The updates reflect Google’s AI push as it continues to integrate Gemini across its growing family of products and services. Continue reading Google Photos Rolling Out Redesign and New AI Editing Tools

YouTube Shorts Powers New Visual Search with Google Lens

By Paula Parisi
June 2, 2025

YouTube is integrating Google Lens, allowing viewers to search elements of what they see while watching YouTube Shorts. The visual search enhancement aims to provide more ways to unearth information and discover content in an interactive, intuitive way. YouTube provides an example involving a Short filmed on location that features landmarks a viewer may be interested in visiting. In this example, users could ask Google Lens for related information to learn the name of the destination and helpful details regarding its culture and history, results that would appear over the video content as a visual overlay. YouTube began rolling out the Lens feature in beta to all viewers last week. Continue reading YouTube Shorts Powers New Visual Search with Google Lens

TikTok Offering ‘AI Alive’ Image-to-Video Generator in Stories

By Paula Parisi
May 15, 2025

TikTok AI Alive is a new image-to-video feature that can add sequential expression to selfies and add progressive hues to sunsets. Accessible through the platform’s Story Camera, AI Alive uses intelligent editing tools that give anyone, regardless of experience, “the ability to transform static images into captivating, short-form videos enhanced with movement, atmospheric and creative effects.” TikTok says it is prioritizing safety and transparency by adding a label to AI Alive stories, which will also have C2PA metadata embedded, traveling with the content even when it’s downloaded and shared elsewhere. Continue reading TikTok Offering ‘AI Alive’ Image-to-Video Generator in Stories

Pinterest Aims to Improve Discovery with Its AI Search Tools

By Paula Parisi
May 12, 2025

Pinterest has added more AI options for visual search, hoping to drive discovery. The idea is to make it easier to segment elements in Pin image searches, building databases of relevant or personally associated products. When users view a Pin, they’ll now also see AI-generated words that can be used to learn more about elements of an image that interest them, including informationally and to facilitate shopping. “We’re breaking down and decoding images so users can quickly search and shop for the details of an outfit — whether it’s an aesthetic, color palette, fit or product category,” according to Pinterest. Continue reading Pinterest Aims to Improve Discovery with Its AI Search Tools

Adobe Launches Its Content Authenticity App in Public Beta

By Paula Parisi
April 29, 2025

Adobe has released its free Content Authenticity web app in beta. The app is designed to help protect creators’ work and allows them to embed a request that generative AI models don’t use their work for training. Users can apply tags for up to 50 images at once. In addition to applying tags, users can customize and inspect Adobe Content Credentials using the the Adobe Content Authenticity browser extension for Google Chrome. The information is invisible until the inspection tool is opened and can include links to a creator’s social media account, website or other identifying attributes. Continue reading Adobe Launches Its Content Authenticity App in Public Beta

OpenAI Introduces New Models That Can Reason with Images

By Paula Parisi
April 18, 2025

OpenAI has released two new AI models that use images as part of their reasoning process, “thinking with images.” OpenAI o3 and o4-mini “are the smartest models we’ve released to date, representing a step change in ChatGPT’s capabilities for everyone from curious users to advanced researchers,” the company says. The new entries in the “o” series also have agentic capabilities and can independently “use and combine every tool within ChatGPT, including searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images.” Continue reading OpenAI Introduces New Models That Can Reason with Images

Cohere’s Multimodal Embed Model Organizes Enterprise Data

By Paula Parisi
April 17, 2025

As enterprises rely more heavily on AI integration to compile research and summarize things like meetings and email threads, the need for contextual search has become increasingly important. AI startup Cohere has released Embed 4 to make the task easier. Embed 4 is a multimodal embedding model that transforms text, images and mixed data (like PDFs, slides or tables) into numerical representations (or “embeddings”) for tasks including semantic search, retrieval-augmented generation (RAG) and classification. Supporting over 100 languages, Embed 4 has an extremely large context window of up to 128,000 tokens. Continue reading Cohere’s Multimodal Embed Model Organizes Enterprise Data

OpenAI Reportedly Has Prototype for Its Own Social Network

By Paula Parisi
April 17, 2025

OpenAI is working to build a social network that will compete against Elon Musk’s X and Meta’s Instagram, reports say. Though still in the early stages, the project is revolving around an internal prototype that is said to involve a social feed that leverages ChatGPT’s image generator. It’s unclear if an OpenAI social app would be standalone or integrated with ChatGPT, but either way it would most likely heighten the competition between rivals Musk and OpenAI CEO Sam Altman, who recently fended off an unsolicited offer by Musk to purchase his company for $97.4 billion. Continue reading OpenAI Reportedly Has Prototype for Its Own Social Network

Vertex AI Movie Studio Can Create Videos from Start to Score

By Paula Parisi
April 14, 2025

Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score

Google Firebase Now Full-Stack App Developer in a Browser

By Paula Parisi
April 11, 2025

Google has turned its Firebase backend-as-a-service (BaaS) platform into a full-stack AI workspace called Firebase Studio that builds custom apps in a browser-based environment. Available to anyone with a Google account during its preview phase, Google says Firebase Studio will be useful to beginners and pros alike, with Gemini-powered AI agents that can be used to automate the process of building, launching and monitoring mobile and web apps and related infrastructure. Firebase Studio “includes everything developers need to create and publish production-quality AI apps quickly, all in one place,” the company announced at Google Cloud Next 2025. Continue reading Google Firebase Now Full-Stack App Developer in a Browser