Apple Launches Public Demo of Its Multimodal 4M AI Model

Apple has released a public demo of the 4M AI model it developed in collaboration with the Swiss Federal Institute of Technology Lausanne (EPFL). The technology debuts seven months after the model was first open-sourced, allowing informed observers the opportunity to interact with it and assess its capabilities. Apple says 4M was built by applying masked modeling to a single unified Transformer encoder-decoder “across a wide range of input/output modalities — including text, images, geometric and semantic modalities, as well as neural network feature maps.”
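The masked-modeling recipe described above can be sketched in a few lines of toy Python. The token scheme and function names below are our own illustration, not Apple's code: each modality is reduced to discrete tokens, everything is concatenated into one sequence, and a random subset is hidden for the model to predict.

```python
import random

def tokenize(sample):
    """Flatten a multimodal sample into (modality, token) pairs."""
    return [(modality, tok) for modality, toks in sample.items() for tok in toks]

def mask_tokens(tokens, mask_ratio=0.5, seed=0):
    """Split tokens into visible inputs and masked prediction targets."""
    rng = random.Random(seed)
    idx = list(range(len(tokens)))
    rng.shuffle(idx)
    n_masked = int(len(tokens) * mask_ratio)
    masked = set(idx[:n_masked])
    inputs = [t for i, t in enumerate(tokens) if i not in masked]
    targets = [(i, t) for i, t in enumerate(tokens) if i in masked]
    return inputs, targets

sample = {
    "text": ["a", "red", "car"],
    "image": [101, 102, 103, 104],  # e.g. discretized image-patch codes
    "depth": [7, 8],                # a geometric modality, also as tokens
}
tokens = tokenize(sample)
inputs, targets = mask_tokens(tokens)
print(len(tokens), len(inputs), len(targets))
```

In 4M's actual setup, a single encoder-decoder is trained to reconstruct the masked targets from the visible tokens, whatever modality each token came from.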

DeepMind’s V2A Generates Music, Sound Effects, Dialogue

Google DeepMind has unveiled new research on AI tech it calls V2A (“video-to-audio”) that can generate soundtracks for videos. The initiative complements the wave of AI video generators from companies ranging from biggies like OpenAI and Alibaba to startups such as Luma and Runway, all of which require a separate app to add sound. V2A technology “makes synchronized audiovisual generation possible” by combining video pixels with natural language text prompts “to generate rich soundscapes for the on-screen action,” DeepMind writes, explaining that it can “create shots with a dramatic score, realistic sound effects or dialogue.”
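The conditioning idea — pairing per-frame video features with a text-prompt embedding — can be illustrated with a toy sketch. This is our own simplification, not DeepMind's code: in V2A itself, the fused conditioning drives a diffusion model that produces an audio waveform aligned with the video.

```python
def fuse_conditioning(frame_features, text_embedding):
    """Concatenate each frame's features with the shared text embedding."""
    return [frame + text_embedding for frame in frame_features]

frames = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]  # 3 frames, toy visual features
text = [1.0, 0.0]  # toy embedding of a prompt like "dramatic score"
cond = fuse_conditioning(frames, text)
print(len(cond), len(cond[0]))  # one conditioning vector per frame
```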

Apple Unveils Progress in Multimodal Large Language Models

Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.”
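For a sense of scale, a back-of-the-envelope calculation (our arithmetic, not a figure from Apple) shows what merely holding 30 billion parameters in memory costs at 16-bit precision:

```python
# Memory footprint of model weights alone, ignoring activations and optimizer state.
params = 30e9
bytes_per_param = 2  # fp16/bf16
gib = params * bytes_per_param / 1024**3
print(round(gib, 1), "GiB")  # roughly 56 GiB of weights
```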

Startup Cognition Launches AI Software Coding Engine Devin

Months-old startup Cognition AI has emerged from stealth mode with Devin, a generative platform it is calling “the world’s first fully autonomous AI software engineer.” Although Cognition has yet to make Devin widely available, much less allow independent testing, if its claims are true it would mark a turning point in the AI coding space, moving it from a field of AI assistants to a full-fledged AI engineer. Based on natural language instruction, Devin could potentially take a project from concept to execution rather than simply suggesting code snippets or offering barebones frameworks.

ElevenLabs Promotes Its Latest Advances in AI Audio Effects

“What if you could describe a sound and generate it with AI?” asks startup ElevenLabs, which set out to do just that, and says it has succeeded. The two-year-old company explains it “used text prompts like ‘waves crashing,’ ‘metal clanging,’ ‘birds chirping,’ and ‘racing car engine’ to generate audio.” Best known for using machine learning to clone voices, the AI firm founded by Google and Palantir alums has yet to make its new text-to-sound model publicly available, but began teasing it by releasing online demos this week. Some see the technology as a natural complement to the latest wave of image generators.

Amazon Claims ‘Emergent Abilities’ for Text-to-Speech Model

Researchers at Amazon have trained what they are calling the largest text-to-speech model ever created, which they claim exhibits “emergent” qualities — an inherent ability to speak complex sentences naturally without having been explicitly trained on them. Called BASE TTS, for Big Adaptive Streamable TTS with Emergent abilities, the new model could pave the way for more human-like interactions with AI, reports suggest. Trained on 100,000 hours of public domain speech data, BASE TTS offers “state-of-the-art naturalness” in English as well as some German, Dutch and Spanish. Text-to-speech models are used in developing voice assistants for smart devices and apps, and in accessibility tools.

Apple’s Keyframer AI Tool Uses LLMs to Prototype Animation

Apple has taken a novel approach to animation with Keyframer, using large language models to add motion to static images through natural language prompts. “The application of LLMs to animation is underexplored,” Apple researchers say in a paper that describes Keyframer as an “animation prototyping tool.” Based on input from animators and engineers, Keyframer lets users refine their work through “a combination of prompting and direct editing,” the paper explains. The LLM can generate CSS animation code. Users can also use natural language to request design variations.
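The prompt-then-edit loop the paper describes can be sketched as follows. The prompt and the CSS below are our own illustration of the kind of output an LLM might produce, not output from Apple's tool:

```python
prompt = "Make the rocket in #rocket drift upward and fade out over 3 seconds."

# What an LLM might generate from the prompt above: standard CSS animation code.
generated_css = """
#rocket {
  animation: drift-up 3s ease-out forwards;
}
@keyframes drift-up {
  from { transform: translateY(0);      opacity: 1; }
  to   { transform: translateY(-120px); opacity: 0; }
}
""".strip()

# A "direct edit" in Keyframer's sense: the user tweaks the generated code by hand.
edited_css = generated_css.replace("3s", "5s")
print("@keyframes" in generated_css, "5s" in edited_css)
```

Because the output is ordinary CSS, the user can refine it either by re-prompting or by editing the code directly, which is the combination the paper highlights.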

Conversational Chatbot Optimizes Google Ads, Search Results

Google’s multimodal Gemini large language model will offer chat capabilities that help advertisers build and scale Search campaigns within the Google Ads platform using natural language prompts. “We’ve been actively testing Gemini to further enhance our ads solutions, and we’re pleased to share that Gemini is now powering the conversational experience,” Google said, explaining the functionality is now available in beta to English-language advertisers in the U.S. and UK, and will roll out globally to all English-language advertisers over the next few weeks, with additional languages offered in the months ahead.

CES: Rabbit Launches AI-Powered Pocket Controller for Apps

Santa Monica-based AI startup Rabbit Inc. is offering a virtual assistant in the form of a pocket device that the company says can improve upon mobile phones by learning to use your apps and running them for you. Heavily publicized at CES 2024 in Las Vegas this week, the initial run of the company’s r1 units had as of Tuesday sold out at $199 each. Preorders continue for the retro-looking device, which features a 2.88-inch touchscreen; shipments are scheduled to begin in late March. The company says its proprietary Rabbit OS is the first operating system built on a Large Action Model (LAM) foundation. LAMs are LLMs trained on datasets of actions and consequences.
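To make the "actions and consequences" framing concrete, here is a sketch of what a single LAM training record might look like. The schema is our assumption for illustration, not Rabbit's published format:

```python
from dataclasses import dataclass

@dataclass
class ActionRecord:
    intent: str        # natural-language goal from the user
    app: str           # application the action runs in
    action: str        # concrete UI step taken
    consequence: str   # observed result, usable as a training signal

record = ActionRecord(
    intent="order my usual coffee",
    app="coffee-app",
    action="tap('Reorder last purchase')",
    consequence="order confirmed, pickup in 10 min",
)
print(record.app, "->", record.consequence)
```

Training on many such records is what would let the model map a spoken intent directly to app operations, rather than to text alone.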

SageMaker HyperPod: Amazon Accelerates AI Model Training

Amazon has added five new capabilities to its SageMaker service, including SageMaker HyperPod, which accelerates large language and foundation model training and tuning. SageMaker HyperPod is said to shorten training time by up to 40 percent using purpose-built infrastructure designed for distributed training at scale. By optimizing acceleration, SageMaker Inference reduces foundation model deployment costs by 50 percent and latency by 20 percent on average, Amazon claims. “SageMaker HyperPod removes the undifferentiated heavy lifting involved in building and optimizing machine learning infrastructure,” said Amazon.
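As a rough sketch of what provisioning such a cluster involves, here is a hedged example of a HyperPod request. The field names follow the SageMaker CreateCluster API as we understand it — check the current AWS documentation before relying on them — and the bucket, script and role values are placeholders:

```python
request = {
    "ClusterName": "llm-training-demo",
    "InstanceGroups": [
        {
            "InstanceGroupName": "workers",
            "InstanceType": "ml.p4d.24xlarge",  # GPU nodes for distributed training
            "InstanceCount": 4,
            "LifeCycleConfig": {
                "SourceS3Uri": "s3://example-bucket/lifecycle/",
                "OnCreate": "on_create.sh",
            },
            "ExecutionRole": "arn:aws:iam::123456789012:role/ExampleHyperPodRole",
        }
    ],
}
# With AWS credentials configured, this would be submitted via
# boto3.client("sagemaker").create_cluster(**request).
print(request["InstanceGroups"][0]["InstanceCount"], "nodes requested")
```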

Meta Touts Its Emu Foundational Model for Video and Editing

Having made the leap from image generation to video generation over the course of a few months in 2022, Meta Platforms has introduced Emu, its first visual foundational model, along with Emu Video and Emu Edit, positioned as milestones in the trek to AI moviemaking. Emu uses just two diffusion models to generate four-second, 512×512 videos at 16 frames per second, Meta said, comparing that to 2022’s Make-A-Video, which required a “cascade” of five models. Internal research found Emu video generations were “strongly preferred” over the Make-A-Video model based on quality (96 percent) and prompt fidelity (85 percent).
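Meta's stated specs imply a fixed per-clip workload, which a quick calculation (our arithmetic, from the figures above) makes concrete:

```python
# Emu Video specs as reported: 512x512 resolution, four seconds at 16 fps.
width = height = 512
seconds, fps = 4, 16
frames = seconds * fps
pixels_per_clip = frames * width * height
print(frames, "frames,", f"{pixels_per_clip:,}", "pixels per clip")
```

So each clip is 64 frames — on the order of 16.8 million pixels that the two diffusion models must produce coherently.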

Aptos Teams with Microsoft Azure OpenAI on Web3 Solutions

Blockchain startup Aptos Labs will use the Microsoft Azure OpenAI Service to “explore innovative solutions” in blockchain and Web3 for technologies involving artificial intelligence, tokenization and payments. As part of the deal Aptos describes as a “partnership,” the company is launching Aptos Assistant, which will enable natural language prompts, making Web3 applications like smart contracts and decentralized apps more “user-friendly and secure” for “everyday Internet users and organizations” as well as developers. Aptos offers what is known as a Layer 1 blockchain, technology designed to facilitate transactions at scale.

Wix AI Site Generator Builds Websites Using Only AI Prompts

Global SaaS and website creation platform Wix Ltd. will release an AI Site Generator that allows people to create websites using only natural language artificial intelligence prompts. The generator will include a suite of AI-powered capabilities, many of which Wix is already offering as part of its template-based site-building framework. The package “significantly streamlines the entire website-building, design and management process,” offering automated tools that provide the opportunity for Wix users to “operationalize and grow their businesses with never-before-seen ease,” said company co-founder and CEO Avishai Abrahami.

IBM Bows Watsonx Suite of Enterprise AI Products, Services

With artificial intelligence development dating back to the 1950s, IBM was clearly ahead of its time. The company has quietly built a commercial portfolio, with more than 100 million customers across 20 industries using its Watson suite, the company says. At its annual Think conference, the company unboxed IBM Watsonx, a next-generation platform that leverages the scale and scope of foundation models to provide custom solutions for data-driven clients. Described as an “enterprise studio for AI builders,” Watsonx is an end-to-end framework that combines the tools, infrastructure and consulting expertise corporations can use to onboard AI.

Walmart Leans into AI, Retools Site to Compete with Amazon

Walmart has rolled out a new online look in a bid to catch up with Amazon, simultaneously advancing its conversational AI capabilities using OpenAI’s GPT-4 and Google’s BERT. Generative AI has reportedly been a major initiative at the Arkansas-based retailer since last year in key areas including search, supply chain management and virtual shopping, although only now is the company emphasizing the tools to customers by expanding offerings like Text to Shop. The text- or voice-activated way to add items to Walmart.com shopping carts is one of nearly two dozen conversational AI experiences at Walmart.