ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability

OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability

Magic Studio from Canva Offers AI Design for All Skill Levels

Web-based design app Canva has raised the curtain on its AI-powered Magic Studio as part of the company’s 10-year anniversary outreach. Canva is positioning Magic Studio as collecting diverse AI tools to provide a “comprehensive AI-design platform” for business and home users that want to automate labor-intensive tasks like creating and editing images and outputting to different formats using generative artificial intelligence. Created for “the 99 percent of the world without complex design skills,” Canva’s Magic Studio offers many of the features now being built-in to smartphones and software suites, but easier and “all in one place.” Continue reading Magic Studio from Canva Offers AI Design for All Skill Levels

Adobe Launches Web Version of Photoshop with AI Features

Adobe has officially added Photoshop on the web as one of its Photoshop plans. The web version is geared to Photoshop newbies and comes complete with Adobe Firefly generative AI features including Generative Fill and Generative Expand. Adobe called it “a major milestone” since introducing Photoshop on the web in beta two years ago, starting with “an early preview of image editing capabilities.” Features now available for commercial use on the web include the ability to easily add or remove elements from any image, change a background, expand the frame, and create visuals using text-based prompts. Continue reading Adobe Launches Web Version of Photoshop with AI Features

Getty GenAI Tool for Images and Video Is Powered by Nvidia

Nvidia’s Picasso continues to gain market share among visual companies looking for an AI foundry to train models for generative use. Getty Images has partnered with Nvidia to create custom foundation models for still images and video. Generative AI by Getty Images lets customers create visuals using Getty’s library of licensed photos. The tool is trained on Getty’s own creative library and has the company’s guarantee of “full indemnification for commercial use.” Getty joins Shutterstock and Adobe among enterprise clients using Picasso. Runway and Cuebric are using it, too — and Picasso is still in development. Continue reading Getty GenAI Tool for Images and Video Is Powered by Nvidia

OpenAI’s ChatGPT Upgraded with ‘Talk’ Tech, Image Search

OpenAI is experimenting with new voice and image capabilities in ChatGPT. According to the company, users can now “speak with ChatGPT and have it talk back,” thanks to an intuitive new interface that, in addition to facilitating voice conversations, will allow users to show ChatGPT an image to discuss. “Snap a picture of a landmark while traveling and have a live conversation about what’s interesting about it,” OpenAI explains, alternatively suggesting you “snap pictures of your fridge and pantry to figure out what’s for dinner” or have it help with homework based on pictures of a math problem. Continue reading OpenAI’s ChatGPT Upgraded with ‘Talk’ Tech, Image Search

OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

OpenAI has released the DALL-E 3 generative AI imaging platform in research preview. The latest iteration features more safety options and integrates with OpenAI’s ChatGPT, currently driven by the now seasoned large language model GPT-4. That is the ChatGPT version to which Plus subscribers and enterprise customers have access — the same who will be able to preview DALL-E 3. The free chatbot is built around GPT-3.5. OpenAI says GPT-4 makes for better contextual understanding by DALL-E, which even in version 2 evidenced some glaring comprehension glitches. Continue reading OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

Google Introduces an AI Watermark That Cannot Be Removed

Google DeepMind and Google Cloud have teamed to launch what they claim is an indelible AI watermark tool, which if it works would mark an industry first. Called SynthID, the technique for identifying AI-generated images is being launched in beta. The technology embeds its digital watermark “directly into the pixels of an image, making it imperceptible to the human eye, but detectable for identification,” according to DeepMind. SynthID is being released to a limited number of Google’s Vertex AI customers using Imagen, a Google AI language model that generates photorealistic images. Continue reading Google Introduces an AI Watermark That Cannot Be Removed

Pinterest Touts AI and Amazon Partnership with Q2 Earnings

Social image pinboarding and shopping inspiration platform Pinterest touted its recently announced Amazon partnership and AI efforts as part of its Q2 2023 earnings, which showed a 6 percent gain in year-over-year revenue of $708 million, beating analyst expectations. Pinterest announced the multiyear partnership with Amazon that marked a Pinterest first for third-party ads. On the investor call, Pinterest CEO Bill Ready told analysts the company has been testing Amazon ads traffic and is “very pleased” with the early results. When users click on Amazon ads on Pinterest they land on Amazon’s site to complete their purchase. Continue reading Pinterest Touts AI and Amazon Partnership with Q2 Earnings

Google’s AI-Powered Search Delivers Relevant, Visual Results

Google is adding images and video to its Search Generative Experience (SGE), an AI-powered context tool the company began testing in May that some are already calling “the future of Google Search.” Those who have signed up for Search Labs and enabled SGE will begin seeing more multimedia at the top of their search results. The idea is to help searchers “get up to speed on a new topic, uncover quick tips for your specific questions or discover products and things to consider — with article links to dig deeper,” Google explains of its latest AI improvements. Continue reading Google’s AI-Powered Search Delivers Relevant, Visual Results

Apple Chatbot ‘Ajax’ Could Be Next Major Player in AI Space

Apple is reportedly developing tools it could use to enter the artificial intelligence space, joining rivals such as Microsoft and Google, which have already released popular products. In Cupertino, the company is said to have built a framework for large language models, which power AI-based chatbot offerings similar to Google’s Bard and OpenAI’s ChatGPT. Called Ajax, the platform is the basis for what is referred to inside the company as Apple GPT. Though Apple has built automation into its products for some time, it could now be preparing to make a direct play for the generative AI market. Continue reading Apple Chatbot ‘Ajax’ Could Be Next Major Player in AI Space

Wix AI Site Generator Builds Websites Using Only AI Prompts

Global SaaS and website creation platform Wix Ltd. will release an AI Site Generator that allows people to create websites using only natural language artificial intelligence prompts. The generator will include a suite of AI-powered capabilities, many of which Wix is already offering as part of its template-based site-building framework. The package “significantly streamlines the entire website-building, design and management process,” offering automated tools that provide the opportunity for Wix users to “operationalize and grow their businesses with never-before-seen ease,” the company co-founder and CEO Avishai Abrahami said. Continue reading Wix AI Site Generator Builds Websites Using Only AI Prompts

Adobe Pursues Ethical, Responsible AI in the Creative Space

As a next step in its advances in ethical AI, Adobe has announced its Firefly generative AI platform now supports text prompts in more than 100 international languages. The company says its Firefly AI app has generated over one billion images in Firefly and Photoshop since implementation in March. Adobe has also deployed artificial intelligence in Express, Illustrator and the Creative Cloud. Positioning its latest news as an expansion of global proportions, Adobe’s generative AI products will now support text prompts in native dialects in the standalone Firefly web service, with localization coming to more than 20 additional languages. Continue reading Adobe Pursues Ethical, Responsible AI in the Creative Space

Meta Develops Computer Vision AI That Learns Like Humans

Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself to learn how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI techto learn more like humans do, with their ability to figure out complex tasks and adapt to new situations. Continue reading Meta Develops Computer Vision AI That Learns Like Humans

Runway Makes Next Advance in Consumer Text-to-Video AI

Google-backed AI startup Runway has released Gen-2, an early entry among commercially available text-to-video models. Previously waitlisted in limited release, the commercial availability is impactful, since text-to-video is predicted as the next big bump in artificial intelligence, following the explosion of AI use generating text and images. While Runway’s solution may not be ready to serve as a professional video tool, this is the next step in development of tech expected to impact media and entertainment. Filmmaker Joe Russo recently predicted that within the next two years, AI may have the ability to create feature films. Continue reading Runway Makes Next Advance in Consumer Text-to-Video AI

Photo App Reimagine Brings Old Images to Life with AI Tools

Family history platform MyHeritage is releasing a mobile app called Reimagine that enables high-speed scanning of entire album pages to complement the company’s AI tools for restoring — and even facially animating — historical photos. Now users can easily import printed photos stored in albums by snapping page pictures on their iOS or Android device. The app will separate the individual photos, cropping and saving them as standalone images to which metadata can be added for indexing. The app also works with individual photos, or digital uploads from a camera roll. Continue reading Photo App Reimagine Brings Old Images to Life with AI Tools