Zoom Introduces AI-Powered Productivity Tools for Workplace

Zoom is starting to launch its AI-powered tools first announced in October. Available to Zoom Workplace subscribers, the new Zoom Docs has been engineered from the ground up for AI optimization, leveraging Zoom’s AI Companion for what the company says will result in increased productivity and collaboration. Zoom Docs users will be able to open documents from within the videoconferencing app and can use generative AI to help write and edit them. The results will be easily shareable, Zoom says of its bid to compete with biggies like Google and Microsoft in the business productivity space. Continue reading Zoom Introduces AI-Powered Productivity Tools for Workplace

Black Forest Labs Announces Suite of Text-to-Image Models

A new generative AI startup called Black Forest Labs has hit the scene, debuting with a suite of text-to-image models branded FLUX.1. Based in Germany, Black Forest was founded by some of the researchers involved in developing Stable Diffusion and has raised $31 million in funding from principal investor Andreessen Horowitz and angels including CAA founder and former talent agent Michael Ovitz. The FLUX.1 suite focuses on “image detail, prompt adherence, style diversity and scene complexity,” the company says of its three initial variants: FLUX.1 [pro], FLUX.1 [dev] and FLUX.1 [schnell]. Continue reading Black Forest Labs Announces Suite of Text-to-Image Models

Runway’s Gen-3 Alpha Creates Realistic Video from Still Image

AI media firm Runway has launched Gen-3 Alpha, building on the text-to-video model by using images to prompt realistic videos generated in seconds. Navigate to Runway’s web-based interface and click on “try Gen 3-Alpha” and you’ll land on a screen with an image uploader, as well as a text box for those who either prefer that approach or want to use natural language to tweak results. Runway lets users generate up to 10 seconds of contiguous video using a credit system. “Image to Video is major update that greatly improves the artistic control,” Runway said in an announcement. Continue reading Runway’s Gen-3 Alpha Creates Realistic Video from Still Image

OpenAI Brings Advanced Voice Mode Feature to ChatGPT Plus

OpenAI has released its new Advanced Voice Mode in a limited alpha rollout for select ChatGPT Plus users. The feature, which is being implemented for the ChatGPT mobile app on Android and iOS, aims for more natural dialogue with the AI chatbot. Powered by GPT-4o, which is multimodal, Advanced Voice Mode is said to be able to sense emotional inflections, including excitement, sadness or singing. According to an OpenAI post on X, the company plans to “continue to add more people on a rolling basis” so that everyone using ChatGPT Plus will have access to the new feature in the fall. Continue reading OpenAI Brings Advanced Voice Mode Feature to ChatGPT Plus

Senate Introduces NO FAKES Act to Address Deepfakes and AI

The Senate has introduced the NO FAKES Act (Nurture Originals, Foster Art, and Keep Entertainment Safe) to protect artists — their voices and visual likenesses — from the proliferation of deepfakes and digital replicas created without consent. The bipartisan bill seeks to impose liability for damages to those who violate the proposed new law. If passed, the NO FAKES Act would be the first federal protection from AI image appropriation, supporters say. Those who’ve rallied to the cause include SAG-AFTRA, the Recording Industry Association of America, the Motion Picture Association, Disney and major talent agencies. Continue reading Senate Introduces NO FAKES Act to Address Deepfakes and AI

Apple Intelligence Preview and Updated iOS 18 Beta Released

Apple’s iOS 18 public beta 2 has arrived, with new wallpapers for CarPlay, a newly designed Hidden Apps folder in the Apps Library and the ability to use dark mode widgets in broad daylight, among other updates. Public beta 2 includes iPadOS 18, but does not include Apple Intelligence, which is expected this fall. However, a separate Apple Intelligence preview was introduced this week. In addition, a new Apple research paper leads some to believe its Apple Intelligence AI models were pre-trained in the cloud using Google Tensor Processing Units, leading to speculation that Big Tech be considering alternatives to Nvidia. But Apple has always been an AI outlier. Continue reading Apple Intelligence Preview and Updated iOS 18 Beta Released

Apple Joins the Safe AI Initiative as NIST Amps Up Outreach

The U.S. Commerce Department has issued a large package of material designed to help AI developers and those using the systems with an approach to identifying and mitigating risks stemming from generative AI and foundation models. Prepared by the National Institute of Standards and Technology and the AI Safety Institute, the guidance includes the initial public draft of its guidelines on “Managing Misuse Risk for Dual-Use Foundation Models.” Dual-use refers to models that can be used for good or ill. The release also includes an open-source software test called Dioptra. Apple is the latest to join the government’s voluntary commitments to responsible AI innovation. Continue reading Apple Joins the Safe AI Initiative as NIST Amps Up Outreach

Stable Video 4D Adds Time Dimension to Generative Imagery

Stability AI has unveiled an experimental new model, Stable Video 4D, which generates photorealistic 3D video. Building on what it created with Stable Video Diffusion, released in November, this latest model can take moving image data of an object and iterate it from multiple angles — generating up to eight different perspectives. Stable Video 4D can generate five frames across eight views in about 40 seconds using a single inference, according to the company, which says the model has “future applications in game development, video editing, and virtual reality.” Users begin by uploading a single video and specifying desired 3D camera poses. Continue reading Stable Video 4D Adds Time Dimension to Generative Imagery

OpenAI Begins Testing Prototype of New AI Search Features

San Francisco-based OpenAI revealed it is currently testing SearchGPT, a prototype of new AI search features that provides “fast and timely answers with clear and relevant sources.” The testing arrives as similar technology is made available by leading search services Google and Microsoft Bing. The SearchGPT prototype, featuring a user interface similar to that of OpenAI’s ChatGPT chatbot and virtual assistant, launched last week to a group of 10,000 test users and publishers who will be tapped for feedback. The plan is to iterate an improved version and then integrate SearchGPT directly into ChatGPT, although no timeline was provided. Continue reading OpenAI Begins Testing Prototype of New AI Search Features

Meta Calls New Llama the First Open-Source Frontier Model

In April, Meta Platforms revealed that it was working on an open-source AI model that performed as well as proprietary models from top AI companies such as OpenAI and Anthropic. Now, Meta CEO Mark Zuckerberg says that model has arrived in the form of Llama 3.1 405B, “the first frontier-level open-source AI model.” The company is also releasing “new and improved” Llama 3.1 70B and 8B models. In addition to general cost and performance benefits, the fact that the Llama 3.1 405B model is open source “will make it the best choice for fine-tuning and distilling smaller models,” according to Meta. Continue reading Meta Calls New Llama the First Open-Source Frontier Model

Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

Nvidia and French startup Mistral AI are jointly releasing a new language model called Mistral NeMo 12B that brings enterprise AI capabilities to the desktop without the need for major cloud resources. Developers can easily customize and deploy the new LLM for applications supporting chatbots, multilingual tasks, coding and summarization, according to Nvidia. “NeMo 12B offers a large context window of up to 128k tokens,” explains Mistral, adding that “its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category.” Available under the Apache 2.0 license, it is easy to implement as a drop-in replacement for Mistral 7B. Continue reading Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

Google, OpenAI, Nvidia and Others Form AI Security Coalition

A consortium of top tech firms have joined forces to launch a security group focused on artificial intelligence applications. The cybersecurity-focused non-profit OASIS will oversee operational aspects of the Coalition for Secure AI, to be known as CoSAI, described as an “open-source community.” OASIS lists Google, IBM, Intel, Microsoft, Nvidia and PayPal as founding Premier Sponsors of CoSAI, whose “additional founding sponsors” include Amazon, Anthropic, Cisco, Chainguard, Cohere, GenLab, OpenAI and Wiz. “CoSAI is an initiative to enhance trust and security in AI use and deployment,” OASIS announced at the Aspen Security Forum. Continue reading Google, OpenAI, Nvidia and Others Form AI Security Coalition

Microsoft Designer Adds AI Editing, Launches Mobile Release

Microsoft has officially moved its AI-powered Designer app out of preview, making the Canva competitor available to iOS and Android users. The app uses text prompts to generate images and designs for items such as logos, greeting cards, stickers and invitations. Powered by OpenAI’s DALL-E 3 image model, Designer is available as an app in Windows and as a free mobile app. New capabilities include the ability to edit existing designs and the addition of “prompt templates” to help users who are starting the design process with a blank canvas. “Just describe what you want to see, and Designer can create it for you,” explains Microsoft. Continue reading Microsoft Designer Adds AI Editing, Launches Mobile Release

Gemini Powering Google Vids Multimedia Presentation Builder

Google has launched the beta version of its Gemini-powered Google Vids productivity app, which lets users create work-related video presentations that embed documents, slides, audio recordings and even additional videos into a timeline. Incorporated into Workspace Labs, Google’s AI preview space, Google says invited participants can use Vids to “build a narrative with high quality templates” or “get to a first draft faster.” Access to Google’s royalty-free stock content library and Vids recording studio means a project can be completed “without ever leaving Workspace,” according to the company. Continue reading Gemini Powering Google Vids Multimedia Presentation Builder

Microsoft Targets Enterprise Productivity with Spreadsheet AI

Microsoft is working on a new productivity tool that helps artificial intelligence better understand spreadsheets. Still in the experimental phase, SpreadsheetLLM addresses challenges that are unique to applying AI to spreadsheets, “with their extensive two-dimensional grids, various layouts, and diverse formatting options,” the company explains. Hailed as a significant development in the enterprise space, where spreadsheets are used for everything from data entry to financial modeling and are shared among departments, Microsoft points out that as a research area spreadsheet-optimized AI has generally been overlooked in favor of flashier use-cases. Continue reading Microsoft Targets Enterprise Productivity with Spreadsheet AI