Businesses Experiment with DALL-E 2, Report Mixed Results

OpenAI’s powerful text-to-image generator DALL-E 2 is still in beta, but businesses are already testing it for commercial use. Apparel firm Stitch Fix has been using it to visualize fabric and color personalization, while Heinz tapped the AI system for a marketing campaign. Cosmopolitan used it to design a magazine cover. Others have leveraged the image engine to generate logos and thumbnails. These early adopters are identifying technical issues that OpenAI says it is addressing as it readies DALL-E 2 for enterprise. Foremost among the complaints is the lack of a dedicated API for public use. Continue reading Businesses Experiment with DALL-E 2, Report Mixed Results

Nvidia Turbo Charges NeMo Megatron Large Training Model

Nvidia has issued a software update for its formidable NeMo Megatron giant language training model, increasing efficiency and speed. Barely a year since Nvidia unveiled Megatron, this latest improvement further leverages the transformer engine architecture that has become synonymous with deep learning since Google introduced the concept in 2017. New features result in what Nvidia says is a 5x reduction in memory requirements and up to a 30 percent gain in speed for models as large as 1 trillion parameters, making NeMo Megatron better at handling transformer tasks across the entire stack. Continue reading Nvidia Turbo Charges NeMo Megatron Large Training Model

Google’s Imagen AI Model Makes Advances in Text-to-Image

Google has released a research paper on a new text-to-image generator called Imagen, which combines the power of large transformer language models for text with the capabilities of diffusion models in high-fidelity image generation. “Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis,” the company said. Simultaneously, Google is introducing DrawBench, a benchmark for text-to-image models it says was used to compare Imagen with other recent technologies including VQGAN+CLIP, latent diffusion models, and OpenAI’s DALL-E 2. Continue reading Google’s Imagen AI Model Makes Advances in Text-to-Image

Nvidia Introduces New Architecture to Power AI Data Centers

Nvidia CEO Jensen Huang announced a host of new AI tech geared toward data centers at the GTC 2022 conference this week. Available in Q3, the H100 Tensor Core GPUs are built on the company’s new Hopper GPU architecture. Huang described the H100 as the next “engine of the world’s AI infrastructures.” Hopper debuts in Nvidia DGX H100 systems designed for enterprise. With data centers, “companies are manufacturing intelligence and operating giant AI factories,” Huang said, speaking from a real-time virtual environment in the firm’s Omniverse 3D simulation platform. Continue reading Nvidia Introduces New Architecture to Power AI Data Centers

No-Code AI and Prediction Tools Bring Coding to the People

A new AI revolution is underway, turning people who know little about coding into developers. Called “no code,” startups are emerging to productize this new category, which essentially lets people use familiar, clickable web interfaces and even natural language to automate tasks or create simple applications, while machine learning takes over the rest. Proponents predict it will be a game-changer, powering a brigade of “citizen developers” to leverage artificial intelligence without knowing how to write code. Startups entering the space include Juji, which makes creating AI chatbots as easy as programming PowerPoint. Continue reading No-Code AI and Prediction Tools Bring Coding to the People

Advances by OpenAI and DeepMind Boost AI Language Skills

Advances in language comprehension for artificial intelligence are issuing from San Francisco’s OpenAI and London-based DeepMind. OpenAI, which has been working on large language models, says it now lets customers fine-tune its GPT-3 models using their own custom data, while the Alphabet-owned DeepMind is talking-up Gopher, a 280-billion parameter deep-learning language model that has scored impressively on tests. Sophisticated language models have the ability to comprehend natural language, as well as predict and generate text, requirements for creating advanced AI systems that can dispense information and advice or that are required to follow instructions. Continue reading Advances by OpenAI and DeepMind Boost AI Language Skills

OpenAI Debuts Tool to Translate Natural Language into Code

OpenAI’s Codex, an AI system that translates natural language into code, was released via an API in private beta. Codex, trained on billions of lines of public code, can turn plain English commands into 12+ programming languages and also powers GitHub service Copilot that suggests whole lines of code within Microsoft Visual Studio and other development environments. OpenAI explained that Codex will be offered for free during an “initial period,” and invites “businesses and developers to build on top of it through the API.”

Continue reading OpenAI Debuts Tool to Translate Natural Language into Code

OpenAI and Microsoft Introduce $100 Million AI Startup Fund

OpenAI unveiled a $100 million OpenAI Startup Fund to fund early-stage companies pursuing ways that AI can have a “transformative” impact on healthcare, education, climate change and other fields. OpenAI chief executive Sam Altman said the Fund will make “big, early bets” on no more than 10 such companies. OpenAI, with funding from Microsoft and others, will manage the Fund. Selected projects will get “early access” to future OpenAI systems, support from OpenAI’s team and credits for Microsoft Azure. Continue reading OpenAI and Microsoft Introduce $100 Million AI Startup Fund

OpenAI and EleutherAI Foster Open-Source Text Generators

OpenAI’s GPT-3, the much-noted AI text generator, is now being used in 300+ apps by “tens of thousands” of developers and generating 4.5 billion words per day. Meanwhile, a collective of researchers, EleutherAI is building transformer-based language models with plans to offer an open source, GPT-3-sized model to the public for free. The non-profit OpenAI has an exclusivity deal with Microsoft that gives the tech giant unique access to GPT-3’s underlying code. But OpenAI has made access to its general API available to all comers, who then build services on top of it. Continue reading OpenAI and EleutherAI Foster Open-Source Text Generators

GPT-3: New Applications Developed for OpenAI’s NLP Model

OpenAI’s natural language processing (NLP) model GPT-3 offers 175 billion parameters, compared with its predecessor, GPT-2’s mere 1.5 billion parameters. The result of GPT-3’s immense size has enabled it to generate human-like text based on only a few examples of a task. Now, many users have gained access to the API, and the result has been some interesting use cases and applications. But the ecosystem is still nascent and how it matures — or whether it’s superseded by another NLP model — remains to be seen. Continue reading GPT-3: New Applications Developed for OpenAI’s NLP Model

CES: Sessions Examine the Potential of Quantum Computing

Two CES 2021 panels addressed the current state and anticipated advances in quantum computing, which is already being applied to problems in business, academia and government. However, the hardware is not as stable and robust as people would like, and the algorithms are not yet up to the task to solve the problems that many researchers envision for them. This has not stopped entrepreneurs, major corporations and governments from dedicated significant resources in R&D and implementations, nor from VCs and sovereign funds making major bets on who the winners will be. Continue reading CES: Sessions Examine the Potential of Quantum Computing

OpenAI Unveils AI-Powered DALL-E Text-to-Image Generator

OpenAI unveiled DALL-E, which generates images from text using two multimodel AI systems that leverage computer vision and NLP. The name is a reference to surrealist artist Salvador Dali and Pixar’s animated robot WALL-E. DALL-E relies on a 12-billion parameter version of GPT-3. OpenAI demonstrated that DALL-E can manipulate and rearrange objects in generated imagery and also create images from scratch based on text prompts. It has stated that it plans to “analyze how models like DALL·E relate to societal issues.” Continue reading OpenAI Unveils AI-Powered DALL-E Text-to-Image Generator

Fable Studio Bets on a Future with AI-Powered Virtual Beings

San Francisco-based Fable Studio, a VR studio that won an Emmy Award for its “Wolves in the Walls” project, has debuted its first efforts in creating conversational AI virtual beings. Charlie and Beck, two characters that can converse as if they were real people, are Fable Studio’s bet in the future of such virtual beings for entertainment and even companionship. Its first AI being was Lucy, an 8-year-old girl, who starred in “Wolves in the Walls” and is now a standalone online character after the company debuted her in alpha tests last month. Continue reading Fable Studio Bets on a Future with AI-Powered Virtual Beings

Virtual Event: GPT-3 and Its Implications for the M&E Industry

To fully examine the inner workings and potential impact of deep learning language model GPT-3 on media, ETC’s project on AI & Neuroscience in Media is hosting a virtual event on November 10 from 11:00 am to 12:15 pm. RSVP here to join moderator Yves Bergquist of ETC@USC and presenter Dr. Mark Riedl of Georgia Tech as they present, “Machines That Can Write: A Deep Look at GPT-3 and its Implications for the Industry.” The launch last June of OpenAI’s GPT-3, a language model that uses deep learning to generate human-like text, has raised many questions in the creative community and the world at large.  Continue reading Virtual Event: GPT-3 and Its Implications for the M&E Industry

Microsoft Inks Deal with OpenAI for Exclusive GPT-3 License

Microsoft struck a deal with AI startup OpenAI to be the exclusive licensee of language comprehension model GPT-3. According to Microsoft EVP Kevin Scott, the deal is an “incredible opportunity to expand our Azure-powered AI platform in a way that democratizes AI technology.” Among potential uses are “aiding human creativity and ingenuity in areas like writing and composition, describing and summarizing large blocks of long-form data (including code), converting natural language to another language.” Continue reading Microsoft Inks Deal with OpenAI for Exclusive GPT-3 License