GPT-3 Archives - ETCentric

France’s Mistral AI Makes Its Global Debut on Microsoft Azure

By ETCentric Staff
March 5, 2024

Paris-based startup Mistral AI has made an immediate splash in the world of artificial intelligence, securing partnerships with IBM, Microsoft and others nine months after its launch. The company is offering natural language processing models, including its flagship Mistral Large, which becomes only the second LLM (after OpenAI) to land a commercial berth on Microsoft’s Azure cloud, where Meta Platforms’ Llama 2 is available in preview. Boasting “top-tier reasoning capacities” and sophisticated conversational capabilities, Mistral Large specializes in “reasoning, analysis and generation (RAG), is multilingual and supports up to 32,000 tokens.” Continue reading France’s Mistral AI Makes Its Global Debut on Microsoft Azure

Senators Question Meta Platforms About Recent LLaMA Leak

By Paula Parisi
June 8, 2023

Meta Platforms CEO Mark Zuckerberg received a letter this week from Senators Richard Blumenthal and Josh Hawley of the Subcommittee on Privacy, Technology & the Law that took the executive to task for an online leak of the company’s LLaMA artificial intelligence system. The 65-billion parameter language model, which is still under development, was open-sourced in February. Available on request through Meta’s GitHub portal, it wound up on 4chan and BitTorrent “making it available to anyone, anywhere in the world, without monitoring or oversight,” the senators wrote. Continue reading Senators Question Meta Platforms About Recent LLaMA Leak

Microsoft Study: GPT-4 Nearing Artificial General Intelligence

By Paula Parisi
May 18, 2023

A March research paper by Microsoft has reopened discussion as to whether artificial intelligence is inching toward human reasoning, as the industry grapples with how an AI system can assimilate training data in a way that allows it to generate answers and promulgate ideas that weren’t programmed into it. Asked for a stable way to stack a book, nine eggs, a laptop, a bottle and a nail, the Microsoft AI generated a response researchers say hinted at artificial general intelligence, or AGI, a term used to connote an as yet theoretical type of machine learning that can duplicate human reasoning. Continue reading Microsoft Study: GPT-4 Nearing Artificial General Intelligence

Stability AI Debuts Open Source StableLM Foundation Model

By Paula Parisi
April 26, 2023

Stability AI has released StableLM, an open source language model that will compete with OpenAI’s GPT-4 to create apps like ChatGPT. The Alpha version of StableLM is available in 3 billion and 7 billion parameters, and the company promises 15 billion to 65 billion parameter models to come. “With the launch of the StableLM suite of models, Stability AI is continuing to make foundational AI technology accessible to all,” the London-based company said. The StableLM models can generate text and code to power various downstream applications with appropriate training. Continue reading Stability AI Debuts Open Source StableLM Foundation Model

OpenAI Targets Affordable AI with ChatGPT and Whisper APIs

By Paula Parisi
March 6, 2023

OpenAI is now allowing third-party developers integrate ChatGPT into their apps, a solution the company says will be a more cost-effective alternative. The language model can be used for more than chat, says OpenAI, which also has a new speech-to-text model called Whisper. The company is also touting gpt-3.5-turbo, calling it the “best model for many non-chat use cases.” With a major investment from Microsoft, and the eyes of the industry on it, OpenAI seems to be feeling some pressure to add earnings to the success it has as a thought leader. Continue reading OpenAI Targets Affordable AI with ChatGPT and Whisper APIs

Meta Toolformer Sidesteps AI Language Limits with API Calls

By Paula Parisi
February 21, 2023

With language models like ChatGPT dominating recent tech news, Meta Platforms has unveiled a new artificial intelligence platform of its own called Toolformer that breaks new ground in that it can teach itself to use external apps and APIs. The result, Meta says, is that Toolformer combines the conversational aptitude and other things large language models are good at while shoring up those areas in which it typically does not excel — like math and fact-checking — by figuring out how to use external tools like search engines, calculators and calendars. Continue reading Meta Toolformer Sidesteps AI Language Limits with API Calls

Business World Asks if Generative AI is Ready for Enterprise

By Paula Parisi
February 17, 2023

IT pros are grappling with the ways ChatGPT can be worked into the enterprise stack. The generative artificial intelligence from OpenAI has demonstrated the ability to compile reports, craft marketing pitches and write software code, which makes it seem convenient for business use. Yet concerns remain, including potential security risks and sometimes erratic or inappropriate data feedback. In the past week, one third-party tester had ChatGPT pledge love for its interlocutor, while another received a detailed lecture on why cow eggs are bigger than chicken eggs. Continue reading Business World Asks if Generative AI is Ready for Enterprise

OpenAI CTO Calls for Regulation as AI Tech Rapidly Expands

By Paula Parisi
February 9, 2023

Less than a week after UBS proclaimed ChatGPT a record-setter for the app with the fastest-growing user base, the popular AI chatbot has racked up accomplishments that include passing “a U.S. medical-licensing exam, a Wharton Business School MBA exam, and four major university law-school exams,” according to TIME, which couches it in the context of “a brilliant child.” Amidst the fusillade of publicity, OpenAI CTO Mira Murati, who led the teams behind both DALL-E and ChatGPT, says it’s “time to move toward regulating AI,” which “can be misused, or used by bad actors,” raising questions about global governance. Continue reading OpenAI CTO Calls for Regulation as AI Tech Rapidly Expands

CES: Generative AI Is Having Its ‘War of the Worlds’ Moment

By Yves Bergquist
January 13, 2023

ChatGPT came too late (end of November) to make a significant impact on CES this year, but the cacophony of opinions about the generative AI model definitely made its way to Vegas. The timing was perfect. Just as the crypto crash left the hype industry paralyzed, OpenAI launched ChatGPT in what now feels like a nerdy and frustrating tech version of the Rolling Stones’ Altamont concert in ’69 (with computer scientists as the Hells Angels). Make no mistake: this is a landmark achievement in machine learning — perhaps the single greatest since the 2006 paper by Hinton, Salakhutdinov, Osindero and Teh on backpropagation in deep neural networks. However, it’s critical that industries, including M&E, distinguish between hype and reality. Continue reading CES: Generative AI Is Having Its ‘War of the Worlds’ Moment

Businesses Experiment with DALL-E 2, Report Mixed Results

By Paula Parisi
August 12, 2022

OpenAI’s powerful text-to-image generator DALL-E 2 is still in beta, but businesses are already testing it for commercial use. Apparel firm Stitch Fix has been using it to visualize fabric and color personalization, while Heinz tapped the AI system for a marketing campaign. Cosmopolitan used it to design a magazine cover. Others have leveraged the image engine to generate logos and thumbnails. These early adopters are identifying technical issues that OpenAI says it is addressing as it readies DALL-E 2 for enterprise. Foremost among the complaints is the lack of a dedicated API for public use. Continue reading Businesses Experiment with DALL-E 2, Report Mixed Results

Nvidia Turbo Charges NeMo Megatron Large Training Model

By Paula Parisi
August 2, 2022

Nvidia has issued a software update for its formidable NeMo Megatron giant language training model, increasing efficiency and speed. Barely a year since Nvidia unveiled Megatron, this latest improvement further leverages the transformer engine architecture that has become synonymous with deep learning since Google introduced the concept in 2017. New features result in what Nvidia says is a 5x reduction in memory requirements and up to a 30 percent gain in speed for models as large as 1 trillion parameters, making NeMo Megatron better at handling transformer tasks across the entire stack. Continue reading Nvidia Turbo Charges NeMo Megatron Large Training Model

Google’s Imagen AI Model Makes Advances in Text-to-Image

By Paula Parisi
May 25, 2022

Google has released a research paper on a new text-to-image generator called Imagen, which combines the power of large transformer language models for text with the capabilities of diffusion models in high-fidelity image generation. “Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis,” the company said. Simultaneously, Google is introducing DrawBench, a benchmark for text-to-image models it says was used to compare Imagen with other recent technologies including VQGAN+CLIP, latent diffusion models, and OpenAI’s DALL-E 2. Continue reading Google’s Imagen AI Model Makes Advances in Text-to-Image

Nvidia Introduces New Architecture to Power AI Data Centers

By Paula Parisi
March 24, 2022

Nvidia CEO Jensen Huang announced a host of new AI tech geared toward data centers at the GTC 2022 conference this week. Available in Q3, the H100 Tensor Core GPUs are built on the company’s new Hopper GPU architecture. Huang described the H100 as the next “engine of the world’s AI infrastructures.” Hopper debuts in Nvidia DGX H100 systems designed for enterprise. With data centers, “companies are manufacturing intelligence and operating giant AI factories,” Huang said, speaking from a real-time virtual environment in the firm’s Omniverse 3D simulation platform. Continue reading Nvidia Introduces New Architecture to Power AI Data Centers

No-Code AI and Prediction Tools Bring Coding to the People

By Paula Parisi
March 22, 2022

A new AI revolution is underway, turning people who know little about coding into developers. Called “no code,” startups are emerging to productize this new category, which essentially lets people use familiar, clickable web interfaces and even natural language to automate tasks or create simple applications, while machine learning takes over the rest. Proponents predict it will be a game-changer, powering a brigade of “citizen developers” to leverage artificial intelligence without knowing how to write code. Startups entering the space include Juji, which makes creating AI chatbots as easy as programming PowerPoint. Continue reading No-Code AI and Prediction Tools Bring Coding to the People

Advances by OpenAI and DeepMind Boost AI Language Skills

By Paula Parisi
December 17, 2021

Advances in language comprehension for artificial intelligence are issuing from San Francisco’s OpenAI and London-based DeepMind. OpenAI, which has been working on large language models, says it now lets customers fine-tune its GPT-3 models using their own custom data, while the Alphabet-owned DeepMind is talking-up Gopher, a 280-billion parameter deep-learning language model that has scored impressively on tests. Sophisticated language models have the ability to comprehend natural language, as well as predict and generate text, requirements for creating advanced AI systems that can dispense information and advice or that are required to follow instructions. Continue reading Advances by OpenAI and DeepMind Boost AI Language Skills