Llama Archives - ETCentric

Nvidia Positions Its NeMo Microservices for AI Agent-Building

By Paula Parisi
April 28, 2025

Nvidia has released NeMo microservices into general availability with version 25.4, pivoting its profile from a modular toolkit for creating custom generative AI models to emphasizing it as a platform for building AI agents at scale. As AI agents have become an in-demand commodity, Nvidia is leveraging the fact that NeMo’s capabilities seem purpose built to help them grow and thrive. Built around the Kubernetes open-source container management system, NeMo microservices are offered as “an end-to-end developer platform for creating state-of-the-art agentic AI systems,” according to Nvidia. Continue reading Nvidia Positions Its NeMo Microservices for AI Agent-Building

OpenAI Reportedly Has Prototype for Its Own Social Network

By Paula Parisi
April 17, 2025

OpenAI is working to build a social network that will compete against Elon Musk’s X and Meta’s Instagram, reports say. Though still in the early stages, the project is revolving around an internal prototype that is said to involve a social feed that leverages ChatGPT’s image generator. It’s unclear if an OpenAI social app would be standalone or integrated with ChatGPT, but either way it would most likely heighten the competition between rivals Musk and OpenAI CEO Sam Altman, who recently fended off an unsolicited offer by Musk to purchase his company for $97.4 billion. Continue reading OpenAI Reportedly Has Prototype for Its Own Social Network

Meta Unveils Multimodal Llama 4 Models, Previews Behemoth

By Paula Parisi
April 8, 2025

Meta Platforms has released its first Llama 4 models, a multimodal trio that ranges from the foundational Behemoth to tiny Scout, with Maverick in between. With 16 experts and only 17B active parameters (the number used per task), Llama Scout is “more powerful than all previous generation Llama models, while fitting in a single Nvidia H100 GPU,” according to Meta. Maverick, with 17B active parameters and 128 experts, is touted as beating GPT-4o and Gemini 2.0 Flash across various benchmarks, “while achieving comparable results to the new DeepSeek v3 on reasoning and coding with less than half the active parameters.” Continue reading Meta Unveils Multimodal Llama 4 Models, Previews Behemoth

Meta Plans Its Own Standalone AI App to Take On ChatGPT

By Paula Parisi
March 5, 2025

A standalone Meta AI app is in the works for Q2, according to sources familiar with the company’s plans. The move is aligned with Meta Platforms CEO Mark Zuckerberg’s stated intent to propel his company to the forefront of artificial intelligence by year’s end, vaulting ahead of competitors such as OpenAI, Alphabet, Anthropic and xAI. “This is going to be the year when a highly intelligent and personalized AI assistant reaches more than 1 billion people, and I expect Meta AI to be that leading AI assistant,” Zuckerberg said in January during a Q4 earnings call with analysts. Continue reading Meta Plans Its Own Standalone AI App to Take On ChatGPT

Highly Realistic Alibaba GenVid Models Are Available for Free

By Paula Parisi
February 28, 2025

Alibaba has open-sourced its Wan 2.1 video- and image-generating AI models, heating up an already competitive space. The Wan 2.1 family, which has four models, is said to produce “highly realistic” images and videos from text and images. The company has since December been previewing a new reasoning model, QwQ-Max, indicating it will be open-sourced when fully released. The move comes after another Chinese AI company, DeepSeek, released its R1 reasoning model for free download and use, triggering demand for more open-source artificial intelligence. Continue reading Highly Realistic Alibaba GenVid Models Are Available for Free

Meta’s Llama 3.3 Delivers More Processing for Less Compute

By Paula Parisi
December 10, 2024

Meta Platforms has packed more artificial intelligence into a smaller package with Llama 3.3, which the company released last week. The open-source large language model (LLM) “improves core performance at a significantly lower cost, making it even more accessible to the entire open-source community,” Meta VP of Generative AI Ahmad Al-Dahle wrote on X social. The 70 billion parameter text-only Llama 3.3 is said to perform on par with the 405 billion parameter model that was part of Meta’s Llama 3.1 release in July, with less computing power required, significantly lowering its operational costs. Continue reading Meta’s Llama 3.3 Delivers More Processing for Less Compute

Meta’s Investments in Adtech, AI, the Metaverse Yield Results

By Paula Parisi
November 1, 2024

Meta Platforms revenue was up 19 percent to $40.6 billion in Q3 compared to the same period one year earlier. Profit rose to $15.7 billion — a 35 percent increase from 2023. The company believes that its years of investments in adtech, artificial intelligence and the metaverse are starting to pay off. In Q3, Meta reported $23.2 billion in expenses and capital expenditures of $9.2 billion. And the company isn’t taking its foot off the accelerator, having increased its annual spending forecast by $1 billion to a minimum of $38 billion. Additionally, Meta’s advertising revenue for Q3 was just a tick under its high-end spend projection of $40 billion. Continue reading Meta’s Investments in Adtech, AI, the Metaverse Yield Results

‘EU AI Act Checker’ Holds Big AI Accountable for Compliance

By Paula Parisi
October 18, 2024

A new LLM framework evaluates how well generative AI models are meeting the challenge of compliance with the legal parameters of the European Union’s AI Act. The free and open-source software is the product of a collaboration between ETH Zurich; Bulgaria’s Institute for Computer Science, Artificial Intelligence and Technology (INSAIT); and Swiss startup LatticeFlow AI. It is being billed as “the first evaluation framework of the EU AI Act for Generative AI models.” Already, it has found that some of the top AI foundation models are falling short of European regulatory goals in areas including cybersecurity resilience and discriminatory output. Continue reading ‘EU AI Act Checker’ Holds Big AI Accountable for Compliance

Meta Unveils New Open-Source Multimodal Model Llama 3.2

By Paula Parisi
September 27, 2024

Meta’s Llama 3.2 release includes two new multimodal LLMs, one with 11 billion parameters and one with 90 billion — considered small- and medium-sized — and two lightweight, text-only models (1B and 3B) that fit onto edge and mobile devices. Included are pre-trained and instruction-tuned versions. In addition to text, the multimodal models can interpret images, supporting apps that require visual understanding. Meta says the models are free and open source. Alongside them, the company is releasing “the first official Llama Stack distributions,” enabling “turnkey deployment” with integrated safety. Continue reading Meta Unveils New Open-Source Multimodal Model Llama 3.2

New Microsoft Safety Tools Fix AI Flubs, Detect Proprietary IP

By Paula Parisi
September 26, 2024

Microsoft has released a suite of “Trustworthy AI” features that address concerns about AI security and reliability. The four new capabilities include Correction, a content detection upgrade in Microsoft Azure that “helps fix hallucination issues in real time before users see them.” Embedded Content Safety allows customers to embed Azure AI Content Safety on devices where cloud connectivity is intermittent or unavailable, while two new filters flag AI output of protected material. Additionally, a transparency safeguard providing the company’s AI assistant, Microsoft 365 Copilot, with specific “web search query citations” is coming soon. Continue reading New Microsoft Safety Tools Fix AI Flubs, Detect Proprietary IP

Latest Gemma 2 Models Emphasize Security and Performance

By Paula Parisi
August 7, 2024

Google has unveiled three additions to its Gemma 2 family of compact yet powerful open-source AI models, emphasizing safety and transparency. The company’s Gemma 2 2B is a 2.6 billion parameter update to the lightweight 2B parameter Gemma 2, with built-in improvements in safety and performance. Built on Gemma 2, ShieldGemma is a suite of safety content classifier models that “filter the input and outputs of AI models and keep the user safe.” Interoperability model tool Gemma Scope offers what Google calls “unparalleled insight into our models’ inner workings.” Continue reading Latest Gemma 2 Models Emphasize Security and Performance

Meta Calls New Llama the First Open-Source Frontier Model

By Paula Parisi
July 25, 2024

In April, Meta Platforms revealed that it was working on an open-source AI model that performed as well as proprietary models from top AI companies such as OpenAI and Anthropic. Now, Meta CEO Mark Zuckerberg says that model has arrived in the form of Llama 3.1 405B, “the first frontier-level open-source AI model.” The company is also releasing “new and improved” Llama 3.1 70B and 8B models. In addition to general cost and performance benefits, the fact that the Llama 3.1 405B model is open source “will make it the best choice for fine-tuning and distilling smaller models,” according to Meta. Continue reading Meta Calls New Llama the First Open-Source Frontier Model

Tough EU Laws Prompt Meta, Apple to Withhold New Products

By Paula Parisi
July 19, 2024

U.S. tech companies are fighting back against what they feel are overly oppressive European Union regulations by withholding products from that market. Meta Platforms will not release its next Llama multimodal AI model there, along with future products. Apple last month said certain Apple Intelligence AI features will not be released in the EU. Previously, tech companies would accommodate regional laws by adapting global strategies so they could do business everywhere with the same products. Given the restrictions of the Digital Markets Act and other EU rules, Big Tech is signaling that may no longer be possible. Continue reading Tough EU Laws Prompt Meta, Apple to Withhold New Products

IBM Introduces Granite LLMs for Enterprise Code Developers

By Paula Parisi
May 15, 2024

IBM has released a family of its Granite AI models to the open-source community. The series of decoder-only Granite code models are purpose-built to write computer code for enterprise developers, with training in 116 programming languages. These Granite models range in size from 3 to 34 billion parameters in base model and instruction-tuned variants. They offer a range of uses, from modernizing older code with new languages to optimizing programs for on-device memory constraints, such as might be experienced when conforming for mobile gadgets. In addition to generation, the models can repair and explain code. Continue reading IBM Introduces Granite LLMs for Enterprise Code Developers

Opera Browser Is Experimenting with Local Support for LLMs

By ETCentric Staff
April 9, 2024

Opera has become the first browser to add support for large language models (LLMs). At this point the feature is experimental, and available only on the Opera One Developer browser as part of the AI Feature Drops program. The update offers about 150 LLMs from more than 50 different families, including Meta’s LLaMA, Google’s Gemma, Mixtral and Vicuna. Opera had previously only offered local support for its own Aria AI, a competitor to Microsoft Copilot and OpenAI’s ChatGPT. The local LLMs are being offered for testing as a complimentary addition to Opera’s online Aria service. Continue reading Opera Browser Is Experimenting with Local Support for LLMs