Model Training Archives - Page 3 of 10

CES: Nvidia’s Cosmos Models Teach AI About Physical World

By Paula Parisi
January 14, 2025

Nvidia Cosmos, a platform of generative world foundation models (WFMs) and related tools to advance the development of physical AI systems like autonomous vehicles and robots, was introduced at CES 2025. Cosmos WFMs are designed to provide developers a way to generate massive amounts of photo-real, physics-based synthetic data to train and evaluate their existing models. The goal is to reduce costs by streamlining real-world testing with a ready data pipeline. Developers can also build custom models by fine-tuning Cosmos WFMs. Cosmos integrates Nvidia Omniverse, a physics simulation tool used for entertainment world-building. Continue reading CES: Nvidia’s Cosmos Models Teach AI About Physical World

Asteria and Moonvalley Team Up on Ethical AI for Hollywood

By Paula Parisi
January 6, 2025

Artist-led generative AI film and animation studio Asteria has joined with AI research startup Moonvalley to create Marey, “the entertainment industry’s first clean foundational AI model.” Trained exclusively on “ethically sourced data” owned by Asteria, Marey is named after the French physiologist who helped put the “motion” in motion pictures, Étienne-Jules Marey. Asteria says Marey is being trained for a 2025 debut targeting major Hollywood productions with a pitch that says it has not and will not rely on scraped data, something Big Tech competitors have taken heat on. Continue reading Asteria and Moonvalley Team Up on Ethical AI for Hollywood

Meta’s Llama 3.3 Delivers More Processing for Less Compute

By Paula Parisi
December 10, 2024

Meta Platforms has packed more artificial intelligence into a smaller package with Llama 3.3, which the company released last week. The open-source large language model (LLM) “improves core performance at a significantly lower cost, making it even more accessible to the entire open-source community,” Meta VP of Generative AI Ahmad Al-Dahle wrote on X social. The 70 billion parameter text-only Llama 3.3 is said to perform on par with the 405 billion parameter model that was part of Meta’s Llama 3.1 release in July, with less computing power required, significantly lowering its operational costs. Continue reading Meta’s Llama 3.3 Delivers More Processing for Less Compute

AI Boom Boosts Nvidia Sales by 94 Percent as Profits Double

By Paula Parisi
November 22, 2024

Nvidia sales were up 94 percent to $35 billion in the most recent quarter when profits more than doubled, to $19.3 billion, telegraphing the strength of the artificial intelligence boom that took the company from the top supplier of graphics boards for gaming PCs to the world’s most valuable public company with a market cap of $3.59 trillion. Nvidia founder and CEO Jensen Huang told analysts that demand for the company’s latest AI chip, Blackwell, has been “incredible,” driving projections of $3.59 trillion in revenue for the current quarter as customers begin to take shipments. Continue reading AI Boom Boosts Nvidia Sales by 94 Percent as Profits Double

Meta, Reuters Sign Multi-Year AI Content Licensing Agreement

By Paula Parisi
October 29, 2024

Marking its first news deal in years, Meta Platforms entered into an agreement with Reuters to use its content to answer user questions posed to its Meta AI chatbot. The arrangement comes as Meta has been minimizing news content on its services in response to publisher demands for revenue sharing and regulatory criticism over misinformation. Terms of the partnership were not disclosed, nor were details provided as to whether Meta plans to use Reuters content for model training. Meta AI is available across its Facebook, Whatsapp, Instagram and Messenger services. Continue reading Meta, Reuters Sign Multi-Year AI Content Licensing Agreement

Penguin Random House Warns All Against AI Model Training

By Paula Parisi
October 22, 2024

Penguin Random House, the world’s largest commercial book publisher, has updated the copyright disclaimer that appears in every book to say “no part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems.” The warning will roll out globally on all new releases as well as backlist titles that are reprinted. Tom Weldon, CEO of Penguin Random House UK, has told staff the company will at its discretion “use generative AI tools selectively and responsibly, where we see a clear case that they can advance our goals.” Continue reading Penguin Random House Warns All Against AI Model Training

Nvidia’s Impressive AI Model Could Compete with Top Brands

By Paula Parisi
October 21, 2024

Nvidia has debuted a new AI model, Llama-3.1-Nemotron-70B-Instruct, that it claims is outperforming competitors GPT-4o from OpenAI and Anthropic’s Claude 3.5 Sonnet. The impressive showing has prompted speculation of an AI shakeup and a significant shift in Nividia’s AI strategy, which has thus far been focused primarily on chipmaking. The model was quietly released on Hugging Face, and Nvidia says as of October 1 it ranked first on three top automatic alignment benchmarks, “edging out strong frontier models” and vaulting Nvidia to the forefront of the LLM field in areas like comprehension, context and generation. Continue reading Nvidia’s Impressive AI Model Could Compete with Top Brands

Pyramid Flow Introduces a New Approach to Generative Video

By Paula Parisi
October 14, 2024

Generative video models seem to be debuting daily. Pyramid Flow, among the latest, aims for realism, producing dynamic video sequences that have temporal consistency and rich detail while being open source and free. The model can create clips of up to 10 seconds using both text and image prompts. It offers a cinematic look, supporting 1280×768 pixel resolution clips at 24 fps. Developed by a consortium of researchers from Peking University, Beijing University and Kuaishou Technology, Pyramid Flow harnesses a new technique that starts with low-resolution video, outputting at full-res only at the end of the process. Continue reading Pyramid Flow Introduces a New Approach to Generative Video

Nvidia Releases Open-Source Frontier-Class Multimodal LLMs

By Paula Parisi
October 4, 2024

Nvidia has unveiled the NVLM 1.0 family of multimodal LLMs, a powerful open-source AI that the company says performs comparably to proprietary systems from OpenAI and Google. Led by NVLM-D-72B, with 72 billion parameters, Nvidia’s new entry in the AI race achieved what the company describes as “state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models.” Nvidia has made the model weights publicly available and says it will also be releasing the training code, a break from the closed approach of OpenAI, Anthropic and Google. Continue reading Nvidia Releases Open-Source Frontier-Class Multimodal LLMs

MIT Spinoff Liquid Eschews GPTs for Its Fluid Approach to AI

By Paula Parisi
October 2, 2024

AI startup Liquid, founded by alums of MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), has released its first models. Called Liquid Foundation Models, or LFMs, the multimodal family approaches “intelligence” differently than the pre-trained transformer models that dominate the field. Instead, the LFMs take a path of “first principles,” which MIT describes as “the same way engineers build engines, cars, and airplanes,” explaining that the models are large neural networks with computational units “steeped in theories of dynamic systems, signal processing and numeric linear algebra.” Continue reading MIT Spinoff Liquid Eschews GPTs for Its Fluid Approach to AI

Allen Institute Announces Vision-Optimized Molmo AI Models

By Paula Parisi
October 1, 2024

The Allen Institute for AI (also known as Ai2, founded by Paul Allen and led by Ali Farhadi) has launched Molmo, a family of four open-source multimodal models. While advanced models “can perceive the world and communicate with us, Molmo goes beyond that to enable one to act in their worlds, unlocking a whole new generation of capabilities, everything from sophisticated web agents to robotics,” according to Ai2. On some third-party benchmark tests, Molmo’s 72 billion parameter model outperforms other open AI offerings and “performs favorably” against proprietary rivals like OpenAI’s GPT-4o, Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, Ai2 says. Continue reading Allen Institute Announces Vision-Optimized Molmo AI Models

Runway Launches $5M AI Film Fund as Open Call to Creators

By Paula Parisi
September 30, 2024

Artificial intelligence platform Runway has launched The Hundred Film Fund to help finance 100 projects that use its AI to tell stories. Created by the company through its Runway Studios, the Fund is starting with $5 million, “with the potential to grow to $10 million.” Runway is presenting the Fund as “an open call to all creative professionals who have AI-augmented film projects in the pre- or post-production phases and are in need of funding.” Directors, producers and screenwriters are among those invited to apply. The program will consider all formats, from features to shorts, documentaries, experimental projects, music videos and more. Continue reading Runway Launches $5M AI Film Fund as Open Call to Creators

Cloudflare Tool Can Prevent AI Bots from Scraping Websites

By Paula Parisi
September 25, 2024

Cloudflare has released AI Audit, a free set of new tools designed to help websites analyze and control how their content is used by artificial intelligence models. Described as “one-click blocking” to prevent unauthorized AI scraping, Cloudflare says it will also make it easier to identify the content bots scan most, so they can wall it off and negotiate payment in exchange for access. Helping its clients toward a sustainable future, Cloudflare is also creating a marketplace for sites to negotiate fees based on AI audits that trace cyber footprints on server files. Continue reading Cloudflare Tool Can Prevent AI Bots from Scraping Websites

AWS Transfers OpenSearch Stewardship to Linux Foundation

By Paula Parisi
September 18, 2024

Amazon is transferring its OpenSearch platform to the Linux Foundation’s new OpenSearch Software Foundation. By handing a third-party the open-source project it has developed internally since 2021, Amazon hopes to accelerate collaboration in data-driven search and analytics, an area of focus due to the proliferation of model training. Not to be confused with commercial search (Google, Bing), engines like OpenSearch are geared toward enterprise and academia. Because it is licensed under Apache 2.0, OpenSearch is a viable starting point for organizations that customize internal platforms for searching, monitoring and analyzing large volumes of data. Continue reading AWS Transfers OpenSearch Stewardship to Linux Foundation

OpenAI Previews New LLMs Capable of Complex Reasoning

By Paula Parisi
September 16, 2024

OpenAI is previewing a new series of AI models that can reason and correct complex coding mistakes, providing a more efficient solution for developers. Powered by OpenAI o1, the new models are “designed to spend more time thinking before they respond, much like a person would,” and as a result can “solve harder problems than previous models in science, coding, and math,” OpenAI claims, noting that “through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.” The first model in the series is being released in preview in OpenAI’s popular ChatGPT and in the company’s API. Continue reading OpenAI Previews New LLMs Capable of Complex Reasoning