By
Paula ParisiOctober 3, 2024
OpenAI unveiled major updates at its DevDay conference with the focus largely on making AI more accessible, efficient and affordable. Included were four innovations: Vision Fine-Tuning in the API, Model Distillation, Prompt Caching and the public beta of Realtime API. The approach underscores OpenAI’s effort to empower its developer ecosystem even as it continues to compete for end-users in the enterprise space. The Realtime API gives developers the option of building “nearly real-time” speech-to-speech app experiences, selecting from among six OpenAI voices. Vision Fine-Tuning for GPT-4o enables customization of the model’s visual understanding of images and text. Continue reading OpenAI Showcases Latest Updates for Voice, Picture and More
By
Paula ParisiOctober 2, 2024
AI startup Liquid, founded by alums of MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), has released its first models. Called Liquid Foundation Models, or LFMs, the multimodal family approaches “intelligence” differently than the pre-trained transformer models that dominate the field. Instead, the LFMs take a path of “first principles,” which MIT describes as “the same way engineers build engines, cars, and airplanes,” explaining that the models are large neural networks with computational units “steeped in theories of dynamic systems, signal processing and numeric linear algebra.” Continue reading MIT Spinoff Liquid Eschews GPTs for Its Fluid Approach to AI
By
Paula ParisiOctober 1, 2024
The Allen Institute for AI (also known as Ai2, founded by Paul Allen and led by Ali Farhadi) has launched Molmo, a family of four open-source multimodal models. While advanced models “can perceive the world and communicate with us, Molmo goes beyond that to enable one to act in their worlds, unlocking a whole new generation of capabilities, everything from sophisticated web agents to robotics,” according to Ai2. On some third-party benchmark tests, Molmo’s 72 billion parameter model outperforms other open AI offerings and “performs favorably” against proprietary rivals like OpenAI’s GPT-4o, Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, Ai2 says. Continue reading Allen Institute Announces Vision-Optimized Molmo AI Models
By
Paula ParisiSeptember 30, 2024
Visual effects studio Digital Domain has brought its Autonomous Virtual Human project to Amazon Web Services, which will provide generative AI and machine learning tools and provide Digital Domain’s creations and processes a home in the global cloud. The collaboration “aims to propel the evolution and global reach of Digital Domain’s AVH technology and expand its use for multiple industries, including entertainment, gaming, healthcare, hospitality, and commercial applications,” Amazon said in a statement that emphasizes “AWS cloud services, particularly Amazon Bedrock,” as providing the infrastructure and adaptability “to drive AVH’s growth.” Continue reading Digital Domain Leverages AWS for Its Virtual Human Initiative
By
Paula ParisiSeptember 30, 2024
The Tor Project has merged operation with Tails, a Linux-based portable operating system that uses Tor to protect users from digital surveillance. Tor, a global non-profit that develops tools for online privacy and anonymity, will incorporate Tails into its structure for simpler collaboration, “better sustainability, reduced overhead, and expanded training and outreach programs to counter a larger number of digital threats,” according to the Tor Project. The move comes as regulatory forces heighten efforts to break end-to-end encryption. Tor emphasizes the alliance will “strengthen both organizations’ ability to protect people worldwide from surveillance and censorship.” Continue reading Privacy-Focused Tor Platform Absorbs Linux-Based Tails OS
By
Paula ParisiSeptember 27, 2024
Meta’s Llama 3.2 release includes two new multimodal LLMs, one with 11 billion parameters and one with 90 billion — considered small- and medium-sized — and two lightweight, text-only models (1B and 3B) that fit onto edge and mobile devices. Included are pre-trained and instruction-tuned versions. In addition to text, the multimodal models can interpret images, supporting apps that require visual understanding. Meta says the models are free and open source. Alongside them, the company is releasing “the first official Llama Stack distributions,” enabling “turnkey deployment” with integrated safety. Continue reading Meta Unveils New Open-Source Multimodal Model Llama 3.2
By
Paula ParisiSeptember 26, 2024
Microsoft has released a suite of “Trustworthy AI” features that address concerns about AI security and reliability. The four new capabilities include Correction, a content detection upgrade in Microsoft Azure that “helps fix hallucination issues in real time before users see them.” Embedded Content Safety allows customers to embed Azure AI Content Safety on devices where cloud connectivity is intermittent or unavailable, while two new filters flag AI output of protected material. Additionally, a transparency safeguard providing the company’s AI assistant, Microsoft 365 Copilot, with specific “web search query citations” is coming soon. Continue reading New Microsoft Safety Tools Fix AI Flubs, Detect Proprietary IP
By
Paula ParisiSeptember 25, 2024
Cloudflare has released AI Audit, a free set of new tools designed to help websites analyze and control how their content is used by artificial intelligence models. Described as “one-click blocking” to prevent unauthorized AI scraping, Cloudflare says it will also make it easier to identify the content bots scan most, so they can wall it off and negotiate payment in exchange for access. Helping its clients toward a sustainable future, Cloudflare is also creating a marketplace for sites to negotiate fees based on AI audits that trace cyber footprints on server files. Continue reading Cloudflare Tool Can Prevent AI Bots from Scraping Websites
By
Paula ParisiSeptember 25, 2024
Google announced that the latest update to Password Manager now enables users to sync their passkeys across multiple devices. Previously, Google passkeys could only be easily saved to Password Manager on Android, limiting cross-device utility. Scanning a QR code on an Android device was previously required to use passkeys on non-native platforms. The update makes it possible to use Google Password Manager on desktop systems that run Windows, macOS and Linux. ChromeOS is currently being beta tested and Google says iOS support is “coming soon.” Continue reading Google Debuts Secure Passkey Sync Feature Across Devices
By
Paula ParisiSeptember 24, 2024
Amazon has joined the ranks of firms offering generative video tools, although its release is aimed only at advertisers, at least for now. Simply called Video Generator, it can turn a product image into a video that showcases the product and even demonstrates its features, “leveraging Amazon’s unique insights to vividly bring a product story to life.” At the company’s Accelerate 2024 conference Amazon also debuted Live Image, which lets brands create animated GIFs from stills, a customizable chatbot assistant for third-party sellers, and a new AI-powered recommendation engine based on customer interests. Continue reading Amazon’s Video Generator Turns Stills into Advertising Clips
By
Paula ParisiSeptember 20, 2024
YouTube is going all in on generative AI with nine new generative features announced at the Made on YouTube creator event in New York. Google DeepMind’s AI video generation model, Veo, is coming to YouTube Shorts later this year, enabling “even more incredible video backgrounds, breathing life into concepts that were once impossible to visualize,” as well as six-second standalone AI segments that can be incorporated into short videos. “Imagine a BookTuber stepping into the pages of the classic novel ‘The Secret Garden,’” suggests YouTube Chief Product Officer Johanna Voolich in describing the new AI-powered features. Continue reading YouTube Unveils New AI-Powered Features at Creator Event
By
Paula ParisiSeptember 19, 2024
GoPro has announced two new cameras, the $399 Hero13 Black with swappable lenses, and its smallest 4K camera ever, the $199 Hero. The high-end Hero13 Black boasts better battery performance and four interchangeable Hero Black-series lens modules with automatic adjustments for settings. A 13x Burst Slo-Mo feature captures up to 400 frames per second at 720p, with options for 5.3K at 120 frames per second or 900p at 360 fps. Improved Wi-Fi 6 uploads at up to 40 percent faster transfer speeds and enhanced audio and voice settings are among the upgrades. Continue reading GoPro’s Hero13 Black Earns Adds New Lens Mount and HLG HDR
By
Paula ParisiSeptember 19, 2024
Snap is rolling out its fifth generation of Spectacles — standalone AR glasses that enable use of Lenses to “experience the world together with friends.” The firm is also launching a Spectacles Developer Program, and at a rental fee of $99 per month, that’s who the devices are aimed at, for now. Spectacles are powered by Snap OS, optimized to leverage people’s natural responses to interacting with their environment. They work seamlessly with mobile devices, turning smartphones into custom game controllers with Lenses. There’s even a Spectator Mode, “so friends without Spectacles can follow along, mirror your phone screen, and more.” Continue reading Snap Targets Developers with $99 per Month AR Spectacles
By
Paula ParisiSeptember 17, 2024
Dolby Labs has introduced cloud-based solutions to support clients with real-time, interactive streaming capabilities. The announcement, made from IBC 2024 in Amsterdam, follows Dolby’s July acquisition of streaming tools provider THEO Technologies, which services top sports, media and entertainment companies worldwide. Dolby and THEO promise streaming that is “more interactive, personalized, and delivered with extremely low latency.” Dolby will also offer a new capability, THEOads, providing an advertising environment “that is optimized for the dynamic nature of live content.” Continue reading Dolby to Expand Its Cloud-Based Live Streaming with THEO
By
Paula ParisiSeptember 17, 2024
Blackmagic Design announced that its new URSA Cine 17K 65 camera is available for orders from resellers worldwide, starting at $29,995. The cinema camera, which includes a massive 65mm RGBW 17,520 x 8,040 sensor with larger photo-sites for 16 stops of dynamic range, was previewed in April at NAB. Its features include interchangeable PL, LPL and Hasselblad lens mounts and industry standard Lemo and Fischer connections. The base model comes with 8TB of internal storage and also has high-speed networking built-in for media uploads and syncing to Blackmagic Cloud. Continue reading Blackmagic URSA Cine 17K Camera Priced Starting at $30K