Deepgram’s Speech Portfolio Now Includes Human-Like Aura

Deepgram’s new Aura software turns text into generative audio with a “human-like voice.” The 9-year-old voice recognition company has raised nearly $86 million to date on the strength of its Voice AI platform. Aura is an extremely low-latency text-to-speech voice AI that can be used for voice AI agents, the company says. Paired with Deepgram’s Nova-2 speech-to-text API, developers can use it to “easily (and quickly) exchange real-time information between humans and LLMs to build responsive, high-throughput AI agents and conversational AI applications,” according to Deepgram.

GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell.

YouTube Adds GenAI Labeling Requirement for Realistic Video

YouTube has added new rules requiring those uploading realistic-looking videos that are “made with altered or synthetic media, including generative AI” to label them using a new tool in Creator Studio. The new labeling “is meant to strengthen transparency with viewers and build trust between creators and their audience,” YouTube says, listing examples of content that require disclosure as “likeness of a realistic person” including voice as well as image, “altering footage of real events or places” and “generating realistic scenes” of fictional major events, “like a tornado moving toward a real town.”

Apple Unveils Progress in Multimodal Large Language Models

Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.”

Grok-1 Architecture Open-Sourced for General Release by xAI

Elon Musk’s xAI has released its Grok chatbot and open-sourced part of the underlying Grok-1 model architecture for any developer or entrepreneur to use for purposes including commercial applications. Musk unveiled Grok in November and announced that it would be publicly released this month. The chatbot itself is available to X social premium members, who can ask the cheeky AI questions and get answers with a snarky attitude inspired by “The Hitchhiker’s Guide to the Galaxy” sci-fi novel. The training for Grok’s foundation LLM is said to include X social posts.

AI Startup Perplexity Targets $1B Valuation with New Funding

Perplexity is a year-old AI startup whose conversational “answer engine” has gained attention as a potential challenger to conventional search. Two months ago the venture raised $73.6 million in Series B funding from investors including Nvidia and Amazon founder Jeff Bezos via his Bezos Expeditions, resulting in a valuation of about $520 million. Now the company is said to be finalizing another cash infusion that is predicted to double its valuation to roughly $1 billion. The current financing round is reportedly being led by former Y Combinator partner Daniel Gross through his own investment fund.

Apple, Google, Microsoft, Mozilla Team on Speedometer 3.0

The Apple WebKit team introduced the initial version of the Speedometer benchmark in 2014. Since then, it has become an industry-wide tool for gauging browser optimization and performance, even as some stakeholders complained that having been developed in the Apple ecosystem, it could not help but exhibit systemic biases that favored Safari. So, Microsoft, Google and Mozilla joined Apple to create Speedometer 3.0, “a new governance benchmark” that aims for neutrality across the architectures used by Google Chrome, Microsoft Edge and Mozilla’s Firefox.

Reddit Hopes to Raise $748M in IPO Aimed at $6.4B Valuation

Reddit is moving ahead with its IPO and plans to raise between $682 million and $748 million on a fully diluted valuation of between $5.8 billion and $6.4 billion. Although no date has been announced, the IPO is expected to take place sometime this month. According to a Securities and Exchange Commission filing Monday, Reddit says it will offer 22 million shares: 15.3 million Class A common shares and 6.7 million insider shares from investors including CEO Steve Huffman and COO Jen Wong. Pricing will be between $31 and $34 per share. The proposed market cap is $4.9 billion to $5.4 billion.

Researchers Call for Safe Harbor for the Evaluation of AI Tools

Artificial intelligence stakeholders are calling for safe harbor legal and technical protections that will allow them access to conduct “good-faith” evaluations of various AI products and services without fear of reprisal. More than 300 researchers, academics, creatives, journalists and legal professionals had as of last week signed an open letter calling on companies including Meta Platforms, OpenAI and Google to allow access for safety testing and red teaming of systems they say are shrouded in opaque rules and secrecy despite the fact that millions of consumers are already using them.

Google Introduces Open-Source Marketing Measurement Tool

Google has rolled out an open-source marketing mix model (MMM) called Meridian that aims to help in formulating cross-channel media strategies in the current environment of fragmented media consumption and privacy changes. As marketers contend with Google’s plan to sunset the use of third-party cookies by the end of this year, MMMs, classic tools of yesteryear, “are experiencing a renaissance,” says the search giant. MMMs are statistical analyses companies use to help measure the impact of cross-channel marketing on sales. Google says it has “observed more customers turning to MMMs, especially performance and full-funnel marketers.”

YouTube, Comscore Integrate Campaign Ratings with Shorts

Comscore and YouTube have expanded their partnership by integrating Comscore Campaign Ratings (CCR) with YouTube Shorts and In-Feed inventory, making available a range of additional ad data specific to those outlets, across connected TV, mobile and desktop. In the months ahead, the toolkit will also add measurement of Masthead inventory. YouTube has been connected to CCR for standard video inventory and YouTube TV since Q4 2021. YouTube Shorts is a fast-growing part of the Google-owned video ecosystem, averaging over 70 billion daily views, according to YouTube and Comscore.

Anthropic’s Claude 3 AI Is Said to Have ‘Near-Human’ Abilities

Anthropic has released Claude 3, claiming new industry benchmarks that see the family of three new large language models approaching “near-human” cognitive capability in some instances. Accessible via Anthropic’s website, the three new models (Claude 3 Haiku, Claude 3 Sonnet and Claude 3 Opus) represent successively increased complexity and parameter count. Sonnet is powering the current Claude.ai chatbot and is free, for now, requiring only an email sign-in. Opus comes with the $20 monthly subscription for Claude Pro. Both are generally available from the Anthropic website and via API in 159 countries, with Haiku coming soon.

Apple Fined $1.95 Billion by EU for Music Streaming Antitrust

Apple has been fined $1.95 billion by the European Union after the bloc’s executive body, the European Commission, found the iPhone maker in violation of antitrust law by using its App Store market dominance to stifle music streaming competition. The EC found that Apple suppressed the ability of app developers to communicate with iOS users about alternative music subscription services available outside the App Store. The fine stems from a 2019 complaint from Spotify that triggered an investigation into Apple. Spotify hailed the result as a win for consumers and “an important moment in the fight for a more open Internet,” while Apple has vowed to appeal.

Lightricks LTX Studio Is a Text-to-Video Filmmaking Platform

Lightricks, the company behind apps including Facetune, Photoleap and Videoleap, has come up with a text-to-video tool called LTX Studio that is being positioned as a turnkey AI tool for filmmakers and other creators. “From concept to creation,” the new app aims to enable “the transformation of a single idea into a cohesive, AI-generated video.” The web-based tool is currently waitlisted, and Lightricks says it will make it available to the public for free, at least initially, beginning in April, allowing users to “direct each scene down to specific camera angles with specialized AI.”

Adobe’s Prototype AI Tool Is a ‘Photoshop for Music-Making’

Project Music GenAI Control, an experimental work from Adobe Research, is setting out to change how people create and edit custom audio and music. The prototype tool lets creators generate music from text prompts, “and then have fine-grained control to edit that audio for their precise needs,” according to Adobe. Designed to help create music for broadcasts, podcasts or other “audio that’s just the right mood, tone, and length,” it can generate music from text prompts like “powerful rock,” “happy dance” or “sad jazz,” says Adobe Research Senior Research Scientist Nicholas Bryan, a creator of the technology.