Deepgram’s Speech Portfolio Now Includes Human-Like Aura

Deepgram’s new Aura software turns text into generative audio with a “human-like voice.” The 9-year-old voice recognition company has raised nearly $86 million to date on the strength of its Voice AI platform. Aura is an extremely low-latency text-to-speech voice AI that can be used for voice AI agents, the company says. Paired with Deepgram’s Nova-2 speech-to-text API, developers can use it to “easily (and quickly) exchange real-time information between humans and LLMs to build responsive, high-throughput AI agents and conversational AI applications,” according to Deepgram.

Gannett, McClatchy Cancel Associated Press News Contracts

In news rocking the publishing world, two of the largest newspaper chains in the U.S. have drastically downsized their contracts with the Associated Press, eliminating AP journalism from their combined 230 news outlets, including Gannett’s USA Today and McClatchy’s The Miami Herald. Though neither chain disclosed how much the move will save, the AP assesses “it is likely to be in the millions of dollars” for each. Gannett announced it has chosen another newswire partner, Reuters, and says it will continue to subscribe to the AP Stylebook and election results data. AP says its Gannett contract runs through the end of 2024.

GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose-built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose-built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell.

YouTube Adds GenAI Labeling Requirement for Realistic Video

YouTube has added new rules requiring those uploading realistic-looking videos that are “made with altered or synthetic media, including generative AI” to label them using a new tool in Creator Studio. The new labeling “is meant to strengthen transparency with viewers and build trust between creators and their audience,” YouTube says, listing examples of content that require disclosure as “likeness of a realistic person” including voice as well as image, “altering footage of real events or places” and “generating realistic scenes” of fictional major events, “like a tornado moving toward a real town.”

Apple Unveils Progress in Multimodal Large Language Models

Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.”

Grok-1 Architecture Open-Sourced for General Release by xAI

Elon Musk’s xAI has released its Grok chatbot and open-sourced part of the underlying Grok-1 model architecture for any developer or entrepreneur to use for purposes including commercial applications. Musk unveiled Grok in November and announced that it would be publicly released this month. The chatbot itself is available to X social premium members, who can ask the cheeky AI questions and get answers with a snarky attitude inspired by “The Hitchhiker’s Guide to the Galaxy” sci-fi novel. The training for Grok’s foundation LLM is said to include X social posts.

Figure Unveils Humanoid Robot, Draws Notable Investments

Robotics firm Figure AI is getting a lot of attention for its humanoid robot, Figure 01, which the company unveiled along with news that it has raised $675 million, for a $2.6 billion valuation, from investors including OpenAI, Nvidia, Microsoft and Amazon founder Jeff Bezos. Pronounced “Figure One,” the general-purpose robot looks and moves like a human, and can perform mundane tasks like serving food as well as undesirable jobs like picking up trash. It “sees” using “onboard cameras that feed into a large vision-language model (VLM) trained by OpenAI,” according to Figure co-founder and CEO Brett Adcock.

EU Lawmakers Pass AI Act, World’s First Major AI Regulation

The European Union has passed the Artificial Intelligence Act, becoming the first global entity to pass a comprehensive law regulating AI’s development and use. Member states agreed on the framework in December 2023, and it was adopted Wednesday by the European Parliament with 523 votes in favor, 46 against and 49 abstentions. The legislation establishes what are being called “sweeping rules” for those building AI as well as those who deploy it. The rules, which will take effect gradually, implement new risk assessments, ban AI uses deemed “high risk,” and mandate transparency requirements.

Midjourney Creates a Feature to Advance Image Consistency

Artificial intelligence imaging service Midjourney has been embraced by storytellers who have also been clamoring for a feature that enables characters to regenerate consistently across new requests. Now Midjourney is delivering that functionality with the addition of the new “--cref” tag (short for Character Reference), available for those who are using Midjourney v6 on the Discord server. Users can achieve the effect by adding the tag to the end of text prompts, followed by a URL that contains the master image subsequent generations should match. Midjourney will then attempt to repeat the particulars of a character’s face, body and clothing characteristics.

AI Startup Perplexity Targets $1B Valuation with New Funding

Perplexity is a year-old AI startup whose conversational “answer engine” has gained attention as a potential challenger to conventional search. Two months ago the venture raised $73.6 million in Series B funding from investors including Nvidia and Amazon founder Jeff Bezos via his Bezos Expeditions, resulting in a valuation of about $520 million. Now the company is said to be finalizing another cash infusion that is predicted to double its valuation to roughly $1 billion. The current financing round is reportedly being led by former Y Combinator partner Daniel Gross through his own investment fund.

Startup Cognition Launches AI Software Coding Engine Devin

Months-old startup Cognition AI has emerged from stealth mode with Devin, a generative platform it is calling “the world’s first fully autonomous AI software engineer.” Although Cognition has yet to make Devin widely available, much less allow independent testing, if its claims are true it would mark a turning point in the AI coding space, moving it from a field of AI assistants to a full-fledged AI engineer. Based on natural language instruction, Devin could potentially take a project from concept to execution rather than simply suggesting code snippets or offering barebones frameworks.

House Passes Bill That Could Remove TikTok from App Stores

The House of Representatives voted 352 to 65 today to pass a bill that could lead to a nationwide ban of popular video-sharing app TikTok, owned by China’s ByteDance and currently used by 170 million Americans. The bill, introduced out of concern for national security, would prohibit TikTok from app stores in the U.S. unless it is spun off from ByteDance. It is not clear how the Senate will respond to the proposed legislation, which was advanced unanimously by the House Energy and Commerce Committee (50-0) and which President Biden has indicated he would sign. Meanwhile, China’s foreign ministry has called the measure an “act of bullying.”

Reddit Hopes to Raise $748M in IPO Aimed at $6.4B Valuation

Reddit is moving ahead with its IPO and plans to raise between $682 million and $748 million on a fully diluted valuation of between $5.8 billion and $6.4 billion. Although no date has been announced, the IPO is expected to take place sometime this month. According to a Securities and Exchange Commission filing Monday, Reddit says it will offer 22 million shares, comprising 15.3 million Class A common shares and 6.7 million insider shares from investors including CEO Steve Huffman and COO Jen Wong. Pricing will be between $31 and $34 per share. The proposed market cap is $4.9 billion to $5.4 billion.

Soul Machines Aims for Photorealistic Marilyn Monroe Chatbot

Soul Machines debuted a synthetic Marilyn Monroe last week at SXSW. The New Zealand-based company teamed on the Digital Marilyn project with Authentic Brands Group, a New York management firm that represents a host of fashion labels as well as personalities such as Elvis Presley, David Beckham and Muhammad Ali. The result is a sophisticated chatbot that Soul Machines describes as an “interactive experience.” Drawing on biological AI, Soul Machines is packaging a “personalized engagement opportunity” for fans and brands, which could lead to new approaches in advertising and promotions.

Researchers Call for Safe Harbor for the Evaluation of AI Tools

Artificial intelligence stakeholders are calling for safe harbor legal and technical protections that will allow them access to conduct “good-faith” evaluations of various AI products and services without fear of reprisal. More than 300 researchers, academics, creatives, journalists and legal professionals had as of last week signed an open letter calling on companies including Meta Platforms, OpenAI and Google to allow access for safety testing and red teaming of systems they say are shrouded in opaque rules and secrecy despite the fact that millions of consumers are already using them.