By
ETCentric StaffMarch 29, 2024
Databricks, a San Francisco-based company focused on cloud data and artificial intelligence, has released a generative AI model called DBRX that it says sets new standards for performance and efficiency in the open source category. The mixture-of-experts (MoE) architecture contains 132 billion parameters and was pre-trained on 12T tokens of text and code data. Databricks says it provides the open community and enterprises who want to build their own LLMs with capabilities previously limited to closed model APIs. Compared to other open models, Databricks claims it outperforms alternatives including Llama 2-70B and Mixtral on certain benchmarks. Continue reading Databricks DBRX Model Offers High Performance at Low Cost
By
ETCentric StaffMarch 28, 2024
Researchers from the Massachusetts Institute of Technology and Adobe have unveiled a new AI acceleration tool that makes generative apps like DALL-E 3 and Stable Diffusion up to 30x faster by reducing the process to a single step. The new approach, called distribution matching distillation, or DMD, maintains or enhances image quality while greatly streamlining the process. Theoretically, the technique “marries the principles of generative adversarial networks (GANs) with those of diffusion models,” consolidating “the hundred steps of iterative refinement required by current diffusion models” into one step, MIT PhD student and project lead Tianwei Yin says. Continue reading New Tech from MIT, Adobe Advances Generative AI Imaging
By
ETCentric StaffMarch 28, 2024
U.S. recorded music revenue grew 8 percent in 2023, to an estimated record high of $17.1 billion at retail. It was the eighth consecutive year of growth, according to the RIAA, which says streaming continued to be the biggest driver, notching new heights of paid subscriptions, robust growth in ad-supported listening, and healthy increased contributions from new platforms. Streaming accounted for 84 percent of retail revenue, at $14.4 billion including from 96.8 million paid subscriptions. On the supply-side, wholesale revenue grew 7 percent to $11 billion, also a record. Continue reading Streaming Drives U.S. Recorded Music to Record $17 Billion
By
ETCentric StaffMarch 27, 2024
OpenAI’s Sora text- and image-to-video tool isn’t publicly available yet, but the company is showing what it’s capable of by putting it in the hands of seven artists. The results — from a short film about a balloon man to a hybrid flamingo giraffe — are stirring excitement and priming the pump for what OpenAI CTO Mira Murati says will be a 2024 general release. Challenges include making it cheaper to run and enhancing guardrails. Since introducing Sora last month, OpenAI says it’s “been working with visual artists, designers, creative directors and filmmakers to learn how Sora might aid in their creative process.” Continue reading OpenAI Releases Early Demos of Sora Video Generation Tool
By
ETCentric StaffMarch 26, 2024
A 2024 Digital Media Trends study by Deloitte says media and entertainment companies “should be thinking more about the world ahead than the one they’re being forced to leave behind,” a suggestion underscored by the fact that 60 percent of Gen Zs surveyed prefer watching user-generated content on social platforms to programming offered by streaming services “because they don’t have to spend time searching for what to watch.” Both Gen Zs and Millennials also believe they get better recommendations from social media than the commercial platforms (54 percent). Continue reading Gen Z, Millennials Prefer Social Videos to Streaming Services
By
ETCentric StaffMarch 25, 2024
Stability AI has released Stable Video 3D, a generative video model based on the company’s foundation model Stable Video Diffusion. SV3D, as it’s called, comes in two versions. Both can generate and animate multi-view 3D meshes from a single image. The more advanced version also let users set “specified camera paths” for a “filmed” look to the video generation. “By adapting our Stable Video Diffusion image-to-video diffusion model with the addition of camera path conditioning, Stable Video 3D is able to generate multi-view videos of an object,” the company explains. Continue reading Stable Video 3D Generates Orbital Animation from One Image
By
ETCentric StaffMarch 21, 2024
In news rocking the publishing world, two of the largest newspaper chains in the U.S. have drastically downsized their contracts with the Associated Press, eliminating AP journalism from their combined 230 news outlets, including Gannett’s USA Today and McClatchy’s The Miami Herald. Though neither chain disclosed how much the move will save, the AP assesses “it is likely to be in the millions of dollars” for each. Gannett announced it has chosen another newswire partner, Reuters, and says it will continue to subscribe to the AP Stylebook and election results data. AP says its Gannett contract runs through the end of 2024. Continue reading Gannett, McClatchy Cancel Associated Press News Contracts
By
ETCentric StaffMarch 20, 2024
Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell. Continue reading GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs
By
ETCentric StaffMarch 20, 2024
YouTube has added new rules requiring those uploading realistic-looking videos that are “made with altered or synthetic media, including generative AI” to label them using a new tool in Creator Studio. The new labeling “is meant to strengthen transparency with viewers and build trust between creators and their audience,” YouTube says, listing examples of content that require disclosure as “likeness of a realistic person” including voice as well as image, “altering footage of real events or places” and “generating realistic scenes” of fictional major events, “like a tornado moving toward a real town.” Continue reading YouTube Adds GenAI Labeling Requirement for Realistic Video
By
ETCentric StaffMarch 19, 2024
Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.” Continue reading Apple Unveils Progress in Multimodal Large Language Models
By
ETCentric StaffMarch 18, 2024
Robotics firm Figure AI is getting a lot of attention for its humanoid robot, Figure 01, which the company unveiled along with news that it has raised $675 million, for a $2.6 billion valuation, from investors including OpenAI, Nvidia, Microsoft and Amazon founder Jeff Bezos. Pronounced “Figure One,” the general purpose robot looks and moves like a human, and can perform mundane tasks like serving food as well as undesirable jobs like picking up trash. It “sees” using “onboard cameras that feed into a large vision-language model (VLM) trained by OpenAI,” according to Figure co-founder and CEO Brett Adcock. Continue reading Figure Unveils Humanoid Robot, Draws Notable Investments
By
ETCentric StaffMarch 15, 2024
Artificial intelligence imaging service Midjourney has been embraced by storytellers who have also been clamoring for a feature that enables characters to regenerate consistently across new requests. Now Midjourney is delivering that functionality with the addition of the new “–cref” tag (short for Character Reference), available for those who are using Midjourney v6 on the Discord server. Users can achieve the effect by adding the tag to the end of text prompts, followed by a URL that contains the master image subsequent generations should match. Midjourney will then attempt to repeat the particulars of a character’s face, body and clothing characteristics. Continue reading Midjourney Creates a Feature to Advance Image Consistency
By
ETCentric StaffMarch 14, 2024
Perplexity is a year-old AI startup whose conversational “answer engine” has gained attention as a potential challenger to conventional search. Two months ago the venture raised $73.6 million in Series B funding from investors including Nvidia and Amazon founder Jeff Bezos via his Bezos Expeditions, resulting in a valuation of about $520 million. Now the company is said to be finalizing another cash infusion that is predicted to double its valuation to roughly $1 billion. The current financing round is reportedly being led by former Y Combinator partner Daniel Gross through his own investment fund. Continue reading AI Startup Perplexity Targets $1B Valuation with New Funding
By
ETCentric StaffMarch 14, 2024
Months-old startup Cognition AI has emerged from stealth mode with Devin, a generative platform it is calling “the world’s first fully autonomous AI software engineer.” Although Cognition has yet to make Devin widely available, much less allow independent testing, if its claims are true it would mark a turning point in the AI coding space, moving it from a field of AI assistants to a full-fledged AI engineer. Based on natural language instruction, Devin could potentially take a project from concept to execution rather than simply suggesting code snippets or offering barebones frameworks. Continue reading Startup Cognition Launches AI Software Coding Engine Devin
By
ETCentric StaffMarch 12, 2024
Soul Machines debuted a synthetic Marilyn Monroe last week at SXSW. The New Zealand-based company teamed on the Digital Marilyn project with Authentic Brands Group, a New York management firm that represents a host of fashion labels as well as personalities such as Elvis Presley, David Beckham and Muhammad Ali. The result is a sophisticated chatbot that Soul Machines describes as an “interactive experience.” Drawing on biological AI, Soul Machines is packaging a “personalized engagement opportunity” for fans and brands, which could lead to new approaches in advertising and promotions. Continue reading Soul Machines Aims for Photorealistic Marilyn Monroe Chatbot