By
Paula ParisiApril 28, 2025
Nvidia has released NeMo microservices into general availability with version 25.4, pivoting its profile from a modular toolkit for creating custom generative AI models to emphasizing it as a platform for building AI agents at scale. As AI agents have become an in-demand commodity, Nvidia is leveraging the fact that NeMo’s capabilities seem purpose built to help them grow and thrive. Built around the Kubernetes open-source container management system, NeMo microservices are offered as “an end-to-end developer platform for creating state-of-the-art agentic AI systems,” according to Nvidia. Continue reading Nvidia Positions Its NeMo Microservices for AI Agent-Building
By
Paula ParisiApril 18, 2025
Adobe has taken a stake in business avatar firm Synthesia, which creates clones for corporate videos using generative AI. The investment of an undisclosed sum through Adobe Ventures was interpreted by one media outlet as a bet that the UK startup’s technology “will transform video production.” Adobe couched the move as a strategic alliance. The investment became public along with Synthesia’s announcement that it surpassed the $100 million mark for what the privately held company says qualifies as recurring annual revenue. Nvidia is also an investor. Continue reading Adobe Investment in Synthesia Could Fuel AI Video Production
By
Paula ParisiApril 18, 2025
Agentic AI company Moveworks has opened an AI Agent Marketplace that launches with more than 100 pre-built agents, enabling users to discover, install, and deploy AI assistants that automate business processes. Agentic AI is booming, as businesses seek to offload tasks from human workers to software. To support that, new companies and existing ones have started providing pre-built agents that are more convenient than building them from scratch. “What once took weeks to build can now be installed and deployed in mere minutes,” Moveworks says, touting its library offerings. Continue reading Moveworks Joins Competition in Offering Enterprise AI Agents
By
Paula ParisiApril 17, 2025
Anthropic has upgraded its AI assistant Claude, adding Research, an autonomous capability that integrates with Google Workspace. Claude can now search and reference content in Google Docs as well as communications in Gmail and events in Calendar. “With Research, Claude can search across both your internal work context and the web to help you make decisions and take action faster than before,” Anthropic explains, turning the model into a “true virtual collaborator” for enterprise clients. The expansion puts Anthropic into more direct competition with OpenAI and Microsoft as well as Google with Gemini in the AI productivity space. Continue reading Anthropic Adds Deep Research, Google Integration to Claude
By
Paula ParisiApril 17, 2025
As enterprises rely more heavily on AI integration to compile research and summarize things like meetings and email threads, the need for contextual search has become increasingly important. AI startup Cohere has released Embed 4 to make the task easier. Embed 4 is a multimodal embedding model that transforms text, images and mixed data (like PDFs, slides or tables) into numerical representations (or “embeddings”) for tasks including semantic search, retrieval-augmented generation (RAG) and classification. Supporting over 100 languages, Embed 4 has an extremely large context window of up to 128,000 tokens. Continue reading Cohere’s Multimodal Embed Model Organizes Enterprise Data
By
Paula ParisiApril 14, 2025
Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score
By
Paula ParisiApril 11, 2025
Google’s Gemini coding assistant has gained agentic capabilities, available as part of Gemini in Android Studio, a subscription service for businesses designed to make app development for the Android ecosystem easier and more secure. This agent-centric “AI-powered cloud for developers and operators” is designed to infuse AI into all stages of application development, laying the groundwork for more rapid software creation cycles. The service is available to those who subscribe to Gemini Code Assist Standard or Enterprise editions. The new offering was unveiled at the Google Cloud Next 2025 developer conference in Las Vegas. Continue reading Google Pushes Gemini in Android Studio for App Developers
By
Paula ParisiApril 10, 2025
Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model
By
Paula ParisiApril 8, 2025
Sentient, a year-old non-profit backed by Peter Thiel’s Founders Fund, has released Open Deep Search (ODS), an open-source framework that leverages existing LLMs to enhance search and reasoning capabilities. Essentially a system of custom plugins and tools, ODS works with DeepSeek’s open-source R1 model as well as proprietary systems like OpenAI’s GPT-4o and Anthropic’s Claude to deliver advanced search functionality. That modular aspect is in fact ODS’s main innovation, its creators say, claiming it beats Perplexity and OpenAI’s GPT-4o Search Preview on benchmarks for accuracy and transparency. Continue reading Non-Profit Sentient Launches New ‘Open Deep Search’ Model
By
Paula ParisiApril 2, 2025
Runway has introduced a new video generation model, launching a next phase of competition that could transform film production. Notably, its Gen-4 system improves the consistency of characters, locations and objects across multiple scenes, an elusive prospect for most AI video generators. The New York-based startup calls its new development “a step towards Universal Generative Models that understand the world.” The key, Runway says, is to provide a single reference image of the character, item or environment as part of the model’s project material. Runway Gen-4 can generate 5- and 10-second clips at 720p resolution. Continue reading Runway Gen-4 Tackles AI’s Elusive Video Scene Consistency
By
Paula ParisiMarch 19, 2025
Google is expanding its AI presence in the UK market, hosting a splashy launch event there for Agentspace. Google in December launched Agentspace, an AI agent hub that makes it easy for enterprises to build, manage and deploy custom agents using Gemini. The gathering was hosted by Google DeepMind CEO Demis Hassabis, and Google Cloud CEO Thomas Kurian and included participation by local customers BT Group and advertising powerhouse WPP. Google invited UK businesses to store cloud data locally using its $1 billion data center, opening there this year. The company also promoted its new Chirp 3 audio generator, which offers HD voice synthesis. Continue reading Google Launches Agentspace in the UK and Promotes Chirp 3
By
Paula ParisiMarch 18, 2025
Baidu has launched two new AI systems, the native multimodal foundation model Ernie 4.5 and deep-thinking reasoning model Ernie X1. The latter supports features like generative imaging, advanced search and webpage content comprehension. Baidu is touting Ernie X1 as of comparable performance to another Chinese model, DeepSeek-R1, but says it is half the price. Both Baidu models are available to the public, including individual users, through the Ernie website. Baidu, the dominant search engine in China, says its new models mark a milestone in both reasoning and multimodal AI, “offering advanced capabilities at a more accessible price point.” Continue reading Baidu Releases New LLMs that Undercut Competition’s Price
By
Paula ParisiMarch 17, 2025
Cerebras Systems was founded 10 years ago on the belief that there would be a shortage of processors powerful enough to drive enterprise AI computing at scale. Its solution, the Cerebras Wafer-Scale Engine, is integrated into Cerebras’ CS-3 systems, which will power six new data centers launching this year that the company says will make it “the world’s number one provider of high-speed inference and the largest domestic high speed inference cloud.” Cerebras notes the new facilities will collectively serve over 40 million Llama 70B tokens per second to clients that now include Hugging Face and financial intelligence firm AlphaSense. Continue reading Cerebras Is Moving into Mainstream with New AI Data Centers
By
ETCentric StaffMarch 11, 2025
CES 2025 welcomed over 141,000 attendees from around the globe to Las Vegas. With more than 4,500 exhibitors, including 1,400 startups, and more than 6,000 media attendees, CES highlights the innovation and technology trends addressing global challenges and shaping the future. This year’s show focused on artificial intelligence, unveiling a wave of innovative offerings — whether practical, visionary or experimental. Among the show’s major trends were AI integration across all industries, shifting demographics and purchasing patterns (with Gen Z the one to watch), sustainability and security, and smart devices and smarter homes. ETC@USC attended the conference for live reporting on products and services. Our post-show report features extensive coverage and perspectives related to key creative, business, and technology areas. Continue reading ETC’s CES 2025 Report: Focus on AI Innovation & Integration
By
Paula ParisiMarch 10, 2025
Alibaba is making AI news again, releasing another Qwen reasoning model, QwQ-32B, which was trained and scaled using reinforcement learning (RL). The Qwen team says it “has the potential to enhance model performance beyond conventional pretraining and post-training methods.” QwQ-32B, a 32 billion parameter model, “achieves performance comparable to DeepSeek-R1, which boasts 671 billion parameters (with 37 billion activated),” Alibaba claims. While parameters refer to the total set of adjustable weights and biases in the model’s neural network, “activated” parameters are a subset used for a specific inference task, like generating a response. Continue reading Alibaba Says Qwen Reasoning Model on Par with DeepSeek