Runway Makes Next Advance in Consumer Text-to-Video AI

Google-backed AI startup Runway has released Gen-2, an early entry among commercially available text-to-video models. Previously waitlisted in limited release, the commercial availability is impactful, since text-to-video is predicted as the next big bump in artificial intelligence, following the explosion of AI use generating text and images. While Runway’s solution may not be ready to serve as a professional video tool, this is the next step in development of tech expected to impact media and entertainment. Filmmaker Joe Russo recently predicted that within the next two years, AI may have the ability to create feature films. Continue reading Runway Makes Next Advance in Consumer Text-to-Video AI

Meta Testing Decentralized Instagram App as Rival to Twitter

Details are emerging about the text-based Twitter competitor being developed by Meta Platforms. What is being referred to internally as “Instagram’s new text-based app for conversations” will offer a feed with text posts of up to 500 characters that are capable of attaching links, photos, and videos. The move comes as alternatives including Bluesky, Cohost, Hive, Mastodon and Substack try to gain market share by luring disaffected Twitter users to their platforms. Instagram’s entry in progress — codenamed “P92,” and alternately referred to as “Barcelona” — may soon be interoperable with all of them. Continue reading Meta Testing Decentralized Instagram App as Rival to Twitter

Meta’s Open-Source ImageBind Works Across Six Modalities

Meta Platforms has built and is open-sourcing ImageBind, an artificial intelligence that combines six modalities: audio, visual, text, thermal, movement and depth data. Currently a research project, it suggests a future in which AI models generate multisensory content. “ImageBind equips machines with a holistic understanding that connects objects in a photo with how they will sound, their 3D shape, how warm or cold they are, and how they move,” Meta says. In other words, ImageBind’s approach more closely approximates human thinking by training on the relationship between things rather than ingesting massive datasets so as absorb every possibility. Continue reading Meta’s Open-Source ImageBind Works Across Six Modalities

Microsoft’s Next Generation of Bing AI Interacts with Images

Microsoft’s AI-powered Bing search engine has been drawing in excess of 100 million daily active users and logged half a billion chats. With OpenAI’s GPT-4 and DALL-E 2 models driving the action, it has also created over 200 million images since debuting in limited preview in February. Seeking to build on that momentum, Microsoft is adding new features and integrating Bing more tightly with its Edge browser. The company is also ditching its waitlist in a move to open preview. “We’re underway with the transformation of search,” CVP and consumer CMO Yusuf Mehdi said at a preview event last week. Continue reading Microsoft’s Next Generation of Bing AI Interacts with Images

Anthropic Takes Claude Chatbot Public After Months of Tests

After several months of testing, Anthropic is making its AI chatbot Claude available for general release in two configurations: the high-performace Claude and a lighter, cheaper, faster option called Claude Instant. Anthropic was launched in 2021 by a pair of former OpenAI employees, and its Claude chatbots are competitors to that firm’s ChatGPT. Accessible through a chat interface and API in Anthropic’s developer console, Claude is being marketed as the product of training designed to produce a more “helpful, honest, and harmless AI systems.” To that end, Anthropic says “Claude is much less likely to produce harmful outputs.” Continue reading Anthropic Takes Claude Chatbot Public After Months of Tests

OpenAI Announces Official Launch of GPT-4 Multimodal Tech

OpenAI has released GPT-4, which it says is a more powerful and reliable version of the artificial intelligence technology powering its viral ChatGPT chatbot. GPT-4 can analyze images and handle larger blocks of text and is generally “more creative and collaborative” than earlier iterations when it comes to things like composing songs, writing screenplays and mimicking a user’s authorial style. “GPT-4 can solve difficult problems with greater accuracy, thanks to its broader general knowledge and problem-solving abilities,” OpenAI says. GPT-4 is already driving the chatbot technology behind Microsoft’s Bing AI search engine, now in beta. Continue reading OpenAI Announces Official Launch of GPT-4 Multimodal Tech

Google’s PaLM API, MakerSuite Coming to Select Developers

Google is readying an API and other enterprise tools for its Pathways Language Model (PaLM) — a large language model similar to GPT — to encourage developers to create chatbots and other apps using the platform. PaLM is one of Google’s most advanced systems, with the capability to generate text, images, code, video and audio from natural language prompts. Much like OpenAI’s GTP series and the LLaMA family from Meta Platforms, it is suitable for a wide variety of general tasks. To facilitate PaLM’s use for specific tasks, Google is launching the MakerSuite along with the PaLM API. Continue reading Google’s PaLM API, MakerSuite Coming to Select Developers

Discord Integrates OpenAI Tech, Updates AI-Driven Features

Chat app Discord is expanding the use of artificial intelligence on its platform, including the addition of OpenAI technology to its chatbot and moderation features. Discord says it has 150 million users across 19 million interest groups, called “servers,” that dialogue using text, audio and video chat. Discord’s Midjourney text-to-image generation group is its largest community, with in excess of 13 million members. “Harnessed properly, AI can fundamentally enhance and empower genuine human connection,” Discord CEO Jason Citron said at a press event last week, heralding “the most exciting moments in technology emerging.” Continue reading Discord Integrates OpenAI Tech, Updates AI-Driven Features

Reddit Tests Split-View Text and Video Feeds, Other Updates

Reddit is introducing changes designed to make it easier for users to browse and navigate its communities. Currently testing is a concept that separates text and video into separate streams, dubbed “Read” and “Watch.” Users can toggle between the split-view feeds. In the current format, both “Read” and “Watch” will include recommendations as well as posts that users subscribe to. “In 2023, the product and design improvements you’ll see from us will simplify and streamline how people discover, join, and contribute (post, vote, comment) to communities and bring new ways to engage in conversations and content,” Reddit explains. Continue reading Reddit Tests Split-View Text and Video Feeds, Other Updates

Microsoft Unveils AI Model That Comprehends Image Content

Microsoft researchers have unveiled Kosmos-1, a new AI model the company says analyzes images for content, performs visual text recognition, solves visual puzzles and passes visual IQ tests. It also understands natural language instructions. The new model is what’s known as multimodal AI, which means it uses different instruction sets, from text to audio and video. Mixing media is a key step in building artificial general intelligence (AGI) that can perform tasks in a manner approximating human performance. Examples from a Kosmos-1 research paper show it can effectively analyze images, answering questions about them. Continue reading Microsoft Unveils AI Model That Comprehends Image Content

Meta Toolformer Sidesteps AI Language Limits with API Calls

With language models like ChatGPT dominating recent tech news, Meta Platforms has unveiled a new artificial intelligence platform of its own called Toolformer that breaks new ground in that it can teach itself to use external apps and APIs. The result, Meta says, is that Toolformer combines the conversational aptitude and other things large language models are good at while shoring up those areas in which it typically does not excel — like math and fact-checking — by figuring out how to use external tools like  search engines, calculators and calendars. Continue reading Meta Toolformer Sidesteps AI Language Limits with API Calls

GlossAi Content Propagation App Raises $8M in Seed Round

GlossAi can turn full-length videos — or even whole libraries of video and podcast content —into an array of short clips and posts suitable for dissemination across a wide swathe of outlets. The Israel-based firm has raised $8 million in a seed round as it enters an emerging market in which Adobe and AI startup QuickVid are already playing, but no single app has definitely taken hold. GlossAi has the ability to take a video and automatically generate not only a highlight reel, but also things like 15-second snippets, blog posts (from a transcript), slide decks and more. Continue reading GlossAi Content Propagation App Raises $8M in Seed Round

Alphabet Reveals Major AI Push, Plans to Take On ChatGPT

Alphabet is touting artificial intelligence advances as it faces disappointing Q4 earnings, with CEO Sundar Pichai, who is also CEO of Google, telling analysts the company will soon share its own generative AI system with the public, competing head-on with OpenAI’s ChatGPT and DALL-E. “In the coming weeks and months, we’ll make these language models available, starting with LaMDA, so that people can engage directly with them,” Pichai said. Google’s parent company reported a 3.6 percent decline in core ad revenue, at $59 billion in Q4, while overall revenue was up 1 percent to $76 billion. Continue reading Alphabet Reveals Major AI Push, Plans to Take On ChatGPT

ChatGPT, the Fastest Growing App, Intros Subscription Plan

OpenAI is piloting a $20 per month subscription plan called ChatGPT Plus for its text-generating chatbot. The paid plan offers benefits over the free version that include faster response times, access to ChatGPT even during peak periods and early access to new features. OpenAI will soon begin inviting U.S. customers to subscribe and said it plans to offer the Plus plan in more territories. Since debuting ChatGPT, the company has received feedback from “millions of people” using the viral to draft prose, edit content, brainstorm ideas, educate and assist with programming. Continue reading ChatGPT, the Fastest Growing App, Intros Subscription Plan

Instagram Creators Launch Artifact, Called a ‘TikTok for Text’

Instagram co-founders Kevin Systrom and Mike Krieger are back with a personalized news feed called Artifact that that uses artificial intelligence to pattern users’ interests and the friends that most likely want to discuss them with you. The new app — whose name combines articles, facts and artificial intelligence — opened a public waiting list this week and is available on iOS and Android. The Verge calls it “TikTok for text,” adding that “you might also call it Google Reader reborn as a mobile app or maybe even a surprise attack on Twitter.” Continue reading Instagram Creators Launch Artifact, Called a ‘TikTok for Text’