Google Offers Public Preview of Gemini Pro for Cloud Clients

Google is moving its most powerful artificial intelligence model, Gemini 1.5 Pro, into public preview for developers and Google Cloud customers. Gemini 1.5 Pro includes what Google claims is a breakthrough in long context understanding, with the ability to run 1 million tokens of information “opening up new possibilities for enterprises to create, discover and build using AI.” Gemini’s multimodal capabilities allow it to process audio, video, text, code and more, which when combined with long context, “enables enterprises to do things that just weren’t possible with AI before,” according to Google. Continue reading Google Offers Public Preview of Gemini Pro for Cloud Clients

Opera Browser Is Experimenting with Local Support for LLMs

Opera has become the first browser to add support for large language models (LLMs). At this point the feature is experimental, and available only on the Opera One Developer browser as part of the AI Feature Drops program. The update offers about 150 LLMs from more than 50 different families, including Meta’s LLaMA, Google’s Gemma, Mixtral and Vicuna. Opera had previously only offered local support for its own Aria AI, a competitor to Microsoft Copilot and OpenAI’s ChatGPT. The local LLMs are being offered for testing as a complimentary addition to Opera’s online Aria service. Continue reading Opera Browser Is Experimenting with Local Support for LLMs

Databricks DBRX Model Offers High Performance at Low Cost

Databricks, a San Francisco-based company focused on cloud data and artificial intelligence, has released a generative AI model called DBRX that it says sets new standards for performance and efficiency in the open source category. The mixture-of-experts (MoE) architecture contains 132 billion parameters and was pre-trained on 12T tokens of text and code data. Databricks says it provides the open community and enterprises who want to build their own LLMs with capabilities previously limited to closed model APIs. Compared to other open models, Databricks claims it outperforms alternatives including Llama 2-70B and Mixtral on certain benchmarks. Continue reading Databricks DBRX Model Offers High Performance at Low Cost

Apple Unveils Progress in Multimodal Large Language Models

Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.” Continue reading Apple Unveils Progress in Multimodal Large Language Models

Grok-1 Architecture Open-Sourced for General Release by xAI

Elon Musk’s xAI has released its Grok chatbot and open-sourced part of the underlying Grok-1 model architecture for any developer or entrepreneur to use for purposes including commercial applications. Musk unveiled Grok in November and announced that it would be publicly released this month. The chatbot itself is available to X social premium members, who can ask the cheeky AI questions and get answers with a snarky attitude inspired by “The Hitchhiker’s Guide to the Galaxy” sci-fi novel. The training for Grok’s foundation LLM is said to include X social posts. Continue reading Grok-1 Architecture Open-Sourced for General Release by xAI

Ozone Helps Users Customize Content Moderation on Bluesky

Decentralized social platform Bluesky has open-sourced a tool called Ozone that facilitates custom moderation. Debuting this week, Ozone lets individuals or teams collaboratively review and label content on the platform. “We’re opening up the ability to run your own independent moderation services, seamlessly integrated into the Bluesky app,” the company says, explaining “you’ll be able to create and subscribe to additional moderation services” on top of that which is administered by Bluesky’s moderation team, “giving you unprecedented control over your social media experience.” Continue reading Ozone Helps Users Customize Content Moderation on Bluesky

Google Introduces Open-Source Marketing Measurement Tool

Google has rolled out an open-source marketing mix model (MMM) called Meridian that aims to help in formulating cross-channel media strategies in the current environment of fragmented media consumption and privacy changes. As marketers contend with Google’s plan to sunset the use of third-party cookies by the end of this year, MMMs — classic tools of yesteryear — “are experiencing a renaissance,” says the search giant. MMMs are statistical analyses companies use to help measure the impact of cross-channel marketing sales. Google says it has “observed more customers turning to MMMs, especially performance and full-funnel marketers.” Continue reading Google Introduces Open-Source Marketing Measurement Tool

France’s Mistral AI Makes Its Global Debut on Microsoft Azure

Paris-based startup Mistral AI has made an immediate splash in the world of artificial intelligence, securing partnerships with IBM, Microsoft and others nine months after its launch. The company is offering natural language processing models, including its flagship Mistral Large, which becomes only the second LLM (after OpenAI) to land a commercial berth on Microsoft’s Azure cloud, where Meta Platforms’ Llama 2 is available in preview. Boasting “top-tier reasoning capacities” and sophisticated conversational capabilities, Mistral Large specializes in “reasoning, analysis and generation (RAG), is multilingual and supports up to 32,000 tokens.” Continue reading France’s Mistral AI Makes Its Global Debut on Microsoft Azure

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with University of California Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips, and adds filters based on text instructions using what Apple says is an easier instruction set than other AI editing programs, and is simpler and faster than learning a traditional program, like Apple’s own Final Cut Pro. Continue reading Apple Launches Open-Source Language-Based Image Editor

Meta Combines Research Units to Develop Open-Source AGI

In a Threads post last week, Meta Platforms CEO Mark Zuckerberg announced that the company’s new frontier is open-source artificial general intelligence (AGI). Meta has united its FAIR and GenAI research teams behind the goal of developing such a platform, which Zuckerberg described as part of the company’s “long-term vision.” “The next generation of services required is building full general intelligence, building the best AI assistants, AIs for creators, AIs for businesses and more,” Zuckerberg said, explaining that will require “advances in every area of AI from reasoning to planning to coding to memory and other cognitive abilities.” Continue reading Meta Combines Research Units to Develop Open-Source AGI

CES: Session Details the Impact and Future of AI Technology

Dr. Fei-Fei Li, Stanford professor and co-director of Stanford HAI (Human-Centered AI), and Andrew Ng, venture capitalist and managing general partner at Palo Alto-based AI Fund discussed the current state and expected near-term developments in artificial intelligence. As a general purpose technology, AI development will both deepen, as private sector LLMs are developed for industry-specific needs, and broaden, as open source public sector LLMs emerge to address broad societal problems. Expect exciting advances in image models — what Li calls “pixel space.” When implementing AI, think about teams rather than individuals, and think about tasks rather than jobs. Continue reading CES: Session Details the Impact and Future of AI Technology

Stability AI Is Offering Paid Membership for Commercial Users

As the pressure ratchets up for AI companies to go beyond the wow factor and make money, Stability AI has formalized three subscription tiers as it seeks to expand commercial use of its open-source, multimodal core models. The Stability AI Membership offerings include a free tier for personal and research (i.e., non-commercial) use, a professional tier that costs $20 a month, and a custom-priced enterprise tier for large outfits. The company says that with the three tiers it is “striking a balance between fostering competitiveness and maintaining openness in AI technologies.” Continue reading Stability AI Is Offering Paid Membership for Commercial Users

EU Makes Provisional Agreement on Artificial Intelligence Act

The EU has reached a provisional agreement on the Artificial Intelligence Act, making it the first Western democracy to establish comprehensive AI regulations. The sweeping new law predominantly focuses on so-called “high-risk AI,” establishing parameters — largely in the form of reporting and third-party monitoring — “based on its potential risks and level of impact.” Parliament and the 27-country European Council must still hold final votes before the AI Act is finalized and goes into effect, but the agreement, reached Friday in Brussels after three days of negotiations, means the main points are set. Continue reading EU Makes Provisional Agreement on Artificial Intelligence Act

Intuitive Mammoth App Aims to Simplify Accessing Mastodon

Mozilla-backed Mammoth wants to lure social media users to the fediverse, presenting its latest iteration, Mammoth 2, as “the easiest way to quit Twitter/X for good and join Mastodon.” Having added a “For You” feed earlier this year, Mammoth 2 now debuts on the iPhone, iPad and Mac, delving deeper into news and curation. New “Smart Lists” are filled with recommended posts, suggested connections and accounts to follow. The future of social “is being built today on ActivityPub and Mastodon,” Mammoth’s creators claim, calling for “an open protocol anybody can build on,” as with “email or the open web.” Continue reading Intuitive Mammoth App Aims to Simplify Accessing Mastodon

IBM Announces Significant Advances in Quantum Computing

IBM has produced two quantum computing systems to meet its 2023 roadmap, one based on a chip named Condor, which at 1,121 functioning qubits is the largest transmon-based quantum processor released to date. Transmon-based chips use a type of superconducting qubit that is more error-resistant than typical qubits, which are notoriously unstable. The second IBM system uses three Heron chips, each with 133 qubits. The more modestly scaled Heron and its successor, Flamingo, play a vital role in IBM’s quantum plan, which boasts major progress as a result of these developments. Continue reading IBM Announces Significant Advances in Quantum Computing