Multimodal AI becomes accessible: new model runs on your laptop

DIGITAL BUSINESS MAGAZINE, 11/2023

A new open-source artificial intelligence model named Obsidian, announced in an Oct. 30 Reddit post, marks a step forward in multimodal AI accessibility. Obsidian is described as the first 3B-parameter multimodal model, which makes it compact enough to run efficiently on a regular laptop.

Multimodal AI refers to AI systems that can process and connect data from different modes, such as text, images, audio, and video. In this case, the model accepts text and pictures as input, much like OpenAI’s GPT-4V. While multimodal AI models like DALL-E 3 and GPT-4 have shown impressive capabilities, their enormous size makes them resource-intensive to run, requiring expensive high-end hardware. Their weights are also closely guarded, so you could not run them yourself even if you had the necessary specialized hardware.

The AI model Obsidian packs multimodal intelligence into a standard laptop’s memory

Obsidian changes this by packing multimodal intelligence into a model small enough to fit into a standard laptop’s memory and run at practical speeds. At 3 billion parameters, Obsidian builds on the Capybara-3B model, which is reported to achieve state-of-the-art performance among similarly sized models. The developer also announced on Reddit that a multimodal model based on the highly praised open-source Mistral 7B model will soon follow.
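To put the memory claim in rough numbers, the storage needed for a model's weights is approximately the parameter count multiplied by the bytes used per parameter. The sketch below is a back-of-the-envelope estimate only. The precisions listed are illustrative assumptions rather than figures from the Obsidian announcement, and the function name `weight_memory_gib` is made up for this example. It counts weights alone and ignores activations, caches, and any separate vision encoder.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: only the weights are counted; runtime overhead is ignored.
# The precisions below are illustrative, not taken from the announcement.

BYTES_PER_PARAM = {
    "fp32": 4.0,   # 32-bit floats
    "fp16": 2.0,   # 16-bit floats
    "int8": 1.0,   # 8-bit quantization
    "4-bit": 0.5,  # 4-bit quantization
}


def weight_memory_gib(num_params: float, precision: str) -> float:
    """Return the approximate weight storage in GiB at the given precision."""
    return num_params * BYTES_PER_PARAM[precision] / 2**30


if __name__ == "__main__":
    for name, params in [("3B model", 3e9), ("7B model", 7e9)]:
        for precision in BYTES_PER_PARAM:
            gib = weight_memory_gib(params, precision)
            print(f"{name} @ {precision}: {gib:.1f} GiB")
```

Under these assumptions, a 3-billion-parameter model needs about 5.6 GiB for its weights at 16-bit precision and about 1.4 GiB at 4-bit, which is within the memory of many consumer laptops. A 7-billion-parameter model at 16-bit precision needs roughly 13 GiB.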

Obsidian owes its compact size to techniques adapted from the LLaMA model architecture. According to the Reddit post announcing Obsidian, it was pre-trained on a diverse synthesized multimodal dataset that includes text paired with corresponding images. This training approach allowed it to develop strong language and vision capabilities despite its reduced parameter count.

The result is an AI assistant with conversational skills and visual understanding that can fit in your backpack. Obsidian breaks down barriers to accessing AI, opening up new possibilities for on-device intelligence.

While still an early version, Obsidian’s efficient form factor sets an exciting precedent. It demonstrates that multimodal AI does not have to be locked up in giant data centers but can be made compact enough to be distributed widely.

Featured Image Credit: From Image Creation at Aimesoft; Thank you!

Radek Zielinski

Radek Zielinski is an experienced technology and financial journalist with a passion for cybersecurity and futurology.

