Osaurus brings both local and cloud AI models to your Mac


As AI models increasingly become commoditized, startups are racing to build the software layer that sits on top of them. One interesting entrant into this space is Osaurus, an open source, Apple-only LLM server that lets users move between different AI models, either locally or in the cloud, while keeping their files and tools on their own hardware.

Osaurus evolved out of the idea for a desktop AI companion, Dinoki, which Osaurus co-founder Terence Pae described as a sort of "AI-powered Clippy." Dinoki's customers had asked him why they should buy the app if they still had to pay for tokens, the usage units AI companies charge for processing prompts and generating responses.

That got Pae thinking more deeply about running AI locally.

"That's how Osaurus started," Pae, previously a software engineer at Tesla and Netflix, told Trendster over a call. The idea, he explained, was to try to run an AI assistant locally. "You can do pretty much everything on your Mac locally, like browsing your files, accessing your browser, accessing your system configurations. I figured this would be a great way to position Osaurus as a personal AI for people."

Pae began building the tool in public as an open source project, adding features and fixing bugs along the way.

Image Credits: Osaurus, Inc.

Today, Osaurus can flexibly connect to locally hosted AI models or cloud providers like OpenAI and Anthropic. Users can freely choose which AI models they're using and keep other parts of the AI experience on their own hardware, like the models' own memory, or their files and tools.

Given that different AI models have different strengths, the advantage of this approach is that users can switch to the AI model that best fits their needs.
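To illustrate why switching is cheap in practice: local LLM servers like Osaurus typically expose an OpenAI-compatible chat API, so swapping between a local model and a cloud one amounts to changing the `model` field (and possibly the base URL) while the rest of the request stays identical. This is a minimal sketch; the endpoint URL, port, and model names below are illustrative assumptions, not taken from the project's documentation.

```python
# Hypothetical local endpoint -- an OpenAI-compatible chat completions URL;
# the port is an assumption for illustration only.
OSAURUS_URL = "http://localhost:1337/v1/chat/completions"

def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat completion payload.

    Switching models, local or cloud, is just a change of the
    `model` field; the message structure is unchanged.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }

# Same request shape, two different backends (model names are examples):
local_req = build_chat_request("llama-3.2-3b", "Summarize my notes.")
cloud_req = build_chat_request("gpt-4o", "Summarize my notes.")
```

Because the payload shape is shared, a harness can route the same conversation to whichever backend best fits the task.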

Such a structure makes Osaurus what's called a "harness": a control layer that connects different AI models, tools, and workflows through a single interface, similar to tools like OpenClaw or Hermes. The difference, however, is that such tools are often aimed at developers who know their way around a terminal. And sometimes, as in the case of OpenClaw, they can pose security issues and holes to worry about.

Osaurus, meanwhile, offers an easy-to-use interface and addresses security concerns by running things in a hardware-isolated virtual sandbox. This limits the AI to a certain scope, keeping your computer and data safe.

Image Credits: Osaurus, Inc.

Of course, the practice of running AI models on your own machine is still in its early days, given that it's heavily resource-intensive and hardware-dependent. To run local models, your system will need at least 64GB of RAM. For running larger models, like DeepSeek v4, Pae recommends systems with about 128GB of RAM.

But Pae believes local AI's requirements will come down in time.

"I can see the potential of it, because the intelligence per wattage, which is like the metric for local AI, has been going up significantly. It's on its own curve of innovation. Last year, local AI could barely finish sentences, but today it can actually run tools, write code, access your browser, and order stuff from Amazon … It's just getting better and better," he said.

Image Credits: Osaurus, Inc.

Osaurus today can run MiniMax M2.5, Gemma 4, Qwen3.6, GPT-OSS, Llama, DeepSeek V4, and other models. It also supports Apple's on-device foundation models and Liquid AI's LFM family of on-device models, and in the cloud, it can connect to OpenAI, Anthropic, Gemini, xAI/Grok, Venice AI, OpenRouter, Ollama, and LM Studio.

As a full MCP (Model Context Protocol) server, it can give any MCP-compatible client access to your tools as well. Plus, it ships with over 20 native plug-ins for Mail, Calendar, Vision, macOS Use, XLSX, PPTX, Browser, Music, Git, Filesystem, Search, Fetch, and more.
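The article doesn't show how that wiring works, but MCP clients are commonly configured with a JSON entry per server. A hypothetical snippet pointing an MCP client at a local Osaurus instance might look like this; the server name, URL, and port are illustrative assumptions, not values from the project:

```json
{
  "mcpServers": {
    "osaurus": {
      "url": "http://localhost:1337/mcp"
    }
  }
}
```

Once registered, the client can discover and call whichever of the server's tools, like the Mail or Filesystem plug-ins above, the user has allowed.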

More recently, Osaurus was updated to include voice capabilities as well.

Since the project went live nearly a year ago, it has been downloaded north of 112,000 times, according to its website. The app competes with other tools that let you run models locally, like Ollama, Msty, and LM Studio, but offers a differentiated feature set and presents itself as a more user-friendly option for non-developers, too.

Currently, Osaurus' founders (who include co-founder Sam Yoo) are participating in the New York-based startup accelerator Alliance. They're also thinking about next steps, which could see Osaurus being offered to businesses, like those in the legal space or in healthcare, where running local LLMs could address privacy concerns.

As the power of local AI models grows, the team believes it could lower the demand for AI data centers.

"We're seeing this explosive growth in the AI space where [cloud AI providers] have to scale up using data centers and infrastructure, but we feel like people haven't really seen the value of local AI yet," Pae said. "Instead of relying on the cloud, they can actually deploy a Mac Studio on-prem, and it should use significantly less power. You still have the capabilities of the cloud, but you will not be dependent on a data center to be able to run that AI," he added.

