Models & Comparisons

Claude Running Ollama Models

Last updated 2026-07-25

What's new

2026-07-25

A new AI design tool called Kimi K3 is challenging the top-ranked Claude, offering similar quality at 30% lower cost.
Both tools were tested on creating slides, dashboards, and websites, judged on output, cost, and speed.
Kimi K3 can be accessed with an API key from OpenRouter (openrouter.ai, a service that routes requests to many different AI models) and used within Claude Code.
For audio narration, the presenter used Fish Audio (a cheap voice-cloning service with 83+ languages and emotion customization) instead of ElevenLabs (a more expensive alternative).

2026-07-19

The Vercel AI SDK (a free, open-source toolkit for building AI apps) lets you switch between different AI models (like OpenAI or Claude) with just one line of code change.
It simplifies text streaming (getting AI responses word by word instead of all at once) and structured data extraction (getting clean, organized data from AI) with minimal coding.
The SDK includes ready-made tools for creating chat interfaces and building AI agents (automated AI tasks) that can perform multi-step jobs and even integrate with your own code or databases.
Version 7 of the SDK introduces durable agents (agents that can pause and resume), tool approvals (human oversight for risky tasks), and built-in telemetry (tracking and debugging tools).

2026-07-16

Claude (an AI assistant) can automate boring tasks, like sorting emails into categories (leads, urgent, etc.) and drafting responses, saving you 5-10 hours weekly.
For leads, Claude can research companies, draft replies, and even schedule meetings using your calendar, streamlining your sales process.
After client calls, Claude can generate branded PDF proposals with scope, pricing, and signatures, saving time on manual proposal creation.
This setup can be adapted to various jobs, especially those involving sales, marketing, or regular research tasks.

2026-07-07

Claude (an AI assistant) can now help with marketing tasks, like creating content and finding customers, even if you're not tech-savvy.
You can use Claude to mimic your voice, generate images/videos, and connect with other marketing tools (like email or social media platforms).
Claude's Cowork mode lets it access and edit files on your computer, making it more powerful than the basic chat mode.
A free resource pack is available to guide you through setting up Claude for marketing, including pre-made prompts and step-by-step instructions.

2026-07-04

Claude (an AI tool by Anthropic) is helping non-coders build real businesses, like Vulcan, which won a Virginia state contract for government software.
Claude Code (a feature of Claude) builds full apps, websites, or automates tasks in minutes, just by describing what you want in plain English.
Claude is "agentic" (it takes action on its own), works in parallel (multiple tasks at once), and remembers things, making it a powerful tool for businesses and jobs.
Over half of the startups in Y Combinator's (a famous startup accelerator) newest batch are building with Claude, showing its growing popularity and effectiveness.

2026-07-01

Claude Design 2.0 (a tool for creating websites, apps, and more using AI) now uses credits more efficiently, so you won't run out as quickly.
You can now access Claude Design within the Claude desktop app (a program you download to use Claude on your computer), making it easier to use.
Claude Design can create presentations, taking inspiration from images you provide, and even includes speaker notes for each slide.
You can export your designs to various platforms like PowerPoint, PDF, Miro (a collaborative online whiteboard), and Figma (a web-based design tool).

2026-06-28

A new AI tool called Jarvis (an AI assistant) helps manage and summarize team activities, ensuring security and control within a company's own AWS (Amazon Web Services, a cloud computing platform) account.
This setup is designed for larger companies, non-profits, or organizations with strict guidelines, allowing them to securely use tools like Salesforce (a customer relationship management platform) or Slack (a communication tool) on mobile devices.
The platform built on AWS Bedrock (a service for building and scaling generative AI applications) can be emulated in other cloud environments like Azure or GCP (Google Cloud Platform, a suite of cloud computing services).
Users can create and manage multiple AI agents, set their roles, and connect them to communication tools like Telegram (a messaging app) or Slack, with all data and interactions secured within the AWS environment.

2026-06-25

Claude managed agents (AI workers that do tasks for you) can now run on their own schedule and safely use real passwords or API keys (secure codes that let software talk to each other), making them more useful for real work.
These agents act like employees, using your tools like email or Slack (a messaging app for teams) to complete tasks you describe in plain English, without needing you to set up complex workflows.
Claude has improved by allowing agents to run automatically and securely store sensitive information, addressing two major limitations that previously made AI agents feel like just demos or toys.
The service offers different AI models (Opus for complex tasks, Sonnet for everyday work, and Haiku for quick, cheap results) and a simple console (dashboard) to manage agents, view credits, and handle billing.

2026-06-19

A new free AI model called GLM 5.2 (a type of AI that generates text and code) from China is challenging top models like Claude Opus (a paid AI model from a US company called Anthropic) in performance.
The creator installed GLM 5.2 using Ollama (a tool for running AI models on your own computer) and ran it alongside Claude Opus to compare their abilities.
They used both models to create games like a 3D shooter and Block Breaker, finding that while Claude Opus excelled at the shooter, GLM 5.2 performed better at Block Breaker.
The comparison showed that GLM 5.2 can compete with high-end models, offering a powerful free alternative for tasks like coding and game design.

2026-06-16

**Level up from basic use**: Treat Claude (an AI assistant) like more than a search engine; make it interview you for context and check its own work for accuracy.
**Create projects for persistent memory**: Use Claude's project feature to maintain context across sessions, building a customized workspace with a master prompt (a file of instructions about your role and preferences).
**Integrate with other tools**: Connect Claude to your email, calendar, and other apps to streamline workflows, reducing the need to switch between tabs and copy-paste information.
**Build visualizations and interactive artifacts**: Use Claude to create graphs, mockups, and mini apps within the chat for easier data interaction and learning.

2026-06-13

Claude Code (a paid AI tool) can be paired with Claude Code Router, which routes requests to cheaper models to reduce costs.
Semrush (a tool for tracking website performance) offers an MCP (Model Context Protocol) server that can be connected to Claude Code to analyze competitors and find keywords.
Harbor SEO (a service for creating SEO-optimized content) offers a low-cost way to generate blog posts, with plans starting at 29 euros a month for 35 articles.

2026-06-07

Anthropic (the company behind Claude, an AI assistant) shared key lessons from building Claude skills (folders of instructions and resources that help Claude do tasks better), emphasizing that skills are more than just simple text files.
They categorized skills into nine types, like "library and API references" (tools for developers) and "business automation" (workflow helpers), advising to keep skills focused on one category for best results.
Anthropic recommends including a "gotchas" section in skills, highlighting common mistakes to help Claude avoid errors and improve performance over time.
They suggest using the file system for "progressive disclosure," organizing skill content in a way that reveals information step by step, making it easier for Claude to learn and use skills effectively.
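The folder-based layout described above can be sketched as follows. This is a minimal illustration, not Anthropic's own template: the skill name `proposal-writer`, the `references/pricing.md` file, and the wording of the instructions are all hypothetical. It shows one top-level `SKILL.md` with a short description and a "Gotchas" section, plus a reference file that Claude is told to open only when needed, which is the idea behind progressive disclosure.

```python
import tempfile
from pathlib import Path

# Hypothetical skill content: the name, description, and file paths are
# illustrative assumptions, not taken from the source.
SKILL_MD = """\
---
name: proposal-writer
description: Drafts branded client proposals from call notes.
---

# Proposal writer

Read references/pricing.md only when the proposal needs a price table.

## Gotchas

- Do not invent prices; ask if pricing.md has no matching tier.
"""


def create_skill(root: Path) -> Path:
    """Create a one-category skill folder with a detail file kept separate."""
    skill = root / "proposal-writer"
    (skill / "references").mkdir(parents=True, exist_ok=True)
    (skill / "SKILL.md").write_text(SKILL_MD, encoding="utf-8")
    # Detail lives in its own file so it is loaded only when it is relevant.
    (skill / "references" / "pricing.md").write_text(
        "# Pricing tiers\n", encoding="utf-8"
    )
    return skill


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        folder = create_skill(Path(tmp))
        for path in sorted(folder.rglob("*")):
            print(path.relative_to(folder))
```

Keeping the top-level file short and pushing detail into separate files means the model reads the extra material only when a task calls for it.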

2026-06-03

Skill Creator automatically builds reusable skills (tools Claude learns to use repeatedly) from plain English descriptions, no manual code writing required.
Superpowers makes Claude plan first, write tests (code checks that verify everything works), and self-review twice—catching mistakes before clients see them.
Businesses pay most for simple, boring skills that save time, money, or prevent errors—not flashy ones made just for social media videos.

Key points

What it is

**Claude with Ollama models** lets you use local, open-source AI models (AI you run on your own computer) inside Anthropic's Claude desktop app (a chat-based AI assistant).
This means you can use cheaper or specialized AI models for different tasks without switching tools, changing how developers build and test AI applications.

How to use it

Download and install Ollama from ollama.com, then run the command `ollama launch Claude` in your terminal (command-line interface) to set up and start the AI agent.
In the Claude desktop app, a new model picker drop-down will appear, showing all Ollama models you can choose from to handle tasks like organizing files or talking to your local apps.

Watch out for

Web search and extensions are not supported yet, and open-source models may misbehave if they weren't trained on Claude Code's tools or have too small a context window (amount of text they can remember).
To avoid mistakes, use a model with a sufficiently large context window and Claude skills like "superpowers" to force the model to plan before coding.

Tools named

Ollama (tool for running AI models on your own computer), Kimi, GPT-OSS, Qwen, Devstral, Ministral, GLM, MiniMax (AI models)

Lesson 1: What is Claude Running Ollama Models and why it matters

Claude running Ollama models means you can now use local, open-source AI models inside Anthropic's Claude desktop app. Instead of only using Anthropic's paid models, a dropdown lets you pick models like Kimi or Qwen from the Ollama lineup (Ollama is a tool for running AI models on your own computer). Claude will then handle real work autonomously, such as organizing files or talking to your local apps, using the model you selected.

This matters for AI development because it changes how developers build and test. You can use cheaper or specialized models for different tasks without switching tools. For example, Claude Code (Anthropic's coding agent) works with these local models too, which can cut costs substantially. One command, `ollama launch Claude`, pulls the right model, sets up environment variables, and starts the agent in seconds.

There are two current limits: web search and extensions are not supported yet. Also, open-source models may misbehave because they weren't trained on Claude Code's tools, may have small context windows (amount of text they can remember), or might not follow the exact protocol Claude expects. Still, this integration signals a shift where AI tools work together rather than compete—embedding everywhere instead of walling off. Full setup instructions are at docs.ollama.com/integration/claudedesktop.

Lesson 2: How to use Claude Running Ollama Models: step-by-step

To use Claude with Ollama models (local AI models you run on your own computer), start by downloading Ollama from ollama.com for your operating system. After installation, open your terminal (command-line interface) and run the command `ollama launch Claude`. This single command pulls the right model, wires up environment variables (settings the system needs to connect tools), and starts the Claude agent for you — typically in about 10 seconds.

Once running, reload the Claude desktop app. A new model picker drop-down appears showing all Ollama models discovered automatically. You can choose from Kimi, GPT-OSS, Qwen, Devstral, Ministral, GLM, or MiniMax. Switch to Claude Cowork, pick a model, and let it handle real tasks like organizing files or talking to your local apps autonomously (working on its own). The same drop-down works in both the desktop app and VS Code.
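To see which local models are available before opening the picker, you can ask the local Ollama server directly. The sketch below assumes Ollama is serving on its default address (`http://localhost:11434`) and uses its `/api/tags` endpoint, which lists installed models. It does not claim to be how the desktop app performs its own discovery, and the helper names are illustrative.

```python
import json
import urllib.error
import urllib.request

# Assumption: Ollama's local server listens on its default address.
OLLAMA_URL = "http://localhost:11434"


def model_names(tags_payload: dict) -> list:
    """Extract sorted model names from an /api/tags-style response body."""
    return sorted(m["name"] for m in tags_payload.get("models", []))


def list_local_models(base_url: str = OLLAMA_URL, timeout: float = 2.0) -> list:
    """Return installed model names, or an empty list if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            return model_names(json.load(resp))
    except (urllib.error.URLError, OSError, ValueError):
        # Server not running, connection refused, timeout, or malformed JSON.
        return []


if __name__ == "__main__":
    for name in list_local_models():
        print(name)
```

An empty result usually means either no models have been pulled yet or the Ollama server is not running.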

Two limits to know: web search is not supported yet, and extensions are not supported yet. Sub-agents (helper AI tasks) inherit your current model choice. For full setup details, visit docs.ollama.com/integration/claudedesktop.

If you prefer using Claude Code specifically with a custom model, first pull your model using `ollama pull` followed by the model name — for example, `ollama pull qwen3.5:9b`. Then run `ollama launch Claude` in your VS Code terminal. The terminal lets you choose which local model to run Claude with. Note that open-source models may misbehave if they were not trained on Claude Code's tools, have too small a context window (amount of text the model can remember at once) for Claude's system prompt, or do not follow the exact JSON protocol (data format rules) Claude Code expects.
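The pull-then-launch sequence above can be scripted. This is a small sketch that wraps the two commands exactly as the text spells them; the function names and the dry-run default are assumptions added for illustration, so nothing is executed unless you opt in and `ollama` is on your PATH.

```python
import shutil
import subprocess


def build_commands(model: str) -> list:
    """Return the two commands from the text: pull the model, then launch."""
    return [
        ["ollama", "pull", model],
        ["ollama", "launch", "Claude"],  # spelled as in the text above
    ]


def run(model: str, dry_run: bool = True) -> list:
    """Print each command; execute it only if dry_run is False and ollama is installed."""
    printed = []
    for cmd in build_commands(model):
        line = " ".join(cmd)
        printed.append(line)
        print("$", line)
        if not dry_run and shutil.which("ollama"):
            subprocess.run(cmd, check=True)
    return printed


if __name__ == "__main__":
    run("qwen3.5:9b")  # dry run by default; pass dry_run=False to execute
```

The dry run is the default because the launch step starts an interactive agent, which is rarely what you want from an unattended script.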

Lesson 3: Best practices and pitfalls

Running Ollama models through Claude is now a single command: `ollama launch Claude`. This command automatically pulls the right model, wires up environment variables (settings the system needs to connect everything), and starts the agent (an AI that performs tasks for you) in about ten seconds. Before this shortcut, you had to install Ollama manually, pull a model, set three or four environment variables by hand, figure out model compatibility, and then launch the agent. That friction is gone.
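For contrast, here is roughly what the manual route looked like. The text does not name the environment variables, so the names and values in this sketch are assumptions chosen for illustration: an Anthropic-style base URL pointed at Ollama's default local port, a placeholder token, and a model name. Check the Ollama integration docs for the actual settings.

```python
import os

# Assumption: Ollama's default local address.
DEFAULT_BASE_URL = "http://localhost:11434"


def manual_env(model: str, base_url: str = DEFAULT_BASE_URL) -> dict:
    """Build an environment for launching the agent against a local model by hand."""
    env = dict(os.environ)  # start from the current environment
    env.update(
        {
            # All three names and values are illustrative assumptions.
            "ANTHROPIC_BASE_URL": base_url,
            "ANTHROPIC_AUTH_TOKEN": "ollama",  # placeholder, not a real secret
            "ANTHROPIC_MODEL": model,
        }
    )
    return env


if __name__ == "__main__":
    env = manual_env("qwen3.5:9b")
    for key in ("ANTHROPIC_BASE_URL", "ANTHROPIC_AUTH_TOKEN", "ANTHROPIC_MODEL"):
        print(f"{key}={env[key]}")
```

The one-command shortcut removes this step, along with the model pull and the compatibility guesswork.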

The integration is deep. The Claude desktop app shows a drop-down with all your local Ollama models discovered automatically, including Kimi, GPT-OSS, Qwen, and others. You can also use Claude Code inside VS Code with the same drop-down. The `ollama launch Claude` command works there too.

But pitfalls exist. Open-source models running in Claude Code may misbehave for specific reasons. They might not have been trained on Claude Code's tools (the functions and commands Claude Code uses to work). They might have a context window (the amount of text the model can remember at once) too small for Claude's system prompt (initial instructions guiding behavior). And they might not follow the exact same JSON protocol (the structured data format) that Claude Code expects. Think of it like putting a motorcycle engine into a truck — the parts don't match.
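The JSON protocol pitfall is easier to see with a concrete check. The sketch below is illustrative only: the required fields (`name` and `input`) are hypothetical stand-ins for whatever shape the agent actually expects. It shows how a reply that is chatty prose, or valid JSON with missing fields, fails where a well-formed tool call passes.

```python
import json
from typing import Optional

# Hypothetical field names for illustration; the real protocol may differ.
REQUIRED_KEYS = {"name", "input"}


def parse_tool_call(raw: str) -> Optional[dict]:
    """Return the tool call as a dict, or None if it breaks the expected shape."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return None  # the model replied with prose or broken JSON
    if not isinstance(data, dict) or not REQUIRED_KEYS.issubset(data):
        return None  # valid JSON, but not the object shape we need
    if not isinstance(data["name"], str) or not isinstance(data["input"], dict):
        return None  # right keys, wrong value types
    return data


if __name__ == "__main__":
    good = '{"name": "read_file", "input": {"path": "notes.txt"}}'
    bad = "Sure! I will read the file for you."
    print(parse_tool_call(good))
    print(parse_tool_call(bad))
```

A model that was not trained on this kind of structured output tends to produce the second kind of reply, which the agent cannot act on.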

To avoid mistakes, use a model with a sufficiently large context window. The Qwen 3.5 model with a 64,000-token context window is a common choice for this reason. Also, use Claude skills (pre-built instructions that improve model behavior) like "superpowers," which forces the model to plan before coding, work in an isolated environment, and write tests first. This reduces chaotic behavior from models not trained on Claude's specific agent tools.
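A quick back-of-the-envelope check can tell you whether a model's context window leaves room for real work once the system prompt is loaded. The sketch below uses a rough rule of thumb of about four characters per token, which is an assumption and not a real tokenizer, and a hypothetical reserve of 8,000 tokens for the conversation itself.

```python
CHARS_PER_TOKEN = 4  # rough rule of thumb, not a real tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate token count by rounding up len(text) / CHARS_PER_TOKEN."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_context(
    system_prompt: str, context_window: int, reserve_for_work: int = 8_000
) -> bool:
    """True if the prompt plus a working reserve fits in the context window."""
    return estimate_tokens(system_prompt) + reserve_for_work <= context_window


if __name__ == "__main__":
    prompt = "x" * 80_000  # stand-in for a long system prompt (~20,000 tokens)
    print(fits_context(prompt, 64_000))
    print(fits_context(prompt, 8_192))
```

With these assumed numbers, a 64,000-token window passes while an 8,192-token window does not, which matches the advice to prefer a larger context window for agent work.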
