Published: 2026-06-03

Run Hundreds of Free Claude Code Agents Using a Local Model via Cowork 3P

Chapters / key moments (click to jump — plays here on the page)

Bart Slodyczka demonstrates running hundreds of Claude Code agents completely free by using Co-work on 3P — Anthropic's officially supported feature that lets you plug your own AI backend into Claude Code. The demo runs a multi-agent deep research task with six parallel agents, all powered by a local Gemma 4 E2B model served through LM Studio at zero API cost.

Source video

"Spin up 100's of agents for free in claude code" by Bart Slodyczka — Watch on YouTube →

Key Takeaways

Co-work on 3P is an officially supported Anthropic feature — not a hack. It allows any local or third-party model to serve as the backend for Claude Code and the Claude desktop app.
The setup requires the Claude desktop app, not just the CLI; developer mode must be enabled first via Help → Troubleshooting.
The critical step is renaming your model in LM Studio to match a Claude-compatible identifier (e.g. claude-opus-4.8) — Claude Code uses this to detect supported models.
No Anthropic API key is needed for this setup; you provide a placeholder value for the static API key field.
Real model used in the demo: Gemma 4 E2B running locally; cost per session = $0 regardless of how many agents run in parallel.

Setup Steps

Download the Claude desktop app — required (the CLI alone does not have the 3P settings UI).
Enable developer mode — Go to Help → Troubleshooting → Enable developer mode. A new "Developer" section appears in settings.
Open Configure third-party inference — In the Developer settings, click "Configure third-party inference".
Set connection type to Gateway — Required for local model endpoints.
Set credential type to Static API key — Enter any placeholder value; no real key needed for local models.
Paste your LM Studio URL — Get this from LM Studio's server settings (typically http://localhost:1234/v1).
Set auth to bearer — Leave as default.
In LM Studio: rename the model — Change the loaded model's display name to claude-opus-4.8 (or another Claude-compatible name) so Claude recognizes it.
Load the model in LM Studio — Click Load Model, select your model, set max tokens, and load.
Verify model discovery — Apply changes; Claude should detect "1 model found".

Commands & Code Mentioned

# LM Studio server URL (paste into Claude desktop third-party inference config)
http://localhost:1234/v1

# Model identifier to set in LM Studio (must match Claude naming)
claude-opus-4.8

# After setup, Claude Code desktop app shows local models in model picker
# No CLI commands needed — configured entirely through the desktop app UI

Related Guides

OpenClaw Setup Guide — standard Claude Code installation
Claude Cowork Setup — getting started with Claude's web interface
Cost Optimisation — strategies for reducing token spend