Using PlanExe with Anthropic

Anthropic makes the Claude family of models. PlanExe can use Anthropic models directly via the Anthropic API.

There are two ways to authenticate:

Method	Who it's for	Token format
API key	Anthropic API billing account	`sk-ant-api*`
Claude Code OAuth token	Claude Pro/Max subscribers	`sk-ant-oat*`

The Claude Code OAuth path lets you use PlanExe without a separate API billing account — if you already pay for Claude Pro or Max, you can generate a token and use it directly.

Authentication setup

Option A — Standard API key

Sign up at console.anthropic.com and add API credits.
Generate an API key from the Console.
Add to your .env:
```
ANTHROPIC_API_KEY=sk-ant-api...
```

Option B — Claude Code OAuth token (Claude Pro/Max subscribers)

If you have a Claude Pro or Max subscription, you can generate an OAuth token that works in place of an API key.

Requirements: - Claude Code installed (npm install -g @anthropic-ai/claude-code) - Active Claude Pro or Max subscription

Steps:

Run:
```
claude setup-token
```
This outputs a token starting with sk-ant-oat.
Add to your .env:
```
ANTHROPIC_API_KEY=sk-ant-oat01--...
```

PlanExe auto-detects the sk-ant-oat prefix and switches to the correct Bearer auth scheme automatically. No other changes needed.

Note: OAuth tokens may expire. If you get 401 errors, run claude setup-token again to refresh.

Choosing a model

PlanExe makes 60–100+ LLM calls per plan. Choose based on your priority:

Config ID	Model	Speed	Cost (est. per plan)	Notes
`anthropic-claude-haiku-4-5`	Haiku 4.5	Fastest	< $0.50	Best default for most users
`anthropic-claude-sonnet-4-6`	Sonnet 4.6	Medium	~$2–5	Better quality than Haiku
`anthropic-claude-sonnet-4-6-thinking`	Sonnet 4.6 + thinking	Slow	~$5–15	Extended reasoning per call
`anthropic-claude-opus-4-6`	Opus 4.6	Slow	~$15–40	Highest quality, highest cost
`anthropic-claude-opus-4-6-thinking`	Opus 4.6 + thinking	Very slow	~$30–80+	Maximum quality, use sparingly

For pinned versions (reproducible results), use: - anthropic-claude-haiku-4-5-pinned — locked to claude-haiku-4-5-20251001 - anthropic-claude-sonnet-4-5 — locked to claude-sonnet-4-5

Quickstart (Docker)

Install Docker (with Docker Compose).

Clone the repo:

git clone https://github.com/PlanExeOrg/PlanExe.git
cd PlanExe

Copy .env.docker-example to .env and configure:

ANTHROPIC_API_KEY=sk-ant-...
DEFAULT_LLM=anthropic-claude-haiku-4-5
PLANEXE_MODEL_PROFILE=custom
PLANEXE_LLM_CONFIG_CUSTOM_FILENAME=anthropic_claude.json

Start PlanExe:

docker compose up worker_plan frontend_single_user

Open http://localhost:7860 and submit a prompt.

To switch models, change DEFAULT_LLM in .env to any config ID from the table above, then restart:

docker compose restart worker_plan frontend_single_user

Extended thinking

Some entries in anthropic_claude.json enable extended thinking. These use a thinking_dict in the config:

"thinking_dict": {"type": "enabled", "budget_tokens": 10000}

budget_tokens controls how many tokens Claude can use for internal reasoning before writing its answer. Higher = slower + more expensive, but better structured output on complex tasks.

Trade-offs for PlanExe: - PlanExe runs 60–100+ calls per plan — thinking multiplies both cost and latency significantly - For most plans, Haiku or Sonnet without thinking produces good results - Use thinking variants only when plan quality is the top priority

Troubleshooting

401 Unauthorized - API key: check that ANTHROPIC_API_KEY is set correctly in .env. - OAuth token: run claude setup-token again — tokens expire.

400 credit balance too low - API key path: add credits at console.anthropic.com. - OAuth token path: OAuth tokens authenticate you, but API calls still consume credits from your Anthropic account. Top up at the Console if needed.

Check worker logs:

docker compose logs -f worker_plan

Report issues on Discord. Include your OS, hardware, and the error from log.txt in the output directory.

Next steps

Learn prompt quality: Prompt writing guide
Understand output sections: Plan output anatomy