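April 24, 2026 09:24

9 files changed +102 -6

Summary of this update

This update adds the `ENABLE_PROMPT_CACHING_1H` setting, which extends the prompt cache TTL (time to live) from the default 5 minutes to 1 hour. It also introduces an experimental "forked subagent" feature that inherits the entire current conversation context and runs a task in the background. Alongside this, the docs now specify how to enable the feature with the `CLAUDE_CODE_FORK_SUBAGENT` environment variable and how the behavior of the `/fork` command changes.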

agent-sdk/cost-tracking +30 -1

Adds the environment variable setting that extends the prompt cache TTL to 1 hour, along with an explanation of the resulting cost changes.

@@ -5,7 +5,7 @@ source: https://code.claude.com/docs/en/agent-sdk/cost-tracking.md

# Track cost and usage

> Learn how to track token usage, deduplicate parallel tool calls, and estimate costs with the Claude Agent SDK.

> Learn how to track token usage, estimate costs, and configure prompt caching with the Claude Agent SDK.

The Claude Agent SDK provides detailed token usage information for each interaction with Claude. This guide explains how to properly track usage and understand cost reporting, especially when dealing with parallel tool uses and multi-step conversations.

@@ -206,6 +206,35 @@ The Agent SDK automatically uses [prompt caching](https://platform.claude.com/do

Track these separately from `input_tokens` to understand caching savings. In TypeScript, these fields are typed on the [`Usage`](/en/agent-sdk/typescript#usage) object. In Python, they appear as keys in the [`ResultMessage.usage`](/en/agent-sdk/python#result-message) dict (for example, `message.usage.get("cache_read_input_tokens", 0)`).
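As a rough illustration of tracking these fields separately, the sketch below summarizes a usage dict shaped like `ResultMessage.usage`. The helper name `summarize_cache_usage` is hypothetical, and the `cache_creation_input_tokens` key is an assumption based on the Claude API usage schema; only `cache_read_input_tokens` and `input_tokens` are named in the text above.

```python
def summarize_cache_usage(usage: dict) -> dict:
    """Summarize how much of the prompt was served from the cache.

    `usage` is assumed to look like ResultMessage.usage: a plain dict
    whose cache fields may be missing, hence the .get(..., 0) defaults.
    """
    uncached = usage.get("input_tokens", 0)
    cache_read = usage.get("cache_read_input_tokens", 0)
    # Assumed key name, taken from the Claude API usage schema.
    cache_write = usage.get("cache_creation_input_tokens", 0)

    total = uncached + cache_read + cache_write
    hit_ratio = cache_read / total if total else 0.0
    return {
        "total_prompt_tokens": total,
        "cache_read_tokens": cache_read,
        "cache_write_tokens": cache_write,
        "cache_hit_ratio": hit_ratio,
    }
```

A low `cache_hit_ratio` across sessions that share the same system prompt can hint that cache entries are expiring between runs.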

### Extend the prompt cache TTL to one hour

Cache entries written by the SDK use a 5-minute TTL by default when you authenticate with an API key or run on Amazon Bedrock, Google Cloud Vertex AI, or Microsoft Foundry. If your workload runs many short sessions against the same system prompt and context with gaps longer than 5 minutes between them, the cache expires between sessions and each new session pays full input price.

To request a 1-hour TTL on cache writes, set the [`ENABLE_PROMPT_CACHING_1H`](/en/env-vars) environment variable. You can export it in your shell or container environment, or pass it through `options.env`.

The following example enables 1-hour TTL for an agent running on Bedrock:

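```python Python theme={null}
from claude_agent_sdk import ClaudeAgentOptions

# Variables set here are passed through to the underlying CLI.
options = ClaudeAgentOptions(
    env={
        "CLAUDE_CODE_USE_BEDROCK": "1",
        "ENABLE_PROMPT_CACHING_1H": "1",
    },
)
```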

```typescript TypeScript theme={null}
const options = {
  env: {
    // Keep the inherited environment, then add the overrides.
    ...process.env,
    CLAUDE_CODE_USE_BEDROCK: "1",
    ENABLE_PROMPT_CACHING_1H: "1",
  },
};
```

Cache writes with a 1-hour TTL are billed at a higher rate than 5-minute writes, so enabling this trades higher write cost for more cache reads. See [prompt caching pricing](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) for details. Claude subscription users already receive 1-hour TTL automatically and do not need to set this variable.
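To make the trade-off concrete, here is a minimal back-of-the-envelope sketch. The price multipliers (1.25x for a 5-minute write, 2x for a 1-hour write, 0.1x for a cache read, all relative to the base input price) are assumptions for illustration and should be checked against the pricing page. The function `prefix_cost` is hypothetical, and the model is deliberately simplified to one request per session with evenly spaced sessions.

```python
# Assumed price multipliers relative to the base input-token price.
# Illustrative only; confirm against the prompt caching pricing page.
WRITE_5M = 1.25   # cache write, 5-minute TTL (assumption)
WRITE_1H = 2.0    # cache write, 1-hour TTL (assumption)
CACHE_READ = 0.1  # cache read (assumption)


def prefix_cost(sessions: int, gap_minutes: float, ttl_minutes: float,
                write_multiplier: float,
                read_multiplier: float = CACHE_READ) -> float:
    """Relative cost of the shared prompt prefix across `sessions` runs.

    The result is in units of "one uncached pass over the prefix".
    Simplification: one request per session, spaced gap_minutes apart.
    """
    if sessions <= 0:
        return 0.0
    if gap_minutes < ttl_minutes:
        # The entry is still alive when the next session starts:
        # the first session writes it, later sessions read it.
        return write_multiplier + (sessions - 1) * read_multiplier
    # The entry expires between sessions, so every session writes again.
    return sessions * write_multiplier


if __name__ == "__main__":
    for gap in (15, 90):
        five = prefix_cost(10, gap, 5, WRITE_5M)
        hour = prefix_cost(10, gap, 60, WRITE_1H)
        print(f"gap={gap}min  5m TTL: {five:.2f}  1h TTL: {hour:.2f}")
```

Under these assumptions, ten sessions spaced 15 minutes apart cost about 12.5 prefix units with the default TTL and about 2.9 with the 1-hour TTL. With 90-minute gaps the 1-hour setting costs more (20 versus 12.5), because every session pays the higher write rate and never gets a cache read.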

## Related documentation

- [TypeScript SDK Reference](/en/agent-sdk/typescript) - Complete API documentation

agent-sdk/python +1 -1

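Clarifies that the `env` option is merged on top of the inherited process environment, and links to the environment variables that the underlying CLI reads.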

@@ -792,7 +792,7 @@ class ClaudeAgentOptions:

| `env` | `dict[str, str]` | `{}` | Environment variables |

| `env` | `dict[str, str]` | `{}` | Environment variables merged on top of the inherited process environment. See [Environment variables](/en/env-vars) for variables the underlying CLI reads |

| `extra_args` | `dict[str, str \| None]` | `{}` | Additional CLI arguments to pass directly to the CLI |
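The "merged on top of the inherited process environment" wording in the new `env` row can be pictured with the sketch below. This is not the SDK's implementation; `effective_env` is a hypothetical helper that only illustrates the described semantics, where keys you pass win over inherited ones and everything else is kept.

```python
import os
from typing import Dict, Mapping, Optional


def effective_env(overrides: Mapping[str, str],
                  inherited: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return the inherited environment with `overrides` layered on top.

    `inherited` defaults to the current process environment; it is a
    parameter only so the behavior can be demonstrated deterministically.
    """
    merged = dict(os.environ if inherited is None else inherited)
    merged.update(overrides)  # keys passed via `env` take precedence
    return merged
```

For example, passing `{"ENABLE_PROMPT_CACHING_1H": "1"}` leaves an inherited `PATH` untouched while setting or replacing `ENABLE_PROMPT_CACHING_1H`.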

agent-sdk/typescript +1 -1

Adds a description to the `env` option definition, with a link to the documentation for the environment variables that the underlying CLI reads.

@@ -330,7 +330,7 @@ Configuration object for the `query()` function.

| `env` | `Record<string, string \| undefined>` | `process.env` | Environment variables. See [Environment variables](/en/env-vars) for variables the underlying CLI reads. Set `CLAUDE_AGENT_SDK_CLIENT_APP` to identify your app in the User-Agent header |

| `extraArgs` | `Record<string, string \| null>` | `{}` | Additional arguments |

amazon-bedrock +4 -1

Adds an example of the environment variable that requests a 1-hour cache TTL, plus a note on pricing when the TTL is extended.

@@ -244,9 +244,12 @@ export ANTHROPIC_MODEL='arn:aws:bedrock:us-east-2:your-account-id:application-in

# Optional: Disable prompt caching if needed

export DISABLE_PROMPT_CACHING=1

# Optional: Request 1-hour prompt cache TTL instead of the 5-minute default

export ENABLE_PROMPT_CACHING_1H=1

```

<Note>[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) may not be available in all regions.</Note>

<Note>[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) may not be available in all regions. Cache writes with a 1-hour TTL are billed at a higher rate than 5-minute writes.</Note>

#### Map each model version to an inference profile

commands +1 -1

Adds a description of the setting that changes `/fork` so that it invokes the forked subagent feature.

@@ -23,7 +23,7 @@ In the table below, `<arg>` indicates a required argument and `[arg]` indicates

| `/agents` | Manage [agent](/en/sub-agents) configurations |

| `/autofix-pr [prompt]` | Spawn a [Claude Code on the web](/en/claude-code-on-the-web#auto-fix-pull-requests) session that watches the current branch's PR and pushes fixes when CI fails or reviewers leave comments. Detects the open PR from your checked-out branch with `gh pr view`; to watch a different PR, check out its branch first. By default the remote session is told to fix every CI failure and review comment; pass a prompt to give it different instructions, for example `/autofix-pr only fix lint and type errors`. Requires the `gh` CLI and access to [Claude Code on the web](/en/claude-code-on-the-web#who-can-use-claude-code-on-the-web) |

| `/batch <instruction>` | **[Skill](/en/skills#bundled-skills).** Orchestrate large-scale changes across a codebase in parallel. Researches the codebase, decomposes the work into 5 to 30 independent units, and presents a plan. Once approved, spawns one background agent per unit in an isolated [git worktree](/en/common-workflows#run-parallel-claude-code-sessions-with-git-worktrees). Each agent implements its unit, runs tests, and opens a pull request. Requires a git repository. Example: `/batch migrate src/ from Solid to React` |

| `/branch [name]` | Create a branch of the current conversation at this point. Switches you into the branch and preserves the original, which you can return to with `/resume`. Alias: `/fork` |

| `/branch [name]` | Create a branch of the current conversation at this point. Switches you into the branch and preserves the original, which you can return to with `/resume`. Alias: `/fork`. When [`CLAUDE_CODE_FORK_SUBAGENT`](/en/env-vars) is set, `/fork` instead spawns a [forked subagent](/en/sub-agents#fork-the-current-conversation) and is no longer an alias for this command |

| `/btw <question>` | Ask a quick [side question](/en/interactive-mode#side-questions-with-%2Fbtw) without adding to the conversation |

| `/chrome` | Configure [Claude in Chrome](/en/chrome) settings |

| `/claude-api` | **[Skill](/en/skills#bundled-skills).** Load Claude API reference material for your project's language (Python, TypeScript, Java, Go, Ruby, C#, PHP, or cURL) and Managed Agents reference. Covers tool use, streaming, batches, structured outputs, and common pitfalls. Also activates automatically when your code imports `anthropic` or `@anthropic-ai/sdk` |

env-vars +1 -0

Adds a description of the `CLAUDE_CODE_FORK_SUBAGENT` variable, which enables subagents that inherit the conversation context.

@@ -95,6 +95,7 @@ Claude Code supports the following environment variables to control its behavior

| `CLAUDE_CODE_EXIT_AFTER_STOP_DELAY` | Time in milliseconds to wait after the query loop becomes idle before automatically exiting. Useful for automated workflows and scripts using SDK mode |

| `CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS` | Set to `1` to enable [agent teams](/en/agent-teams). Agent teams are experimental and disabled by default |

| `CLAUDE_CODE_FILE_READ_MAX_OUTPUT_TOKENS` | Override the default token limit for file reads. Useful when you need to read larger files in full |

| `CLAUDE_CODE_FORK_SUBAGENT` | Set to `1` to enable [forked subagents](/en/sub-agents#fork-the-current-conversation). A forked subagent inherits the full conversation context from the main session instead of starting fresh. When enabled, `/fork` spawns a forked subagent rather than acting as an alias for [`/branch`](/en/commands), and all subagent spawns run in the background. Interactive mode only |

| `CLAUDE_CODE_GIT_BASH_PATH` | Windows only: path to the Git Bash executable (`bash.exe`). Use when Git Bash is installed but not in your PATH. See [Windows setup](/en/setup#set-up-on-windows) |

| `CLAUDE_CODE_GLOB_HIDDEN` | Set to `false` to exclude dotfiles from results when Claude invokes the [Glob tool](/en/tools-reference). Included by default. Does not affect `@` file autocomplete, `ls`, Grep, or Read |

| `CLAUDE_CODE_GLOB_NO_IGNORE` | Set to `false` to make the [Glob tool](/en/tools-reference) respect `.gitignore` patterns. By default, Glob returns all matching files including gitignored ones. Does not affect `@` file autocomplete, which has its own [`respectGitignore` setting](/en/settings#available-settings) |

google-vertex-ai +4 -1

Tidies the description of automatic prompt caching and adds the 1-hour TTL option.

@@ -158,6 +158,9 @@ export ANTHROPIC_VERTEX_PROJECT_ID=YOUR-PROJECT-ID

# Optional: Disable prompt caching if needed

export DISABLE_PROMPT_CACHING=1

# Optional: Request 1-hour prompt cache TTL instead of the 5-minute default

export ENABLE_PROMPT_CACHING_1H=1

# When CLOUD_ML_REGION=global, override region for models that don't support global endpoints

export VERTEX_REGION_CLAUDE_HAIKU_4_5=us-east5

export VERTEX_REGION_CLAUDE_4_6_SONNET=europe-west1

@@ -165,7 +168,7 @@ export VERTEX_REGION_CLAUDE_4_6_SONNET=europe-west1

Most model versions have a corresponding `VERTEX_REGION_CLAUDE_*` variable. See the [Environment variables reference](/en/env-vars) for the full list. Check [Vertex Model Garden](https://console.cloud.google.com/vertex-ai/model-garden) to determine which models support global endpoints versus regional only.

[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) is automatically supported when you specify the `cache_control` ephemeral flag. To disable it, set `DISABLE_PROMPT_CACHING=1`. For heightened rate limits, contact Google Cloud support. When using Vertex AI, the `/login` and `/logout` commands are disabled since authentication is handled through Google Cloud credentials.

[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) is enabled automatically. To disable it, set `DISABLE_PROMPT_CACHING=1`. To request a 1-hour cache TTL instead of the 5-minute default, set `ENABLE_PROMPT_CACHING_1H=1`; cache writes with a 1-hour TTL are billed at a higher rate. For heightened rate limits, contact Google Cloud support. When using Vertex AI, the `/login` and `/logout` commands are disabled since authentication is handled through Google Cloud credentials.

### 5. Pin model versions

microsoft-foundry +6 -0

Adds guidance on the environment variable that extends the cache TTL from 5 minutes to 1 hour, and on the associated pricing.

@@ -150,6 +150,12 @@ export ANTHROPIC_DEFAULT_HAIKU_MODEL='claude-haiku-4-5'

For current and legacy model IDs, see [Models overview](https://platform.claude.com/docs/en/about-claude/models/overview). See [Model configuration](/en/model-config#pin-models-for-third-party-deployments) for the full list of environment variables.

[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) is enabled automatically. To request a 1-hour cache TTL instead of the 5-minute default, set the following variable; cache writes with a 1-hour TTL are billed at a higher rate:

```bash

export ENABLE_PROMPT_CACHING_1H=1

```

## Azure RBAC configuration

The `Azure AI User` and `Cognitive Services User` default roles include all required permissions for invoking Claude models.

sub-agents +54 -0

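Explains in detail how to operate forked subagents, which inherit the conversation history and tool definitions and run in the background, and how they differ from regular subagents.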

@@ -29,6 +29,7 @@ Claude Code includes several built-in subagents like **Explore**, **Plan**, and

- [How to create your own](#quickstart-create-your-first-subagent)

- [Full configuration options](#configure-subagents)

- [Patterns for working with subagents](#work-with-subagents)

- [Forked subagents](#fork-the-current-conversation)

- [Example subagents](#example-subagents)

## Built-in subagents

@@ -613,6 +614,8 @@ Claude decides whether to run subagents in the foreground or background based on

To disable all background task functionality, set the `CLAUDE_CODE_DISABLE_BACKGROUND_TASKS` environment variable to `1`. See [Environment variables](/en/env-vars).

When [fork mode](#fork-the-current-conversation) is enabled, every subagent spawn runs in the background regardless of the `background` field. Forks still surface permission prompts in your terminal as they occur instead of pre-approving; named subagents follow the pre-approval flow above.

### Common patterns

#### Isolate high-volume operations

@@ -715,6 +718,57 @@ Compaction events are logged in subagent transcript files:

The `preTokens` value shows how many tokens were used before compaction occurred.

## Fork the current conversation

Forked subagents are experimental and require Claude Code v2.1.117 or later. Behavior and configuration may change in future releases. Enable them by setting the [`CLAUDE_CODE_FORK_SUBAGENT`](/en/env-vars) environment variable to `1`.

A fork is a subagent that inherits the entire conversation so far instead of starting fresh. This drops the input isolation that subagents otherwise provide: a fork sees the same system prompt, tools, model, and message history as the main session, so you can hand it a side task without re-explaining the situation. The fork's own tool calls still stay out of your conversation and only its final result comes back, so your main context window stays clean. Use a fork when a named subagent would need too much background to be useful, or when you want to try several approaches in parallel from the same starting point.

Enabling fork mode changes Claude Code in three ways:

- Claude spawns a fork whenever it would otherwise use the [general-purpose](#built-in-subagents) subagent. Named subagents such as Explore still spawn as before.

- Every subagent spawn runs in the [background](#run-subagents-in-foreground-or-background), whether it is a fork or a named subagent. Set `CLAUDE_CODE_DISABLE_BACKGROUND_TASKS` to `1` to keep spawns synchronous.

- The `/fork` command spawns a fork instead of acting as an alias for [`/branch`](/en/commands).
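The three changes listed above can be summarized as a small decision function. This is a toy model of the documented behavior, not Claude Code's actual logic: `plan_spawn` and its return keys are hypothetical, and it assumes that only the value `1` enables either variable.

```python
from typing import Mapping, Optional


def plan_spawn(env: Mapping[str, str], agent: str) -> dict:
    """Toy model of how fork mode changes subagent spawning."""
    # Assumption: only the exact value "1" turns a variable on.
    fork_mode = env.get("CLAUDE_CODE_FORK_SUBAGENT") == "1"
    background_disabled = env.get("CLAUDE_CODE_DISABLE_BACKGROUND_TASKS") == "1"

    # Change 1: a fork replaces only the general-purpose subagent;
    # named subagents such as Explore spawn as before.
    kind = "fork" if fork_mode and agent == "general-purpose" else "subagent"

    # Change 2: in fork mode every spawn runs in the background unless
    # background tasks are disabled. None means "Claude decides".
    background: Optional[bool]
    if background_disabled:
        background = False
    elif fork_mode:
        background = True
    else:
        background = None

    # Change 3: /fork stops being an alias for /branch.
    slash_fork = "spawn fork" if fork_mode else "alias for /branch"

    return {"kind": kind, "background": background, "slash_fork": slash_fork}
```

For instance, `plan_spawn({"CLAUDE_CODE_FORK_SUBAGENT": "1"}, "Explore")` models a named subagent that still spawns as itself but now runs in the background.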

You can start a fork yourself with `/fork` followed by a directive. Claude Code names the fork from the first words of the directive. The following example forks the conversation to draft test cases while you continue with the implementation in the main session:

```text

/fork draft unit tests for the parser changes so far

```

The fork appears in a panel below your prompt and runs in the background while you keep working. When it finishes, its result arrives as a message in your main conversation. The next section covers the panel controls for watching and steering forks while they run.

### Observe and steer running forks

Running forks appear in a panel below the prompt input, with one row for the main session and one for each fork. Use these keys to interact with the panel:

| Key | Action |
| :- | :- |
| `↑` / `↓` | Move between rows |
| `Enter` | Open the selected fork's transcript and send it follow-up messages |
| `x` | Dismiss a finished fork or stop a running one |
| `Esc` | Return focus to the prompt input |

### How forks differ from named subagents

A fork inherits everything the main session has at the moment it spawns. A named subagent starts from its own definition.

| | Fork | Named subagent |
| :- | :- | :- |
| Context | Full conversation history | Fresh context with the prompt you pass |
| System prompt and tools | Same as main session | From the subagent's [definition file](#write-subagent-files) |
| Model | Same as main session | From the subagent's `model` field |
| Permissions | Prompts surface in your terminal | [Pre-approved](#run-subagents-in-foreground-or-background) before launch, then auto-denied |
| Prompt cache | Shared with main session | Separate cache |

Because a fork's system prompt and tool definitions are identical to the parent, its first request reuses the parent's prompt cache. This makes forking cheaper than spawning a fresh subagent for tasks that need the same context.

When Claude spawns a fork through the Agent tool, it can pass `isolation: "worktree"` so the fork's file edits are written to a separate git worktree instead of your checkout.

### Limitations

Fork mode works only in interactive sessions. It is disabled in [non-interactive mode](/en/headless), which includes the Agent SDK. A fork cannot spawn further forks.

## Example subagents

These examples demonstrate effective patterns for building subagents. Use them as starting points, or generate a customized version with Claude.