mirror of https://github.com/anthropics/skills synced 2026-07-04 19:06:54 +00:00

Files

Lance Martin da20c92503 Add Opus 4.8 migration guide and model updates to claude-api skill (#1216 )

* Add Managed Agents self-hosted sandboxes + mid-session agent updates + MCP tool-output offload to claude-api skill

Self-hosted sandboxes: new shared/managed-agents-self-hosted-sandboxes.md for config:{type:"self_hosted"} — agent loop on Anthropic's orchestration, tool execution on customer infra via outbound-polling worker. Covers EnvironmentWorker.run()/.run_one() (Py/TS), ant beta:worker poll/run, mid-level work.poller()/WorkPoller (Py/TS/Go only; Go has no auto_stop opt-out), AgentToolContext/beta_agent_toolset/tool_runner(), monitoring (environments.work.stats/stop — x-api-key, call from outside worker host), runtime deps, cloud-vs-self_hosted delta table, credentials, security ownership split. Cross-refs in environments.md, overview.md (Reading Guide + rewrote cloud-only pitfall), api-reference.md (SDK row + naming-quirks + schema + work REST rows), tools.md (Who-runs-it carve-out), onboarding.md, live-sources.md.

Mid-session agent updates: sessions.update(session_id, agent={tools, mcp_servers}, vault_ids=[...]) — session-local override (doesn't bump agent version), full-replacement semantics, session must be idle. New core.md section + pointers in tools.md, api-reference.md (UpdateSession row), overview.md.

Large MCP tool outputs → files: >100K tokens → automatic offload to sandbox file; agent gets truncated preview + path. Plus: invalid vault credentials don't block sessions.create() — session.error event fires, auth retries on next idle→running. Both in tools.md.

* Point ant CLI install ref to live-sources.md (OSS has no anthropic-cli.md)

* Add Opus 4.8 model migration guide to claude-api skill

* Add prescriptive tool-description guidance for Opus 4.8 to claude-api skill

2026-05-28 22:02:26 -04:00

6.4 KiB

Raw Blame History

Managed Agents — Multiagent Sessions

A coordinator agent can delegate to other agents within one session. All agents share the container and filesystem; each runs in its own thread — a context-isolated event stream with its own conversation history, model, system prompt, tools, MCP servers, and skills (from that agent's own config). Threads are persistent: the coordinator can send a follow-up to a subagent it called earlier and that subagent retains its prior turns.

The SDK sets the managed-agents-2026-04-01 beta header automatically on all client.beta.{agents,sessions}.* calls; no additional header is required for multiagent.

Declare the roster on the coordinator

multiagent is a top-level field on agents.create() / agents.update() — not a tools[] entry. agents lists 1–20 roster entries. Nothing changes on sessions.create() — the roster is resolved from the coordinator's config.

orchestrator = client.beta.agents.create(
    name="Engineering Lead",
    model="claude-opus-4-8",
    system="You coordinate engineering work. Delegate code review to the reviewer and test writing to the test agent.",
    tools=[{"type": "agent_toolset_20260401"}],
    multiagent={
        "type": "coordinator",
        "agents": [
            reviewer.id,                                            # bare string — latest version
            {"type": "agent", "id": test_writer.id, "version": 4},  # pinned version
            {"type": "self"},                                       # the coordinator itself
        ],
    },
)

session = client.beta.sessions.create(agent=orchestrator.id, environment_id=env.id)

Roster entry	Shape	Notes
String shorthand	`"agent_abc123"`	References the latest version of a stored agent.
Agent reference	`{type: "agent", id, version?}`	Omit `version` to pin the latest at coordinator save time.
Self	`{type: "self"}`	The coordinator can spawn copies of itself.

Up to 20 unique agents in the roster; the coordinator may spawn multiple copies of each. One level of delegation only — depth > 1 is ignored.

Threads

The session-level event stream is the primary thread — it shows the coordinator's trace plus a condensed view of subagent activity (thread status transitions and cross-thread messages, not every subagent tool call). Drill into a specific subagent via the per-thread endpoints:

Operation	HTTP	SDK (`client.beta.sessions.threads.*`)
List threads	`GET /v1/sessions/{sid}/threads`	`.list(session_id)`
Retrieve one	`GET /v1/sessions/{sid}/threads/{tid}`	`.retrieve(thread_id, session_id=...)`
Archive	`POST /v1/sessions/{sid}/threads/{tid}/archive`	`.archive(thread_id, session_id=...)`
List thread events	`GET /v1/sessions/{sid}/threads/{tid}/events`	`.events.list(thread_id, session_id=...)`
Stream thread events	`GET /v1/sessions/{sid}/threads/{tid}/stream`	`.events.stream(thread_id, session_id=...)`

Each SessionThread carries id, status (running | idle | rescheduling | terminated), agent (a resolved snapshot of the agent config — id, name, model, system, tools, skills, mcp_servers, version), parent_thread_id (null for the primary thread, which is included in the list), archived_at, and optional stats/usage. Session status aggregates thread statuses — if any thread is running, session.status is running. Max 25 concurrent threads. When draining a per-thread stream, break on session.thread_status_idle (and check its stop_reason as you would for the session-level idle).

Multiagent events (on the session stream)

Event	Payload highlights	Meaning
`session.thread_created`	`session_thread_id`, `agent_name`	A new thread was created.
`session.thread_status_running`	`session_thread_id`, `agent_name`	Thread started activity.
`session.thread_status_idle`	`session_thread_id`, `agent_name`, `stop_reason`	Thread is awaiting input. Inspect `stop_reason` (same shape as `session.status_idle.stop_reason`).
`session.thread_status_rescheduled`	`session_thread_id`, `agent_name`	Thread is rescheduling after a retryable error.
`session.thread_status_terminated`	`session_thread_id`, `agent_name`	Thread was archived or hit a terminal error.
`agent.thread_message_sent`	`to_session_thread_id`, `to_agent_name`, `content`	Coordinator sent a follow-up to another thread.
`agent.thread_message_received`	`from_session_thread_id`, `from_agent_name`, `content`	An agent delivered its result to the coordinator.

Tool permissions and custom tools from subagent threads

When a subagent needs your client (an always_ask confirmation, or a custom tool result), the request is cross-posted to the primary thread with session_thread_id identifying the originating thread — so you only need to watch the session stream. Reply with user.tool_confirmation (carrying tool_use_id) or user.custom_tool_result (carrying custom_tool_use_id), and echo the session_thread_id from the originating event (the SDK param type and docstring expect it). The server also routes by the tool-use ID, so the echo is belt-and-suspenders rather than load-bearing — but include it.

for event_id in stop.event_ids:
    pending = events_by_id[event_id]
    confirmation = {
        "type": "user.tool_confirmation",
        "tool_use_id": event_id,
        "result": "allow",
    }
    if pending.session_thread_id is not None:
        confirmation["session_thread_id"] = pending.session_thread_id
    client.beta.sessions.events.send(session.id, events=[confirmation])

The same pattern applies to user.custom_tool_result.

Pitfalls

Don't put the roster on sessions.create() or in tools[]. multiagent is a top-level agent field; update the coordinator, then start a session that references it.
Don't assume shared context. Threads share the filesystem but not conversation history or tools. If the coordinator needs a subagent to act on something, it must say so in the delegated message (or write it to disk).
Depth > 1 is ignored. A subagent's own multiagent roster (if any) doesn't cascade — only the session's coordinator delegates.

For per-language bindings beyond Python, WebFetch https://platform.claude.com/docs/en/managed-agents/multi-agent.md (see shared/live-sources.md).

6.4 KiB Raw Blame History Unescape Escape