Anthropic just announced Dreaming for Claude Managed Agents at the Code with Claude developer conference. Dreaming is a scheduled background process that reviews each agent’s past sessions, extracts patterns, and writes structured playbooks that the agent’s future sessions reference automatically. The Anthropic Dreaming feature is billed as the first production-grade self-improvement capability shipped for managed agents at scale; early adopter Harvey reported a 6x improvement in completion rates after its agents accumulated learnings between sessions. Anthropic also moved Outcomes (structured success measurement) and multi-agent orchestration from research preview into public beta. Together the three features mark a substantive maturation of Anthropic’s agent platform: agents that remember, agents that measure their own performance, and agents that coordinate with other agents under bounded autonomy.
What’s actually new
Dreaming runs as a scheduled background process — not a real-time inference path — that reviews an agent’s recent session history, identifies patterns the agent encountered, and produces two artifacts: plain-text learning notes (informal observations the agent can reference) and structured playbooks (machine-readable patterns for specific situations). Future sessions retrieve the relevant learnings as context, which lets the agent skip mistakes it has already made and apply techniques that have already worked. The pattern is analogous to how human professionals consolidate experience into expertise — except it happens automatically, between sessions, without manual prompt engineering or fine-tuning.
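The consolidation loop described above can be pictured with a small sketch. This is not Anthropic’s implementation or API: the names `SessionEvent`, `Playbook`, `consolidate`, and `context_for` are hypothetical, and simple frequency counting stands in for whatever pattern extraction Dreaming actually performs. It only shows the shape of the idea: an offline pass turns session history into notes and playbooks, and a later session pulls the relevant one in as context.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SessionEvent:
    """One hypothetical observation from a past session."""
    situation: str   # e.g. "scanned_pdf"
    action: str      # what the agent tried
    succeeded: bool  # whether that attempt worked


@dataclass
class Playbook:
    """Structured, machine-readable pattern for one situation."""
    situation: str
    recommended_action: str
    avoid: list[str]
    evidence: int  # successful observations behind the recommendation


def consolidate(events, min_evidence=2):
    """Offline pass: turn raw events into plain-text notes and playbooks."""
    wins, losses = defaultdict(Counter), defaultdict(Counter)
    for e in events:
        (wins if e.succeeded else losses)[e.situation][e.action] += 1

    notes, playbooks = [], {}
    for situation, counts in wins.items():
        action, n = counts.most_common(1)[0]
        if n < min_evidence:
            continue  # too little evidence to promote to a playbook
        avoid = sorted(a for a in losses[situation] if a != action)
        playbooks[situation] = Playbook(situation, action, avoid, n)
        notes.append(f"For '{situation}', '{action}' worked {n} times.")
    return notes, playbooks


def context_for(situation, playbooks):
    """Session start: fetch the relevant learning as prompt context."""
    pb = playbooks.get(situation)
    if pb is None:
        return ""
    avoid = f" Avoid: {', '.join(pb.avoid)}." if pb.avoid else ""
    return f"Known approach for {pb.situation}: {pb.recommended_action}.{avoid}"


history = [
    SessionEvent("scanned_pdf", "run_ocr", True),
    SessionEvent("scanned_pdf", "run_ocr", True),
    SessionEvent("scanned_pdf", "parse_text_layer", False),
]
notes, playbooks = consolidate(history)
print(context_for("scanned_pdf", playbooks))
```

The `min_evidence` threshold is an assumption of this sketch; it illustrates why a scheduled batch pass over many sessions is a natural fit, since a single session rarely provides enough evidence to promote a pattern.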
The key design choice is observability. Both the plain-text notes and structured playbooks are human-readable, version-controlled, and auditable. Operators can review what their agent has learned, edit incorrect or undesirable patterns, and roll back learnings if needed. The transparency addresses one of the chronic concerns about self-improving AI — opaque internal state that drifts without operator awareness. Dreaming makes the drift visible and controllable.
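A minimal sketch of what "version-controlled, editable, and reversible" can mean in practice follows. The `LearningStore` class is hypothetical and says nothing about how Anthropic stores learnings; it assumes learnings are a simple key-to-text mapping. The point it illustrates is that a rollback can itself be recorded as a new version, so the audit trail is never rewritten.

```python
import copy


class LearningStore:
    """Append-only version history for an agent's learnings (hypothetical)."""

    def __init__(self):
        self._versions = [{}]  # version 0: nothing learned yet

    @property
    def version(self):
        """Index of the latest version."""
        return len(self._versions) - 1

    def current(self):
        """Return a copy so callers cannot mutate history in place."""
        return copy.deepcopy(self._versions[-1])

    def commit(self, learnings):
        """Record a new snapshot and return its version number."""
        self._versions.append(copy.deepcopy(learnings))
        return self.version

    def edit(self, key, text):
        """Operator correction: add or overwrite one learning."""
        updated = self.current()
        updated[key] = text
        return self.commit(updated)

    def remove(self, key):
        """Operator correction: drop an undesirable learning."""
        updated = self.current()
        updated.pop(key, None)
        return self.commit(updated)

    def rollback(self, version):
        """Restore an earlier snapshot as a new version, keeping history."""
        if not 0 <= version <= self.version:
            raise ValueError(f"unknown version {version}")
        return self.commit(self._versions[version])


store = LearningStore()
good = store.edit("scanned_pdf", "Run OCR before parsing.")
store.edit("scanned_pdf", "Skip scanned PDFs entirely.")  # undesirable drift
store.rollback(good)
print(store.current())
```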
Outcomes is the second major announcement. Each managed agent now has a structured outcome model — what counts as success, what counts as failure, what intermediate signals matter. The agent itself reports against this model after each session. Outcomes data feeds into Dreaming so the agent learns to optimize for actual success rather than surface metrics like task completion or response length.
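To make the distinction between "the task finished" and "the task succeeded" concrete, here is a rough sketch of an outcome model. The `OutcomeModel` class, its field names, and the string labels are all assumptions made for illustration; the real Outcomes schema is not described in this article.

```python
from dataclasses import dataclass, field


@dataclass
class OutcomeModel:
    """Hypothetical structured definition of success for one agent."""
    success_criteria: list[str]
    failure_modes: list[str]
    intermediate_signals: list[str] = field(default_factory=list)

    def evaluate(self, observed):
        """Score one session's observed facts against the model."""
        observed = set(observed)
        failures = [f for f in self.failure_modes if f in observed]
        unmet = [c for c in self.success_criteria if c not in observed]
        signals = [s for s in self.intermediate_signals if s in observed]
        return {
            # Success requires every criterion met and no failure mode hit.
            "success": not failures and not unmet,
            "failures": failures,
            "unmet_criteria": unmet,
            "signals_seen": signals,
        }


model = OutcomeModel(
    success_criteria=["memo_delivered", "citations_verified"],
    failure_modes=["hallucinated_citation"],
    intermediate_signals=["sources_retrieved"],
)
# The session completed the task but never verified its citations.
report = model.evaluate({"memo_delivered", "sources_retrieved"})
print(report["success"], report["unmet_criteria"])
```

In this example the session delivered its memo, so a completion metric would count it as a win, while the outcome report flags the unmet `citations_verified` criterion. That gap is the signal a learning process would want to train on.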
Multi-agent orchestration, the third announcement, moves into public beta. Claude Managed Agents can now invoke other Claude Managed Agents through the standard tool-use interface, with the broader Anthropic platform handling identity, authorization, conversation context, and observability across the agent network. Together with the Agent2Agent (A2A) protocol’s recent Linux Foundation governance milestone, this makes multi-agent workflows that span vendors increasingly tractable.
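The agent-as-tool pattern can be sketched in plain Python. Nothing here calls a real Anthropic API: `Planner`, `invoke`, and the specialist functions are hypothetical stand-ins, with an allow-list playing the role of platform authorization and a list of records playing the role of shared conversation context.

```python
from typing import Callable

# A specialist is modeled as a function from a task description to a result.
Specialist = Callable[[str], str]


class Planner:
    """Hypothetical planner that calls specialists as if they were tools."""

    def __init__(self, specialists: dict[str, Specialist], allowed: set[str]):
        self._specialists = specialists
        self._allowed = allowed          # bounded autonomy: explicit allow-list
        self.context: list[dict] = []    # results flow back into this context

    def invoke(self, name: str, task: str) -> str:
        """Run one specialist if authorized and record the exchange."""
        if name not in self._allowed:
            raise PermissionError(f"planner may not call '{name}'")
        result = self._specialists[name](task)
        self.context.append({"tool": name, "input": task, "output": result})
        return result


def research_agent(task: str) -> str:
    return f"findings for: {task}"


def billing_agent(task: str) -> str:
    return f"invoice for: {task}"


planner = Planner(
    specialists={"research": research_agent, "billing": billing_agent},
    allowed={"research"},
)
print(planner.invoke("research", "precedents on non-compete clauses"))
```

Keeping authorization and context recording in one choke point mirrors the division of labor the announcement describes, where the platform rather than each agent handles those concerns.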
The Harvey adoption story is the main evidence offered for the capability. Harvey is the legal AI startup whose agents handle research, contract analysis, and similar workflows for major law firms. A 6x completion-rate gain after adopting Dreaming is the kind of step change that justifies Anthropic’s investment in the feature. The gain comes from agents remembering filetype workarounds (a deposition transcript needs different handling than a contract), tool-specific patterns (an internal eDiscovery system has quirks worth remembering), and matter-specific context that previously had to be re-explained in every session.
Why it matters
- Self-improvement just became operational. Production AI agents that get better over time without retraining or manual intervention represent a capability shift. The 6x completion-rate improvement Harvey reported is the kind of multiplier that changes deployment economics.
- Observable self-improvement addresses the trust problem. Agents that learn opaquely produce trust erosion as operators wonder what the agent has internalized. Plain-text notes and structured playbooks that operators can read, edit, and version make self-improvement transparent.
- Outcomes as a first-class concept changes agent evaluation. Agents optimizing for “task completion” produce surface metrics; agents optimizing for measurable outcomes produce real impact. The Outcomes feature gives operators the framework to define what success means for each agent and measure it consistently.
- Multi-agent orchestration is now in public beta on Anthropic’s platform. Combined with A2A’s Linux Foundation governance, this largely settles the multi-agent infrastructure question for 2026 deployments: the operational patterns are documented and the infrastructure is maturing.
- The agent platform competition narrows. Anthropic’s Claude Managed Agents now has Dreaming, Outcomes, and multi-agent orchestration as differentiating features. OpenAI’s Agent SDK, Microsoft Copilot Studio, and Google’s Gemini Enterprise will need competitive responses through 2026 to maintain parity.
- Domain-specific agents become more economically viable. The gains from Dreaming should be largest for agents handling complex, idiosyncratic workflows where session-to-session learning compounds. Legal, healthcare, financial, scientific research, and similar verticals are the natural early adopters.
How to use Anthropic Dreaming today
Dreaming is available as a research preview in the Claude Console for organizations with Claude Managed Agents access. Outcomes and multi-agent orchestration are in public beta. Three steps get a development team started with the new features.
- Enable Dreaming on an existing managed agent. In the Claude Console, navigate to your agent’s configuration and toggle the Dreaming feature. Configure the dreaming schedule (nightly usually suits high-volume agents; weekly works for lower-volume ones). Set retention policies for the learning notes and playbooks.
- Define Outcomes for each agent. Articulate what success means for the agent in structured form — completion criteria, intermediate signals, failure modes. The Outcomes definition becomes part of the agent’s configuration and feeds into Dreaming for goal-aligned learning.
- Compose multi-agent workflows. For workflows requiring multiple specialized agents, define each agent with its own scope, capabilities, and tools. Use the platform’s orchestration to compose them — the planner agent invokes specialist agents through tool calls, with results flowing back through the conversation context.
API integration follows the existing Claude Managed Agents pattern with new configuration fields: