Agent Infrastructure: memory, models, reliability, and provable outputs

The Multi-Model Database for AI Agents: Deploy SurrealDB with Docker Extension shows SurrealDB packaging vectors, graphs, documents, and relations into one low-latency engine via a Docker extension. Outcome engineers get a compact, production-ready context store for agent memory and RAG pipelines, removing brittle glue between vector stores and graph backends — Principle 11 (The Graph) and 06 (Legible Landscapes).

Temporal raises $300M to scale AI agent reliability platform reports a $300M raise to expand a cloud platform focused on making AI agents reliable. This signals infrastructure-level tooling for monitoring, retries, and failure isolation — the immune-system primitives outcome engineers need to keep agent fleets healthy and auditable (Principle 14).

Anthropic launches Claude Sonnet 4.6 with coding and consistency improvements, plus 1M-token context window in beta releases Sonnet 4.6 with coding improvements and a beta 1M-token context window. A true million‑token context shifts how you design agent memories, long-horizon planning, and tool chaining — update your context-engineering and state management patterns (Principle 06).

Models That Prove Their Own Correctness describes self‑proving models that output interactive proofs certifying correctness per input. Per‑input guarantees change validation from sampling metrics to verifiable artifacts you can audit and gate, directly feeding Principle 16 (Audit the Outcomes) and reshaping deployment contracts.

A Guide to Which AI to Use in the Agentic Era argues that model choice is inseparable from the harness and orchestration, since identical models behave differently in different harnesses. For outcome engineers that means benchmarking models in your actual harnesses and orchestration layers rather than on paper — a practical nudge toward Principle 09 (Agentic Coordination) and Principle 06 (Legible Landscapes).