Agents in Production: safety, outages, and orchestration

New Research Shows AI Agents Are Running Wild Online, With Few Guardrails in Place
MIT CSAIL finds that many deployed AI agents ship with minimal safety frameworks and scarce disclosure, and that browser agents operate with high autonomy and risk. Outcome engineers must treat deployed agents as first-class products with explicit disclosure, runtime monitoring, and layered safeguards (Principles 14, 15).

State of Agentic AI Report: Key Findings
Docker’s global survey finds widespread agent deployments, gaps in security and orchestration, and containers serving as the foundational substrate for scaling agents in the enterprise. The findings point to container-based orchestration as a pragmatic platform for agents, and they are a warning to outcome engineers to prioritize orchestration, security posture, and auditability (Principles 09, 16).

Minions: Stripe’s one-shot, end-to-end coding agents — Part 2
Stripe shows its Minions agents autonomously generating end-to-end code changes at scale, producing thousands of pull requests while humans act as review checkpoints. The pattern is a useful model: agents as high-throughput delivery lanes with human gates, CI integration, and clear artifact requirements (Principle 09).
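The human-gate pattern can be sketched in a few lines. This is a minimal illustration, not Stripe's implementation: the `AgentPullRequest` fields, the artifact checks, and the function names are all hypothetical stand-ins for whatever a real CI and review system reports.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AgentPullRequest:
    # Hypothetical fields; a real system would read these from CI and review tools.
    description: str
    ci_passed: bool
    tests_changed: bool
    human_approved: bool = False


def missing_artifacts(pr: AgentPullRequest) -> List[str]:
    """List the required artifacts an agent-authored PR still lacks."""
    problems = []
    if not pr.description.strip():
        problems.append("description")
    if not pr.ci_passed:
        problems.append("passing CI")
    if not pr.tests_changed:
        problems.append("tests")
    return problems


def can_merge(pr: AgentPullRequest) -> bool:
    """Merge only when every artifact is present AND a human has approved."""
    return not missing_artifacts(pr) and pr.human_approved


if __name__ == "__main__":
    pr = AgentPullRequest(description="Fix retry logic", ci_passed=True, tests_changed=True)
    print(can_merge(pr))        # the agent alone cannot merge
    pr.human_approved = True
    print(can_merge(pr))        # the human checkpoint releases the change
```

The design choice worth noting is that the agent can satisfy every automated check and still not merge; the approval flag is set only by a reviewer.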

Sources: Amazon’s AI tools caused at least two AWS outages, including a 13-hour December disruption after Kiro AI deleted and recreated an environment
According to the sources, the 13-hour December disruption followed Kiro AI deleting and recreating an environment. Outcome engineers must design least-privilege controls, sandboxing, and reversible actions to limit the blast radius of agent actions and preserve operational resilience (Principles 14, 15).
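One way to picture least privilege combined with a gate on irreversible actions is a small authorization check placed in front of every agent action. The sketch below is an assumption-laden illustration, not a description of Amazon's or Kiro's controls: the action names, `AgentPolicy`, and `authorize` are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical action names; a real deployment would map these to its own API.
DESTRUCTIVE_ACTIONS = frozenset({"delete_environment", "recreate_environment"})


@dataclass(frozen=True)
class AgentPolicy:
    """Least-privilege policy: the agent may only run actions listed here."""
    allowed_actions: frozenset
    # Destructive actions additionally need explicit human approval.
    require_approval_for: frozenset = DESTRUCTIVE_ACTIONS


@dataclass
class Decision:
    allowed: bool
    reason: str


def authorize(policy: AgentPolicy, action: str, human_approved: bool = False) -> Decision:
    """Decide whether the agent may run `action` under `policy`."""
    if action not in policy.allowed_actions:
        return Decision(False, f"'{action}' is not in the agent's allowlist")
    if action in policy.require_approval_for and not human_approved:
        return Decision(False, f"'{action}' is destructive and needs human approval")
    return Decision(True, "ok")


if __name__ == "__main__":
    policy = AgentPolicy(allowed_actions=frozenset({"read_logs", "delete_environment"}))
    for action in ("read_logs", "delete_environment", "drop_database"):
        print(action, authorize(policy, action))
```

Denying by default keeps the blast radius bounded by the allowlist, and the approval requirement means a destructive call cannot proceed on the agent's authority alone.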

Making frontier cybersecurity capabilities available to defenders
Anthropic debuts Claude Code Security, an AI scanner that finds complex vulnerabilities, verifies its findings, and suggests patches for human review. It demonstrates a practical integration of AI into security workflows: adopt automated scanning with verification steps and human-in-the-loop remediation to keep outcomes auditable and safe (Principles 03, 14).