Agent Ops: orchestration, runtimes, tiny models, and state machines

Needle — Distilled Gemini Tool Calling into a 26M Model distills Gemini tool-calling into an open 26M Simple Attention Network you can run and fine-tune locally. This lowers latency and cost for tool-enabled agents and lets outcome engineers iterate on tool-calling behavior on-device instead of at cloud scale (Principles 07, 06).

Statewright — Visual state machines that make AI agents reliable enforces state-machine guardrails to constrain tool access per workflow phase and visualize agent transitions. Use it to codify workflow phases, permissions, and recovery paths so agents stay within intended boundaries and become auditable (Principles 07, 10, 14).

Productive Launches 5.0 With Autonomous AI Agents adds shareable autonomous Agents and reusable Skills to automate operational workflows across the platform. This is a concrete example of packaging agent capabilities for teams—plan for Skill governance, versioning, and human handoff when you expose agents to non-engineering users (Principle 09).

Huang and McDermott back OpenShell for enterprise AI positions OpenShell as an open-source secure runtime adopted by enterprises like ServiceNow and LangChain. A hardened runtime changes your threat model: focus on vetted execution environments, policy hooks, and provenance tracking rather than ad-hoc agent sandboxes (Principles 07, 10).

Orchestration Outweighs Model Wars in AI Infrastructure argues that routing, governance, and unified observability determine reliability for multi-model, agent-driven deployments. For outcome engineering, invest in orchestration layers—policy routing, metrics that map to outcomes, and cross-model observability—because they scale impact more than marginal model gains (Principles 09, 12).