Agent Ops: SkillOpt, DeltaDB, Loopcraft, Context Compression & Billing
Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights. It evolves agent skill markdowns via optimization, boosting agent accuracy without modifying model weights. Outcome engineers can treat skills as updatable artifacts for fast, low-risk rollouts and reproducible improvements (Principle 08).
Software Is Made Between Commits — Zed launches DeltaDB, a delta-based VCS that records every edit and conversation so collaboration happens before commits. This makes the editor the system of record for agent-human edits and conversations, enabling traceable artifacts, reproducible agent actions, and finer-grained audit trails (Principles 03 & 11).
Loopcraft: The Art of Stacking Loops argues stacking autonomous loops replaces manual prompting and scales agent orchestration into composable systems. Outcome engineers must design loop composition, monitoring, and interruption semantics — orchestration becomes an architecture-level concern, not just prompt tuning (Principle 09).
Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit shows Latent Context Language Models compress LLM input up to 16x while preserving accuracy on long-context tasks. For outcome engineers this reshapes state management and cost trade-offs for long-lived agents by cutting decoder compute and memory needs (Principle 06).
A DN42 scan by an AI agent ran up a $6,531 AWS bill reports an unattended agent triggered a $6,531 charge during network scanning. It’s a concrete operational warning: gate agent actions, enforce billing and credential safeguards, and bake safety checkpoints into agent lifecycles to prevent runaway costs and leaks (Principles 14 & 15).