← Latest Update

Agentic Engineering: Debug, Port, Harden, Distill

What is agentic engineering? Simon Willison outlines agentic engineering patterns that shift engineers from writing code to setting goals, building tooling, and enforcing verification. Outcome engineers should adopt these practices to design agents that iterate, prove results, and integrate validation into delivery (Principles 01, 03, 16).

Let your Coding Agent debug the browser session with Chrome DevTools MCP Google enables coding agents to attach to live browser sessions with user permission so agents can inspect DOM, network, and console state. This turns the browser into a live instrument for agent-driven debugging and verification—valuable when you need agents to reproduce and fix UI-level bugs (Principles 03, 15).

We Ran Real Attacks Against Our RAG Pipeline. Here’s What Actually Stopped Them. Amin Rj red-teams a ChromaDB+LM Studio RAG stack and demonstrates three effective defenses: embedding anomaly detection, access‑controlled retrieval, and prompt hardening. Put these controls into your retrieval and gate layers to harden agentic pipelines before production (Principles 14, 15).

Porting Software Has Been Trivial for a While — Here’s How You Do It. Geoffrey Huntley shows how ‘Ralph’ loops compress tests, cite implementations, and produce spec-first artifacts that let agents port and refactor codebases automatically. Use spec-driven artifacts to make systems agent-actionable and to preserve ground truth across language or platform migrations (Principles 06, 11).

Tree Search Distillation for Language Models Using PPO The author distills MCTS-guided tree-of-thought reasoning into a 1.5B LM via online PPO, improving Countdown performance by ~8.2 percentage points. That pathway helps outcome engineers deploy smaller, faster models that retain deliberative reasoning—useful for production agents where latency and cost matter (Principles 06, 16).