Agent orchestration, attestation, and standards

Andrej Karpathy talks about “Claws” introduces Claws, a personal-agent layer of containerized, schedulable, message-driven systems that run locally to orchestrate and persist agent workflows. Outcome engineers should view Claws as a pattern for building composable local runtimes that keep state, scheduling, and message flows explicit — practical alignment with Principle 09 and Principle 07.

Cord: Coordinating Trees of AI Agents demonstrates a runtime for agents to build and execute dynamic task trees with dependencies, parallelism, and live human questions. This directly changes how you decompose problems and propagate context across agent teams, making dynamic orchestration and the agent graph first-class concerns (Principle 09, Principle 11).

Interview with Notion CEO Ivan Zhao — custom Notion AI agents launching, agents build 50%+ of databases reveals Notion’s custom agents are already creating more than half of new databases and shipping real product artifacts. Treat this as a production case study: design agents to produce verifiable artifacts and human-review checkpoints if you want agents to own deliverables (Principle 03, Principle 09).

NIST agentic AI initiative looks to get handle on security launches a program to define standards and testing for agentic AI to reduce systemic cyber risk. For outcome engineers, this signals imminent baseline requirements for security, evaluation, and compliance — integrate adversarial testing and threat models into agent pipelines now (Principle 10, Principle 14).

How Tinfoil Proves Exactly What Model Is Running describes Modelwrap, which cryptographically binds published weights to a running server and attests to the exact model serving requests. Use this technique to guarantee model identity and enable auditable inference in production systems — a concrete tool for Ground Truth and provable integrity (Principle 02, Principle 07).