Agents as Infrastructure — On-device models, context graphs, and security

Flash-MoE: Running a 397B Parameter Model on a MacBook Pro with 48GB RAM. The project runs a 397B MoE model on a 48GB MacBook Pro via SSD weight streaming and hand-tuned Metal kernels, claiming production-quality tool-calling. If you build agents, this changes deployment assumptions—weight streaming and efficient kernels make heavy models viable at the edge, forcing a rethink of latency, cost and data locality (Principle 07).

Microsoft outlines agentic AI security strategy with new Defender, Entra and Purview capabilities. Microsoft positions agents as a core security layer and adds Defender, Entra and Purview defenses for enterprise agentic AI. Outcome engineers must treat identity, endpoint and data governance as first-class concerns when designing agent platforms to avoid unchecked actions and data leaks (Principle 10, Principle 14).

Exclusive: Interloom raises $16.5M to capture ‘tacit knowledge’ and power AI agents. Interloom is building a continuous context graph that maps tacit operational knowledge to power enterprise agents and automate decisions. For practitioners, a context graph becomes the substrate for agent memory, routing and authorization—evaluate it as your canonical source of corporate state and intent (Principle 06, Principle 11).

Agentic AI business applications are here – scaling them from experimentation to production is the next step. The piece argues enterprises must treat agentic AI as first-class IT, adding governance, modular architecture and human checkpoints to scale prototypes into production. Use this as a short checklist: orchestration, auditability, human-in-the-loop controls and modular services belong in your design from day one (Principle 09, Principle 15).

The visibility gap holding back the agentic SOC. The article shows agents fail in SOCs when poor network visibility denies necessary context—observability is a prerequisite for effective agentic defense. If you ship security or monitoring agents, invest in telemetry and context engineering first; without sufficient visibility agents will make high-impact blind decisions (Principle 06, Principle 11).