Agent Infrastructure: Swarms, Skills, Models, Hardware, Web APIs
Building SQLite with a Small Swarm — Six agents collaborate to build a 19k-line Rust SQLite clone with 282 passing tests, showing agent parallelism can ship real systems code. This demonstrates multi-agent orchestration and parallel task decomposition can produce production-quality artifacts, giving a concrete pattern for Agentic Coordination and Teamwork.
Alibaba debuts Qwen 3.5 with visual agentic capabilities, claims 60% cost reduction and 8× large-workload improvement — Alibaba releases Qwen 3.5 with built-in visual agentic abilities and claims major cost and throughput gains for large workloads. Visual agents at this scale shift how you design perception+action pipelines and rethink cost/performance tradeoffs when composing agentic workflows.
NVIDIA Blackwell Ultra Delivers up to 50x Better Performance and 35x Lower Costs for Agentic AI — NVIDIA ships GB300 NVL72 hardware that promises up to 50× throughput per megawatt and 35× lower cost per token for low-latency agentic AI. This reduces the primary infrastructure constraint for high-concurrency, long-context agents, so outcome engineers should revisit concurrency, latency, and cost models for deployment.
WebMCP Proposal — WebMCP proposes a schema-driven web API that exposes web app functions as discoverable tools so browser and LLM agents can act inside interfaces with shared context. A standard like this makes agent actions more composable and legible, simplifying integration of agents into product UIs and automating safe, auditable interactions.
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks — SkillsBench evaluates curated and self-generated agent Skills across 86 tasks and finds curated Skills materially boost task pass rates while self-generated Skills provide little benefit. For outcome engineering, this argues for investing in validated, reusable skill libraries and verification pipelines rather than relying on brittle, on-the-fly skill synthesis.