Reviewers & improvement

Reviewer agents, rollouts, operator corrections becoming versioned StrategyRules.

Harness Improvement Loops Need Replayable Environments

Why harness improvement needs replayable episodes, bounded mutations, scorecards, source closure, and promotion gates.

How ContextOS treats autotune as a gated loop over traces, scorecards, replay sets, bounded candidates, approval, and rollout.

How to build a compliance reviewer agent with a typed verdict envelope, rubric, golden set, and change-control queue.

How to extend the reviewer pattern for reliability: timeouts, retries, idempotency, fallback behavior, and rollback declarations.

A five-stage rollout model for Context Packs: shadow, internal, low-risk, monitored expansion, full release, and rollback.

How one operator correction becomes a reviewed, replayed, versioned StrategyRule that prevents repeat agent failures.