Durable eval ledger
Runs are written through the API Warehouse sandbox eval store with capped stdout and stderr previews. Full execution evidence stays linked through sandbox sessions.
This view reads the Stage 7 eval ledger for agent, prompt, provider, and workflow scorecards produced by governed E2B sandbox batches.
This shell inherits authenticated page-route posture and is not exposed as a public marketing surface.
Runner registration and connector mutation stay blocked; signed Mission approval issuing, sandbox eval summaries, cost visibility, and operator sandbox interrupts are live.
Published /api/v2/os9 fleets, runners, approvals, and mission contracts use route-level auth and explicit OpenAPI registration; broader OS9 APIs remain closed.
No summary loaded.
0 passed, 0 failed cases.
Weighted by case count across stored eval runs.
Actual cost when present, otherwise estimated sandbox cost.
Runs are written through the API Warehouse sandbox eval store with capped stdout and stderr previews. Full execution evidence stays linked through sandbox sessions.