Monthly Benchmark Preview

May 2026 Reliability Trend Preview — Refreshed

This preview extends the April baseline with early May trend checks across 22 platforms under methodology v1.2. Use it for shortlist adjustments and pilot-risk planning before the full month close.

Published: 2026-04-05 | Report window: 2026-05 | Protocol: v1.2 | Universe: 22 platforms.

Protocol Context

How This Preview Was Generated

Dimension | Value
Methodology version | v1.2 reliability-weighted protocol
Critical run requirement | At least 3 repeated clean runs per critical workflow
Preview scope | Early-month trend checks against April baseline conditions
Publication blockers | Critical drift, unresolved connection leaks, missing no-buy criteria
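
The run requirement and publication blockers above can be read as a simple gate. The sketch below is illustrative only: the PlatformPreview record, its field names, and the publication_problems helper are hypothetical and are not part of the v1.2 protocol tooling. Only the threshold of 3 clean runs and the three blocker labels come from the table.

```python
from dataclasses import dataclass, field

# From the protocol table: at least 3 repeated clean runs per critical workflow.
MIN_CLEAN_RUNS = 3

# Blocker labels copied from the "Publication blockers" row.
PUBLICATION_BLOCKERS = frozenset({
    "critical drift",
    "unresolved connection leaks",
    "missing no-buy criteria",
})


@dataclass
class PlatformPreview:
    """Hypothetical record of one platform's preview evidence."""
    name: str
    clean_runs: dict[str, int]  # critical workflow -> repeated clean run count
    open_issues: set[str] = field(default_factory=set)


def publication_problems(p: PlatformPreview) -> list[str]:
    """Return the reasons this platform would block publication (empty if none)."""
    problems = [
        f"{workflow}: {runs} clean runs, need {MIN_CLEAN_RUNS}"
        for workflow, runs in sorted(p.clean_runs.items())
        if runs < MIN_CLEAN_RUNS
    ]
    # Only issues that match a listed publication blocker count here.
    problems += [
        f"blocker: {issue}"
        for issue in sorted(p.open_issues & PUBLICATION_BLOCKERS)
    ]
    return problems


if __name__ == "__main__":
    example = PlatformPreview(
        name="platform-x",  # placeholder name, not a benchmarked platform
        clean_runs={"login": 3, "checkout": 2},
        open_issues={"critical drift"},
    )
    print(publication_problems(example))
```

Returning a list of reasons rather than a single pass/fail flag keeps each failed condition visible for the evidence trail.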

Headline Signals

Early May Trend Snapshot

9 platforms with Level A trend evidence so far
8 platforms with Level B caveat-bound stability
5 platforms still showing at least one no-buy blocker
+1 net improvement versus April Level A count

Delta vs April

Trend Comparison Matrix

Metric | April 2026 | May preview | Trend note
Level A candidates | 8 | 9 | Moderate improvement in repeated-session consistency
Level B candidates | 9 | 8 | One candidate moved up to Level A after leak fixes
No-buy blockers | 5 | 5 | Blockers persist in the same high-risk profiles
Governance drift flags | 14 | 12 | Role-policy hardening reduced handoff error risk

Preview metrics are directional and will be finalized in the month-close release.
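
As a quick arithmetic check on the matrix, the sketch below computes the May-minus-April delta for each metric and confirms that the three platform tiers add up to the 22-platform universe in both months. It assumes that the "No-buy blockers" row counts platforms and that Level A, Level B, and no-buy platforms partition the universe; the matrix is consistent with this reading but does not state it. The function names are illustrative.

```python
UNIVERSE = 22  # platforms covered by this preview

# (April 2026, May preview) values copied from the trend comparison matrix.
METRICS = {
    "Level A candidates": (8, 9),
    "Level B candidates": (9, 8),
    "No-buy blockers": (5, 5),
    "Governance drift flags": (14, 12),
}

# Assumption: these three rows partition the platform universe.
TIER_ROWS = ("Level A candidates", "Level B candidates", "No-buy blockers")


def deltas(metrics):
    """May preview minus April for each metric."""
    return {name: may - april for name, (april, may) in metrics.items()}


def tiers_cover_universe(metrics, month_index, universe=UNIVERSE):
    """True if the tier rows sum to the universe (0 = April, 1 = May preview)."""
    return sum(metrics[row][month_index] for row in TIER_ROWS) == universe


if __name__ == "__main__":
    for name, delta in deltas(METRICS).items():
        print(f"{name}: {delta:+d}")
    print("April tiers sum to universe:", tiers_cover_universe(METRICS, 0))
    print("May tiers sum to universe:", tiers_cover_universe(METRICS, 1))
```

The Level A delta of +1 matches the headline figure, and the Level B delta of -1 matches the note that one candidate moved up.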

Primary Risks Still Open

  • Persistent DNS narrative drift in unstable proxy pools.
  • Worker and main-thread mismatch in long-lived sessions.
  • Governance slippage in teams scaling seat count quickly.

What Improved

  • Cleaner API lifecycle behavior for top shortlist candidates.
  • Lower rollback frequency in controlled pilot environments.
  • Better evidence discipline via checklist and pack tooling adoption.

Action Path

How to Use This Preview Safely

Step 1: Keep the April baseline as the reference, then compare each candidate against this May preview trend.
Step 2: Run readiness scoring before changing shortlist priority.
Step 3: Run the ops SOP gates and freeze the no-buy criteria for each candidate.
Step 4: Export evidence packs for approval and procurement traceability.
Step 5: Run the release checker before the final month-close publication.

Traceability

Evidence and Governance Links