CrestingRadarGet featured →

Rank #859 · on radar since 2026-07-03

eval-harness

Behavior-regression testing for LLM agents. 4-class attribution, 6-field FAIL schema, $-cost gating, flaky detection. Bash + jq. Works with opencode today, runner-pluggable.

Visit homepage ↗evalbashdeveloper-toolsopencode+10GitHub

Momentum

42.2
24h7d

Why it's ranked

Every score decomposes into published factors — the same math for every tool, paid or not. Read the methodology →

Velocity (weighted, cohort-normalized)0.438
Signal decay0.995
Corroboration1.000
Quality gate1.000

Raw signals (30 days)

github · forks4 latest · 2 snapshots
github · stars5 latest · 2 snapshots