Cresting

Rank #92 · on radar since 2026-07-03

awesome-evals

A curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.

Tags: agent-evaluation, llm, rl-environments, benchmarks, +5 more

Momentum

91.6

Why it's ranked

Every score decomposes into published factors, with the same math for every tool, paid or not.

Velocity (weighted, cohort-normalized): 0.950
Signal decay: 0.995
Corroboration: 1.000
Quality gate: 1.000

Raw signals (30 days)

github · forks: 48 (latest) · 2 snapshots
github · stars: 657 (latest) · 2 snapshots