Rank #92 · on radar since 2026-07-03
awesome-evals
A curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.
Visit homepage ↗agent-evaluationllmrl-environmentsbenchmarks+5
Momentum
91.6
24h–7d–
Why it's ranked
Every score decomposes into published factors — the same math for every tool, paid or not. Read the methodology →
| Velocity (weighted, cohort-normalized) | 0.950 |
| Signal decay | 0.995 |
| Corroboration | 1.000 |
| Quality gate | 1.000 |
Raw signals (30 days)
github · forks48 latest · 2 snapshots
github · stars657 latest · 2 snapshots