Rank #945 · on radar since 2026-07-03
agent-arena
Evidence-first multi-agent debate skill: get a second opinion by pitting Codex × Claude Code (or GLM/DeepSeek/Qwen) to independently review, red-team & judge high-stakes code and architecture decisions.
Visit homepage ↗llm-as-judgeopenai-codexagent-skillopencode+15
Momentum
42.2
24h–7d–
Why it's ranked
Every score decomposes into published factors — the same math for every tool, paid or not. Read the methodology →
| Velocity (weighted, cohort-normalized) | 0.438 |
| Signal decay | 0.995 |
| Corroboration | 1.000 |
| Quality gate | 1.000 |
Raw signals (30 days)
github · forks5 latest · 2 snapshots
github · stars26 latest · 2 snapshots