Rank #945 · on radar since 2026-07-03

agent-arena

Evidence-first multi-agent debate skill: get a second opinion by pitting Codex × Claude Code (or GLM/DeepSeek/Qwen) to independently review, red-team & judge high-stakes code and architecture decisions.

Tags: llm-as-judge, openai-codex, agent-skill, opencode, +15 more

Momentum

42.2

24h: – · 7d: –

Why it's ranked

Every score decomposes into published factors, using the same math for every tool, paid or not.

Velocity (weighted, cohort-normalized)	0.438
Signal decay	0.995
Corroboration	1.000
Quality gate	1.000

Raw signals (30 days)

github · forks: 5 (latest) · 2 snapshots

github · stars: 26 (latest) · 2 snapshots