Live Demo — No Account Required

MULTI-AGENT DECISION VETTING

Watch 5 AI models from different providers debate a real decision. Each agent proposes, critiques, and votes independently. Consensus is measured, not assumed.

Decision Question

Should our startup adopt AI-powered code review as a mandatory step in our CI/CD pipeline?

✦claude-sonnet

◆gpt-4o

◈gemini-pro

▲mistral-large

★grok-2

0/10 events | Round 1/2

✦Claude SonnetPROPOSAL

85% confidence

Yes, adopt it as mandatory. AI code review catches security vulnerabilities that human reviewers miss 34% of the time. The key is treating it as a complement, not a replacement — flag issues for human judgment, not auto-reject.

◆GPT-4oPROPOSAL

78% confidence

Mandatory is too strong. AI review tools have 15-25% false positive rates on complex codebases. Making it mandatory creates alert fatigue and slows velocity. Better as advisory with opt-in enforcement for security-critical paths.

◈Gemini ProCRITIQUE

72% confidence

Both proposals miss the cost dimension. AI review at scale costs $0.02-0.15 per PR. For a team doing 200 PRs/week, that is $200-1,500/month. The ROI depends on your current bug escape rate — if it is below 2%, the investment may not justify mandatory adoption.

▲Mistral LargeCRITIQUE

76% confidence

I challenge the 34% claim. That statistic comes from controlled benchmarks, not production codebases. In practice, AI reviewers excel at pattern-matching (SQL injection, XSS) but struggle with business logic flaws and architectural issues — the bugs that actually cause outages.

★Grok 2PROPOSAL

88% confidence

The framing is wrong. The question is not mandatory vs optional — it is which paths are mandatory. Security-critical code (auth, payments, data access) should require AI review. Feature code should get advisory review. This reduces false-positive fatigue while protecting high-risk surfaces.

✦Claude SonnetVOTE

84% confidence

I revise my position. Path-based mandatory review is the pragmatic middle ground.

✓ SUPPORT

◆GPT-4oVOTE

81% confidence

Agreed. Tiered enforcement addresses my velocity concern while maintaining security coverage.

✓ SUPPORT

◈Gemini ProVOTE

79% confidence

Support, with the caveat that ROI should be measured after 90 days to validate the cost-benefit.

✓ SUPPORT

▲Mistral LargeVOTE

74% confidence

Conditional support. The path classification must be reviewed quarterly as the codebase evolves.

✓ SUPPORT

✓Consensus EngineCONSENSUS

82% confidence

Consensus reached (4/5 support, 1 conditional). Adopt AI code review as advisory layer with mandatory enforcement on security-critical paths. Measure false-positive rate and ROI at 90 days.

\u2726 Multi-Model

5 different AI models from 5 providers. No single point of failure or bias.

\u2694 Adversarial

Agents critique each other. Weak arguments get challenged. Consensus is earned.

$ Auditable

Every decision gets a cryptographic receipt. Full provenance trail. Audit-ready.

Run debates on your own decisions

Free tier includes 10 debates/month. No credit card required. First verdict in under 5 minutes.

GET STARTED FREE LEARN MORE