Slacker News

Comment by mohsen1

8 hours ago

I made Mafia Arena to measure how well each LLM plays Mafia/Werewolf.

https://mafia-arena.com

It's a good benchmark for how well AIs can lie.

1 comment


littlestymaar  2 hours ago

Something is off with the numbers. GPT-5.2 cannot have a 75% win rate with one win over GLM-4.7 and a 2/10 record against Gemini 3 Flash.
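
For anyone who wants to sanity-check the arithmetic, here is a rough sketch of the consistency check. It assumes the overall win rate is simply total wins divided by total games across head-to-head records, which may not be how Mafia Arena actually scores things. The GLM-4.7 game count is not given, so the sketch assumes the most favorable case (one game, one win). The function names are made up for illustration.

```python
from fractions import Fraction

def aggregate_win_rate(records):
    """Overall win rate from {opponent: (wins, games)} as an exact fraction."""
    total_wins = sum(wins for wins, _ in records.values())
    total_games = sum(games for _, games in records.values())
    if total_games == 0:
        raise ValueError("no games recorded")
    return Fraction(total_wins, total_games)

def extra_wins_needed(records, target):
    """Smallest number of additional games, all won, needed to reach `target`."""
    total_wins = sum(wins for wins, _ in records.values())
    total_games = sum(games for _, games in records.values())
    extra = 0
    while Fraction(total_wins + extra, total_games + extra) < target:
        extra += 1
    return extra

# Records quoted in the comment above. The GLM-4.7 game count is an
# ASSUMPTION (best case: the single win was the only game played).
records = {
    "GLM-4.7": (1, 1),
    "Gemini 3 Flash": (2, 10),
}

rate = aggregate_win_rate(records)
print(f"win rate from these two records: {float(rate):.0%}")
print("unbeaten extra games needed for 75%:",
      extra_wins_needed(records, Fraction(3, 4)))
```

Under those assumptions, the two quoted records alone sit well below 75%, and reaching 75% would take a long unbeaten run against other opponents. If the site computes win rate per team or per role rather than per opponent, this check would not apply as written.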
