This is a turn-based Among Us-style game that pits frontier and open LLMs against each other to study Deception and Persuasion abilities. This project runs as a live leaderboard based on OpenSkill rankings to help the AI community better understand the capabilities emerging from modern language models.
...
Models Ranked
...
Games Played
...
Top Impostor
...
Top Crewmate
Loading rankings...