yyyhy/nash-arena
by Various
Chess and Card Game Arena For LLM
MCP
yyyhy/nash-arena
Added 1 June 2026
Overview
Provides an arena for evaluating large language models through chess and card games. Built in Python, it allows developers to test and compare LLM strategic reasoning in turn-based game scenarios.
Best for
Best for
Developers researching LLM capabilities in game-based strategic reasoning
Use cases
- Benchmarking LLM decision-making in chess and card games
- Comparing performance of different language models on structured game tasks
- Testing strategic reasoning and planning abilities of LLMs
Notes
Provides an arena for evaluating large language models through chess and card games. Built in Python, it allows developers to test and compare LLM strategic reasoning in turn-based game scenarios.
2 stars on GitHub. Last updated 2026-03-20. Licensed MIT.
Use cases
- Benchmarking LLM decision-making in chess and card games
- Comparing performance of different language models on structured game tasks
- Testing strategic reasoning and planning abilities of LLMs
Pros
- Open source and freely available
- Focused specifically on LLM game-playing evaluation
- Simple Python codebase easy to modify
Cons
- Very low community adoption (2 stars)
- Unclear documentation and setup instructions
- Limited to chess and card games only
Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.
Pros
- Open source and freely available
- Focused specifically on LLM game-playing evaluation
- Simple Python codebase easy to modify
Cons
- Very low community adoption (2 stars)
- Unclear documentation and setup instructions
- Limited to chess and card games only
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
Cline
Cline
Open-source autonomous coding agent that lives inside VS Code. BYO model key, watch it work.
Continue
Continue.dev
Open-source AI code assistant for VS Code and JetBrains. Customisable, BYO model, built for enterprise.
Aider
Paul Gauthier
Terminal-first AI pair programmer. Edits files in your repo, commits with sensible messages, runs your tests.