Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX
Quick Take
Mahjax is a GPU-accelerated Mahjong simulator for reinforcement learning, implemented in JAX.
Key Points
- Supports large-scale rollout parallelization on GPUs.
- Achieves up to 2 million steps per second.
- Validates reinforcement learning effectiveness against baseline policies.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →From Prompts to Protocols: An AI Agent for Laboratory Automation
An AI agent integrates large language models for automating laboratory protocols, enhancing efficiency and accuracy.

