R2D-RL: A RoboCup 2D Soccer Environment for Multi-Agent Reinforcement Learning
Quick Answer
R2D-RL is a new reinforcement learning environment that bridges RoboCup 2D Soccer Simulation with Python-based MARL workflows, enabling advanced multi-agent training.
Quick Take
R2D-RL is a new reinforcement learning environment that bridges RoboCup 2D Soccer Simulation with Python-based MARL workflows, enabling advanced multi-agent training. It features configurable opponents, hybrid action spaces, and supports parallel execution, providing benchmarks for 11-vs-11 scenarios and front-goal challenges.
Key Points
- R2D-RL connects RCSS2D and HELIOS clients via shared-memory communication.
- Supports full-field and scenario-based training with configurable opponents.
- Includes hybrid parameterized action spaces and action masks for enhanced control.
- Offers expected possession value (EPV)-based reward shaping for improved learning.
- Provides benchmarks for 11-vs-11 matches and front-goal scenarios.
Paper Resources
Article Excerpt
From source RSS / original summaryarXiv:2606. 18786v1 Announce Type: new Abstract: Robot soccer is a challenging testbed for multi-agent reinforcement learning because it combines partial observability, cooperative and adversarial interaction, sparse rewards, and long-horizon tactical behavior. RoboCup 2D Soccer Simulation (RCSS2D) provides a mature robot-soccer platform, but its competition-oriented server-client architecture is difficult to use directly with modern Python-based MARL workflows.
We introduce R2D-RL, a reinforcement learning environment that connects RCSS2D and HELIOS-based player clients to a Python MARL interface through shared-memory communication and cycle-level synchronization. R2D-RL supports full-field and scenario-based training with configurable opponents, Base discrete and Hybrid parameterized action spaces, action masks, expected possession value (EPV)-based reward shaping, and parallel execution.
We provide front-goal scenarios and an 11-vs-11 full-field benchmark, together with baseline results.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →Arbor: Tree Search as a Cognition Layer for Autonomous Agents
Arbor introduces a multi-agent framework utilizing structured tree search for optimizing LLM inference, achieving up to 193% throughput-latency improvement compared to vendor-optimized systems. It employs an Orchestrator and Critic agent for stability and coordination, demonstrating hardware-agnostic performance with minimal variance.