Q-learning & SARSA grid worlds · Zeq OS · v1.287.5
Interactive reinforcement learning environment with configurable grid worlds, walls, and rewards. Train Q-learning or SARSA agents with epsilon-greedy exploration and watch the policy converge in real time. The grid world is rendered in SVG with policy arrows and a Q-table heatmap.
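For readers who want to see how the two agents differ, here is a minimal Python sketch of tabular Q-learning and SARSA with epsilon-greedy exploration on a small grid world. It is not the app's own code: the 4x4 layout, the reward values, the hyperparameters, and the names `GRID`, `step`, `epsilon_greedy`, `train`, and `greedy_path` are all assumptions chosen for illustration.

```python
import random

# Hypothetical 4x4 layout: S = start, G = goal, # = wall, . = free cell.
GRID = ["S...",
        ".#..",
        ".#..",
        "...G"]
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
STEP_REWARD, GOAL_REWARD = -0.04, 1.0          # assumed reward values


def step(state, action):
    """Apply an action; moves into a wall or off the grid leave the agent in place."""
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    if not (0 <= nr < len(GRID) and 0 <= nc < len(GRID[0])) or GRID[nr][nc] == "#":
        nr, nc = r, c
    done = GRID[nr][nc] == "G"
    return (nr, nc), (GOAL_REWARD if done else STEP_REWARD), done


def epsilon_greedy(q, state, epsilon, rng):
    """With probability epsilon pick a random action, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return values.index(max(values))


def train(algorithm, episodes=2000, alpha=0.1, gamma=0.95, epsilon=0.1, seed=0):
    """Return a Q-table trained with "q_learning" or "sarsa" (assumed hyperparameters)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS)
         for r in range(len(GRID)) for c in range(len(GRID[0]))}
    for _ in range(episodes):
        state = (0, 0)
        action = epsilon_greedy(q, state, epsilon, rng)
        for _ in range(200):  # cap on steps per episode
            next_state, reward, done = step(state, action)
            # SARSA needs the next action before the update; Q-learning
            # tolerates this ordering because it is off-policy.
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            if done:
                target = reward
            elif algorithm == "q_learning":
                # Off-policy: bootstrap from the best action in the next state.
                target = reward + gamma * max(q[next_state])
            else:
                # SARSA, on-policy: bootstrap from the action actually taken next.
                target = reward + gamma * q[next_state][next_action]
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the greedy policy from the start cell and return the visited cells."""
    state, path = (0, 0), [(0, 0)]
    for _ in range(max_steps):
        state, _, done = step(state, q[state].index(max(q[state])))
        path.append(state)
        if done:
            break
    return path
```

Calling `greedy_path(train("q_learning"))` or `greedy_path(train("sarsa"))` gives the cells the learned policy visits from the start. The only difference between the two agents is the bootstrap term in `target`, which is why a single `train` function can cover both.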