Build an RL post-training pipeline and fine-tune a base model to play FrogsGame. The fine-tuned model is evaluated on 500 hidden boards across 4 difficulty tiers (easy, medium, hard, expert; 125 boards each). Baseline of the base model before RL post-training: 19% overall (easy 45%, medium 22%, hard 8%, expert 2%).
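
As a sanity check on how the overall figure relates to the per-tier figures, here is a minimal sketch. It assumes the overall score is the fraction of all 500 boards scored as successes, which with equal tier sizes reduces to the plain mean of the four tier scores. The names `TIER_SIZES` and `overall_score` are hypothetical and not part of any provided harness.

```python
# Tier sizes from the brief: 125 boards per tier, 500 boards total.
TIER_SIZES = {"easy": 125, "medium": 125, "hard": 125, "expert": 125}


def overall_score(tier_scores, tier_sizes=TIER_SIZES):
    """Board-weighted mean of per-tier scores (assumed aggregation rule)."""
    total_boards = sum(tier_sizes.values())
    weighted = sum(tier_scores[tier] * n for tier, n in tier_sizes.items())
    return weighted / total_boards


# Baseline per-tier figures quoted in the brief, as fractions.
baseline = {"easy": 0.45, "medium": 0.22, "hard": 0.08, "expert": 0.02}

print(f"baseline overall: {overall_score(baseline):.4f}")
```

Under this assumption the per-tier figures average to about 19.25%, consistent with the stated 19% overall. The quoted tier percentages appear to be rounded, since 45% of 125 boards is not a whole number. The same function can be reused to compare post-training results against the baseline tier by tier.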