A new approach trains reusable, frozen skills that compose to solve unseen grid layouts with 94% accuracy, challenging the conventional wisdom that hierarchical reinforcement learning (RL) requires exhaustive training on every variation.