Playing Atari games with Hierarchical Reinforcement Learning

We experimentally demonstrate that decomposing the learning task into sub-systems and constructing options for temporally extended planning can dramatically accelerate the learning process. Download paper here
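
To make the notion of an "option" concrete, here is a minimal sketch of the general options framework: an option bundles an initiation set, an intra-option policy, and a termination condition, and the agent learns values over options with an SMDP-style Q-learning update. This is not the paper's implementation. The toy corridor environment and every name in it (`Option`, `step`, `run_option`, `smdp_q_update`, `N_STATES`) are assumptions chosen for illustration, and a corridor stands in for an Atari game only to keep the example short.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

# Hypothetical toy task: a corridor of states 0..9 with reward 1.0 at the right end.
N_STATES = 10
GOAL = N_STATES - 1


@dataclass
class Option:
    """A temporally extended action (illustrative structure, not the paper's)."""
    name: str
    can_start: Callable[[int], bool]    # initiation set: states where the option may begin
    policy: Callable[[int], int]        # intra-option policy: state -> primitive action
    should_stop: Callable[[int], bool]  # termination condition


def step(state: int, action: int) -> Tuple[int, float, bool]:
    """Primitive transition; action is -1 (left) or +1 (right)."""
    nxt = min(max(state + action, 0), GOAL)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done


def run_option(state: int, option: Option, gamma: float = 0.9,
               max_steps: int = 100) -> Tuple[int, float, int, bool]:
    """Follow the option until it terminates.

    Returns (final state, discounted reward accumulated, steps taken, episode done).
    """
    total, discount, k, done = 0.0, 1.0, 0, False
    while k < max_steps:
        state, reward, done = step(state, option.policy(state))
        total += discount * reward
        discount *= gamma
        k += 1
        if done or option.should_stop(state):
            break
    return state, total, k, done


def smdp_q_update(q: Dict[int, Dict[str, float]], s: int, o: str, reward: float,
                  s_next: int, k: int, done: bool,
                  alpha: float = 0.5, gamma: float = 0.9) -> None:
    """SMDP Q-learning: bootstrap from s_next, discounted by the option's duration k."""
    best_next = 0.0 if done else max(q[s_next].values())
    target = reward + (gamma ** k) * best_next
    q[s][o] += alpha * (target - q[s][o])


# Two hand-written options: walk to either end of the corridor.
go_right = Option("right", lambda s: s < GOAL, lambda s: +1, lambda s: s == GOAL)
go_left = Option("left", lambda s: s > 0, lambda s: -1, lambda s: s == 0)

# One option-level decision replaces nine primitive decisions.
q = {s: {"right": 0.0, "left": 0.0} for s in range(N_STATES)}
s_next, ret, k, done = run_option(0, go_right)
smdp_q_update(q, 0, "right", ret, s_next, k, done)
```

In this sketch a single update at the option level propagates the goal reward back to the start state, whereas one-step Q-learning over primitive actions would need many episodes to do the same. That is the intuition behind the acceleration claimed above, although the toy corridor says nothing about the actual speed-up on Atari.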

Agent (the orange player in the video) trained with hierarchical reinforcement learning, playing Tennis